Gene Noca_4050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_4050 
Symbol 
ID4596564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp4273477 
End bp4274853 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content74% 
IMG OID639778656 
Productglycosyl transferase, group 1 
Protein accessionYP_925234 
Protein GI119718269 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID[TIGR03449] UDP-N-acetylglucosamine: 1L-myo-inositol-1-phosphate 1-alpha-D-N-acetylglucosaminyltransferase 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCGCTG ACCGGCCGGG GCACCGCTCG AGGGGAATCA ATCCGGGCCC CGGCATGTTC 
ACCTTGGTTG GACCCGACGA GCGCGACGAC CCCGAGGTGG CAGTGGACCC GATCCGGCGT
GTGGCGATGA TCAGCCTGCA CACCTCGCCC CTCGACCAGC CCGGGACGGG TGATGCGGGT
GGGATGAACG TCTACGTCAT CGAGCTGTCC AAGCGACTGG CCGCCCAGGG CATCGCCGTC
GACATCTTCA CCCGGGCCAC CACCTCGGCG GTCGAGCCGC TGGTCGAGGC GTACGACGGG
GTGCAGGTCC GGCACATCCA CGCCGGGCCG TTCGAGGGGC TCACCAAGGC CGAGCTGCCC
GGCCAGCTCT GCGTCTTCGC CCGCGAGGTG CTGCGCGCCG AGGCGGCCCA GCCGGTCGGG
CACTACGACG TCGTGCACTC CCACTACTGG CTCTCCGGGC AGGTCGGCGC GCTGGCCCGC
GACCGGTGGG GTGTGCCGCT GGTGCACTCC ATGCACACGA TGGCGAAGGT CAAGAACGAC
GCGCTCGCCG AGGGCGACAC CCCCGAGCCC GCGGCCCGCA TCATCGGCGA GGAGCAGGTC
GTCGAGGCCG CCGACATGCT GGTCGCCAAC ACCGACATCG AGGCCAAGCA GCTGGTCAAC
ATGTACGACG CCGACCCCAG CCGGGTCGAG GTCGTCCACC CCGGCGTCGA CCTCGGGGTG
TTCCGGCCCC AGGACCGCTC GACCGCCCGG GCCCGGCTCG GCCTGCCGGA GGACGCCGCG
GTGCTGCTGT TCGCCGGCCG GATCCAGCCG CTCAAGGCCC CGGACGTGCT GCTGCGCGCC
GTCGCCGAGC TGCTCGCCCA GACCCCGGAG CTGCGCTCAC GGCTGGTCGT CCCGATCGTC
GGCGGGCCCT CGGGCTCCGG GCTCGAGCAC CCCGAGTCGC TGGCCCAGCT GGCGAGCGAG
CTCGGGCTCG ACGGCGCCGG CGGCACCGGC CCCGTGGTGC GCTTCGTGCC TCCGGTCTCC
CAGGAGGAGC TGGCCCGCTG GTGCGCGGCC GCGACCCTGG TCGCGGTGCC GTCGTACAAC
GAGTCCTTCG GGCTGGTCGC GGCCGAGGCC CAGGCCACCG GCACCCCGGT CGTCGCCGCC
GCGGTCGGCG GCCTCACGAC GGTCGTGCGC GACGGCCGCA GCGGCCTGCT CGTCGACACC
CACGACCCGC GGGACTGGGC CGACGCGCTG CGCCGCGTGG TCGAGAACGA CGCGTTCCGC
GACCGGTTGG CCGCGGGGGC GCTCGAGCAG GCGCGGCTGT TCTCCTGGGA GCACACCGCG
CGGCAGACCC TCGACGTCTA CCGGCGGGCG CGGGCCGAGA TCCGGGAGGC CGTGTGA
 
Protein sequence
MRADRPGHRS RGINPGPGMF TLVGPDERDD PEVAVDPIRR VAMISLHTSP LDQPGTGDAG 
GMNVYVIELS KRLAAQGIAV DIFTRATTSA VEPLVEAYDG VQVRHIHAGP FEGLTKAELP
GQLCVFAREV LRAEAAQPVG HYDVVHSHYW LSGQVGALAR DRWGVPLVHS MHTMAKVKND
ALAEGDTPEP AARIIGEEQV VEAADMLVAN TDIEAKQLVN MYDADPSRVE VVHPGVDLGV
FRPQDRSTAR ARLGLPEDAA VLLFAGRIQP LKAPDVLLRA VAELLAQTPE LRSRLVVPIV
GGPSGSGLEH PESLAQLASE LGLDGAGGTG PVVRFVPPVS QEELARWCAA ATLVAVPSYN
ESFGLVAAEA QATGTPVVAA AVGGLTTVVR DGRSGLLVDT HDPRDWADAL RRVVENDAFR
DRLAAGALEQ ARLFSWEHTA RQTLDVYRRA RAEIREAV