Gene Arth_1217 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1217 
SymbolglmU 
ID4446280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1326329 
End bp1327855 
Gene Length1527 bp 
Protein Length508 aa 
Translation table11 
GC content67% 
IMG OID639689025 
Productbifunctional N-acetylglucosamine-1-phosphate uridyltransferase/glucosamine-1-phosphate acetyltransferase 
Protein accessionYP_830711 
Protein GI116669778 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGCCCCG AGACAACCGG CCCCGCTGCC GTCATCGTCC TAGCTGCAGG TGCCGGCACC 
CGGATGAAGT CGCGTACCCC CAAGATCCTC CATGAAATCG GCGGCCTGTC GCTTGTGGGC
CATGCCCTGC GTGCCGCCCG CACCATCGAT CCCCGGCAGC TGGCAATTGT CGTCCGGCAC
GAACGCGACC TCGTGGCAGC CCACGTGTCC GGGCTCGATC CGGCAGCCCT GATAGTGGAC
CAGGACGAAG TGCCCGGCAC CGGCCGCGCG GTGCAGGCGG CACTGGAGGC CCTCGACGCC
AAAGGTCCGC TGAGCGGCAC AGTTGTGGTT ACCTACGGAG ACGTTCCGTT GCTCTCGGCG
GAACTGCTTG CCGAACTGGT GCTGATCCAC GAACACGAAC GCAACGCAGT CACCGTCCTG
ACAGCAGTCC TGGACGATGC CGCAGGCTAT GGCCGTATCC TCCGTGCGGC GGACGGCACC
GTTACGGGCA TCCGCGAGCA CAAGGACGCC TCGGACGAGG AACGTGAAAT TCGGGAAATC
AACTCCGGGA TCTACGCCTT CGACGCCGCC ATCCTTCGCG ATGCACTGGC GCACGTCACC
ACGGATAACG CGCAGGGCGA GATGTACCTC ACCGATGTCC TGGGACTTGC GCGCACGGCC
GGCGGACGCG TAGCCGCAGT GGTTACCAAC GACCGCTGGC AGGTGGAAGG CGCCAACGAC
CGTGTGCAGC TCTCCGCGCT GGGAGCCGAA CACAACCGCC GAACTGTCGA AGCCTGGATG
CGCTCCGGCG TTACGGTGGT GGACCCAGCC ACCACGTGGA TCGACTCCAC GGTGAACCTG
GCAGAGGACG TCCGGATCCT GCCCAACACC CAGCTGCACG GTTCCACCAC GGTGGCGAGG
GACGCCGTCG TCGGCCCTGA CACGACGCTG ACCGACGTCA CCATTGGCGA AGGCGCCAAA
GTGACCCGGA CGCACGGCTC CGGCGCCACG ATCGGCGCGA ACGCCAGCGT CGGCCCGTTT
ACCTACCTCC GCCCGGGAAC AGTGCTGGGG GAGACGGGCA AGATCGGCGC CTTCTACGAG
ACGAAGAACG TGAAGATCGG CCGCGGCTCG AAGCTGTCGC ACCTCGGCTA CGCCGGCGAT
GCGGAGATCG GCGAGGACAC CAATATCGGC TGCGGCAACA TCACCGCCAA CTACGACGGC
GAGAACAAGC ACCGCACCGT GATCGGCTCG GGCGTGCGCA CGGGTTCCAA CACGGTCTTC
GTGGCTCCGG TCCAGGTGGG CGACGGCGCC TACAGCGGGG CCGGTGCCGT GATCCGCCAG
GACGTGCCCG CCGGCGCGCT GGCAATCTCG GTGGCCAAGC AGCGGAACGC CGAAGGCTGG
GTCATCTCGC ACCGGCCCGG CTCCATCTCA GCAGTACTGG CCGAAGCCGC GGGGGCTACG
GCGCCGGCCA GCGAAACAGC AGCGGCCCAG CCGCCCCAGG ATTCCTCAAG CACCCCGGCA
CCTACAGAAG AGGGCAAGCA ATCATGA
 
Protein sequence
MSPETTGPAA VIVLAAGAGT RMKSRTPKIL HEIGGLSLVG HALRAARTID PRQLAIVVRH 
ERDLVAAHVS GLDPAALIVD QDEVPGTGRA VQAALEALDA KGPLSGTVVV TYGDVPLLSA
ELLAELVLIH EHERNAVTVL TAVLDDAAGY GRILRAADGT VTGIREHKDA SDEEREIREI
NSGIYAFDAA ILRDALAHVT TDNAQGEMYL TDVLGLARTA GGRVAAVVTN DRWQVEGAND
RVQLSALGAE HNRRTVEAWM RSGVTVVDPA TTWIDSTVNL AEDVRILPNT QLHGSTTVAR
DAVVGPDTTL TDVTIGEGAK VTRTHGSGAT IGANASVGPF TYLRPGTVLG ETGKIGAFYE
TKNVKIGRGS KLSHLGYAGD AEIGEDTNIG CGNITANYDG ENKHRTVIGS GVRTGSNTVF
VAPVQVGDGA YSGAGAVIRQ DVPAGALAIS VAKQRNAEGW VISHRPGSIS AVLAEAAGAT
APASETAAAQ PPQDSSSTPA PTEEGKQS