Gene Daro_0220 details


Gene Information

Locus tag: Daro_0220
Symbol:
ID: 3569612
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Dechloromonas aromatica RCB
Kingdom: Bacteria
Replicon accession: NC_007298
Strand:
Start bp: 240702
End bp: 242060
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 60%
IMG OID: 637678658
Product: glucosamine-1-phosphate N-acetyltransferase / UDP-N-acetylglucosamine pyrophosphorylase
Protein accession: YP_283449
Protein GI: 71905862
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains)
TIGRFAM ID: [TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase
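
The start, end, and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the database record; it assumes the coordinates are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 240702
end_bp = 242060

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1359 452
```

Under those assumptions the result agrees with the listed Gene Length (1359 bp) and Protein Length (452 aa).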


Plasmid Coverage information

Num covering plasmid clones: 48
Plasmid unclonability p-value: 0.600609
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATCG TTATTCTCGC TGCCGGTCAA GGCAAGCGCA TGCATTCCAA CCTCCCCAAA 
GTGTTGCATC CGATCGCTGG CAAGCCGCTG GCCCAGCATG TGATCGATAC GGCGCGCCAG
TTGTCACCGG AAAAGCTGAT TGTGGTCTAT GGTCATGGCG GCGAAGTGGT TCGCTCCACG
CTGGCTGCCC CTGATCTTTC CTGGGCCGAG CAGGCACAGC AACTGGGCAC CGGCCATGCG
GTGGCGCAGG CCTTGTCCGA ATTGGGTAGT GCCGCCCAGA CGCTGGTACT TTACGGCGAT
GTGCCGTTGA CCACGGTGGC GACACTGAAA CGTCTGCTGC AGGCAGGCAA GGATGCCTTG
TCGGTGCTGA CCGTCGATCT TGCCAATCCG AGCGGCTATG GCCGTATCGT GCGCGATGGC
GCCGGCAACA TGATCAGCAT CGTCGAGGAA AAGGATGCGA GTGCCGAGCA GAAGGCGATT
CGAGAAGTGA ACACCGGGAT CATGGCCGTG CCGACGGCCC GTCTCGCCGA CTGGTTGGGC
AAGTTGAAGA ATGACAATGC GCAGGGCGAG TATTACCTGA CCGACATCAT CGCGCTGGCG
GTGGCCGAGG GCATGCCTGT GCGCACGGCG CAGCCGGAGG GCGAATGGGA AGTGCTCGGC
GTCAATAGCA AGGTCCAGTT GGCCGAACTG GAGCGCCAGC ATCAGCTCAA TCTGGCCGGT
GAGTTACTGG TCGCTGGCGT CAGACTGGCC GATCCGGCCC GTATCGATAT CCGCGGCGAA
CTGACGCACG GTCGCGATGT GGCGATCGAT GTCGGTTGCG TCTTCGAAGG CAAGGTTGAA
CTGGCTGACG CTGTCGAGGT CGGTCCTTAC TGCGTGCTGA AGAACGTCAA GGTTGGCGCC
GGAACGAGGA TTGCGGCGTT TTGCCATTTC GAGGATGCGG TCATTGGTCC GGATGGCGTG
CTCGGTCCTT ATGCCCGCCT GCGGCCGGGT ACCGAACTTG GCCCGGAAGT GCACATCGGC
AACTTCGTCG AGGTCAAGAA GAGCATCATC GGTGCCCAGT CCAAGGCGAA CCATCTGGCC
TATATCGGCG ATGCCGAGAT CGGTCAGCGT GTCAATGTTG GTGCCGGGAC CATTACCTGT
AATTACGATG GGGCCAACAA GTTCAAGACC GTTATCGAAG ACGATGTCTT CATTGGTTCC
GATACCCAAC TGGTCGCTCC TGTTACTGTG GGTCGCGGGG CAACGCTGGG GGCTGGCACG
ACGCTGACCA AGGATGCCCC GCCCGATGCC TTGACCTTCT CGCGCCCCAG GCAGATGACA
CTGCCGGGTT GGGAGCGTCC GAAAAAGGTG AAGAAATAA
 
Protein sequence
MNIVILAAGQ GKRMHSNLPK VLHPIAGKPL AQHVIDTARQ LSPEKLIVVY GHGGEVVRST 
LAAPDLSWAE QAQQLGTGHA VAQALSELGS AAQTLVLYGD VPLTTVATLK RLLQAGKDAL
SVLTVDLANP SGYGRIVRDG AGNMISIVEE KDASAEQKAI REVNTGIMAV PTARLADWLG
KLKNDNAQGE YYLTDIIALA VAEGMPVRTA QPEGEWEVLG VNSKVQLAEL ERQHQLNLAG
ELLVAGVRLA DPARIDIRGE LTHGRDVAID VGCVFEGKVE LADAVEVGPY CVLKNVKVGA
GTRIAAFCHF EDAVIGPDGV LGPYARLRPG TELGPEVHIG NFVEVKKSII GAQSKANHLA
YIGDAEIGQR VNVGAGTITC NYDGANKFKT VIEDDVFIGS DTQLVAPVTV GRGATLGAGT
TLTKDAPPDA LTFSRPRQMT LPGWERPKKV KK