Gene Moth_0075 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0075 
Symbol 
ID3832684 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp75036 
End bp76418 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content59% 
IMG OID637828007 
ProductUDP-N-acetylglucosamine pyrophosphorylase / glucosamine-1-phosphate N-acetyltransferase 
Protein accessionYP_428957 
Protein GI83588948 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1207] N-acetylglucosamine-1-phosphate uridyltransferase (contains nucleotidyltransferase and I-patch acetyltransferase domains) 
TIGRFAM ID[TIGR01173] UDP-N-acetylglucosamine diphosphorylase/glucosamine-1-phosphate N-acetyltransferase 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAGATA CAGTCGCCGT CATCCTGGCC GCTGGCCAGG GGAAGCGGAT GCATTCCCGG 
CGACCAAAGG TATTACACCG TATTGCCGGT CGCTGCCTGG TGGAACATGT CCTGGCGGCA
GTCGGGGAGG CCGGTATAAA AAAGCAGATC GTCGTCATCG GCCACGGGGC GGAAGAAGTT
CGGGAGGCCC TGGGCCCGGA ATATACCTAT GTCCTACAGG AGCAGCAACT GGGCACCGGC
CACGCCCTGG CCCGGGCCCG GGAAGCGGCT GGCACGGCAG CAACCGTGCT GGTGCTCTGC
GGGGATACCC CCCTCCTCAG GCCGGCAACC TTGGCCCGGC TTCTAAAGGA GCACCGGGAC
AGGCAAGCTG CCGTTACTAT TTTAACAGCC GTACTCGATG ACCCTACCGG ATATGGCCGC
ATTATTCGCG ACGGCCAGGG TATGGTTGCG GGGATTGTCG AAGAGCGGGA CGCCAACCCG
GTAGAGAAGG CTATCAGGGA GATCAATACC GGCATTTATT GTTTCGAAGC TGCTTACCTC
TGGCCCTTTC TGGAACAATT ACAGCCGAAT AACGACCAGG GAGAGTACTA CCTTACCGAT
GTGGTGGGCC TGGCCTGCCG GGAAAACCTG CCCGTCCAGG CTGTAGCCGC CGGCGATCCG
GAAGAGATCC TGGGGGTTAA TGACCGGGCT CAGCTGGCCA AAGCCGGGGC TATTTTAAGA
CGCCGGATAA ACATGGGCCT GATGCAAGCC GGGGTAACCA TTATAGACCC GGAAACTACT
TATATCGATG CCACCGTCAG GATCGGCCCG GACACCATTA TTTATCCCGG CACCTTTTTA
GAAGGGAATA CTATTATTGA GGAGGGATGC TCCCTTGGCC CGGGAACTAC CCTCCGGGAC
TGCCAGGTAG GTAAGGGCAG CCATGTTATC CATACCGTGG CCCTGGAGAG TGAAATAGGC
CCTGGTTGCC AGGTAGGTCC CTTTGCCTAC CTGCGTCCCG GGACGGTCCT GGATGCCAGG
GTCAAGGTTG GGGATTTCGT AGAGATCAAG GCCTCCCGGA TTGGGGCTGG CTCCAAAGTA
CCTCACCTGA CCTACCTAGG TGATACCACG GTGGGTACCG GGGTGAATAT CGGCGCCGGG
ACCATTACCT GTAATTACGA CGGTGAGAAG AAGTGGCCGA CGGTCATTGA GGATGGAGCC
TTTATCGGTA GTAATAGCAA CCTGGTTGCT CCCGTCCGGG TAGGGGCCGG AGCCCTGGTG
GGGGCCGGGT CTACCATAAC GGAAGATGTA CCCGCCGGTT CTATGGCCCT GGCCCGGGGA
AGGCAGGTAA ACTTATCCGG CCGCAGTAAA AAAAGTAGCG AAAAAAGGCA GGAAAAAGGT
TAA
 
Protein sequence
MADTVAVILA AGQGKRMHSR RPKVLHRIAG RCLVEHVLAA VGEAGIKKQI VVIGHGAEEV 
REALGPEYTY VLQEQQLGTG HALARAREAA GTAATVLVLC GDTPLLRPAT LARLLKEHRD
RQAAVTILTA VLDDPTGYGR IIRDGQGMVA GIVEERDANP VEKAIREINT GIYCFEAAYL
WPFLEQLQPN NDQGEYYLTD VVGLACRENL PVQAVAAGDP EEILGVNDRA QLAKAGAILR
RRINMGLMQA GVTIIDPETT YIDATVRIGP DTIIYPGTFL EGNTIIEEGC SLGPGTTLRD
CQVGKGSHVI HTVALESEIG PGCQVGPFAY LRPGTVLDAR VKVGDFVEIK ASRIGAGSKV
PHLTYLGDTT VGTGVNIGAG TITCNYDGEK KWPTVIEDGA FIGSNSNLVA PVRVGAGALV
GAGSTITEDV PAGSMALARG RQVNLSGRSK KSSEKRQEKG