Gene Moth_2387 details


Gene Information

Locus tag: Moth_2387
Symbol:
ID: 3832026
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2509738
End bp: 2510913
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 62%
IMG OID: 637830306
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: YP_431212
Protein GI: 83591203
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
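
The gene and protein lengths in this record follow from the start and end coordinates. A minimal sketch of that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon (the variable names are illustrative only):

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 2509738
end_bp = 2510913

# Inclusive span, so add 1 to the difference.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be a stop.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1176 0 391
```

Both values agree with the Gene Length and Protein Length fields in the record.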


Plasmid Coverage information

Num covering plasmid clones: 51
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.0541306
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGCCTT ATAAGATACT CACCGTCTTC GGCACCAGGC CCGAGGCTAT TAAGATGGCC 
CCGGTGGTCA AAGAACTCAA TCTCCACCCC GAGGAGTTTA CCTGTCTGGT GGCAGTCACG
GCCCAGCATC GAGAGATGCT CGACCAGGTC CTGCGCCTCT TCCATATCAA ACCCGATTAC
GACCTGGATA TTATGCGGCC ACGCCAGACC CTGGAGGAGA TTACCACCAG GGCCCTGACC
GGCCTGGCCG GGGTTCTCAA AGAGGCCCGC CCGGACCTGG TCCTGGTCCA CGGCGACACC
ACCACCACCT TTGTCGCCGC CCTGGCGGCC TTTTACCAGC AGATACCCGT CGGCCATGTC
GAGGCGGGCC TAAGGACCGG CGACCGCTAT GCCCCCTTTC CCGAGGAAAT GAATCGCCGC
CTGGCCGGGG TACTGACCGA CATCCACTTC GCGCCCACAG CCAAGGCCCG GGACAATCTC
CTCCGCGAAG GCATAGCTCC GGAGCATATC TATGTCACCG GTAACACGGT CATCGACGCC
TTAAAAGCCA CCATCCGGGA AGAATACCAG TTTGGAGACC ACGGCCTGGC GGGGCTGGAC
TTACGGGAAA AGCGGGTCAT CCTGGTGACG GCCCACCGGC GGGAGAACTG GGGCGAACCC
CTTAAGGAGA TCTTTACGGC TCTGCGGGAT TTAATCCGGC GCCATCCCGA CACAGCCCTG
ATTTTTCCCG TTCACTATAA CCCGCGGGTC CGGCAACTGG CCCGGGAGGT CCTCGGCGGC
CAGGAGCGGG TTTATTTAAT CGAACCCCTT GATTACGAGC CCTTTGTCAA CCTCATGAAC
CGAGCCTATC TGGTCCTGAC GGATTCCGGC GGCCTGCAGG AAGAAGCCCC GGCCCTGGGC
AAGCCCGTGC TGGTCCTGCG GGAGGTTACG GAACGGCCGG AAGCCGTAGC CGCCGGCACC
GTCCGCCTGG TGGGCACCGC CTACCGTGAC ATCCTGGCGG CGGCGGAGGA ACTCCTGACT
GACAGGCAGG CTTACCTGCA AATGGCCCAC GCCGTCAACC CTTATGGTGA CGGCCAGGCC
TCCCGGCGCA TTCGCAGCGC CCTCCGCCAT TACTTCGGAA TGACTGTTGC CCGGCCCCAG
GAATTTCAAC CCTTGGGGGC AACCGGACAA AAATAA
 
Protein sequence
MPPYKILTVF GTRPEAIKMA PVVKELNLHP EEFTCLVAVT AQHREMLDQV LRLFHIKPDY 
DLDIMRPRQT LEEITTRALT GLAGVLKEAR PDLVLVHGDT TTTFVAALAA FYQQIPVGHV
EAGLRTGDRY APFPEEMNRR LAGVLTDIHF APTAKARDNL LREGIAPEHI YVTGNTVIDA
LKATIREEYQ FGDHGLAGLD LREKRVILVT AHRRENWGEP LKEIFTALRD LIRRHPDTAL
IFPVHYNPRV RQLAREVLGG QERVYLIEPL DYEPFVNLMN RAYLVLTDSG GLQEEAPALG
KPVLVLREVT ERPEAVAAGT VRLVGTAYRD ILAAAEELLT DRQAYLQMAH AVNPYGDGQA
SRRIRSALRH YFGMTVARPQ EFQPLGATGQ K
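
The protein sequence corresponds to the gene sequence translated codon by codon, and the GC content is the share of G and C bases in the gene sequence. The following is a rough sketch of both calculations, not the tool used to build this record. The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. The amino-acid string is the codon table in NCBI ordering (bases T, C, A, G); translation table 11 uses the same amino-acid assignments as the standard code, and the sketch ignores alternative start codons, which does not matter here because the gene starts with ATG.

```python
# Codon table in NCBI ordering: first, second, third base each cycle through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in s[i:i + 3])
        amino_acid = AMINO_ACIDS[16 * b1 + 4 * b2 + b3]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGCCGCCTT ATAAGATACT CACCGTCTTC"
print(translate(first_blocks))  # MPPYKILTVF
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way would allow a comparison against the protein sequence and the GC content field listed in this record.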