Gene Moth_0755 details


Gene Information

Locus tag: Moth_0755
Symbol:
ID: 3831468
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 791266
End bp: 792468
Gene Length: 1203 bp
Protein Length: 400 aa
Translation table: 11
GC content: 40%
IMG OID: 637828686
Product: UDP-N-acetylglucosamine 2-epimerase
Protein accession: YP_429616
Protein GI: 83589607
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0381] UDP-N-acetylglucosamine 2-epimerase
TIGRFAM ID: [TIGR00236] UDP-N-acetylglucosamine 2-epimerase
            [TIGR03568] UDP-N-acetyl-D-glucosamine 2-epimerase, UDP-hydrolysing
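
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon that is not counted in the protein length. The function name `cds_lengths` is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and one terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the stop codon encodes no residue
    return gene_len, protein_len


# Coordinates taken from the record for Moth_0755.
gene_len, protein_len = cds_lengths(791266, 792468)
print(gene_len, protein_len)  # 1203 400
```

Under these assumptions the result agrees with the listed gene length of 1203 bp and protein length of 400 aa.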


Plasmid Coverage information

Num covering plasmid clones: 45
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.585499
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAAAAA TCATGGCTTT TACAGCCACA AGGGCCGAAT ATGGTTTGCT TAAACCAATT 
TTAAGAGGTA TCAAACAGTC ACCCAAACTA GAATTGTCTT TGGTCGTAGC TGGAGAGCAT
TTAGTGCAGT GGAAAGGTAA TACCATAAAT GAAATTAAAA GAGATGGATT TAATATAAGT
GAAATAATTA ATAGTTCTTT GGCTTCAGAC AGAACAGAAG CCATTATTAA ATCAGTTGGA
CTGACTACGA TTTTACTTTC TGATACCATA AGAAAAGTGG AACCCGATTT TTTATTGTTA
TTAGGTGATA GATATGAGCT ATTTGCTCCT GCTATAGCAG CCCTGATCCA AAGAATTCCC
ATAGCTCATA TCGCCGGCGG GGAAACGACC TATGGAGTTT TAGATGAGCA AGTAAGACAT
GCAATTACTA AAATGGCCCA TCTTCATTTC CCAACCACTT GGGAATACGG CTGGCGTATA
CGACAAATGG GTGAGGAAGC ATGGCGCATT CATGTTGTAG GATCACCTGG TATAGAAAAT
ATTAATAATT GCGATTATAT GCTGCCGAAT GAGTTGGAAG AGGACTTCGG TATTAATCTT
GAAAAACCAA TTATCCTGGT CACTTATCAT CCTGAAACTC TAGAAAAAGG ATACCAGGCA
CCAAAGGACA TAGAACAATT GGTTAGGGCC TTAAAACATT TTTCCGGACA CCAGCAGGTA
ATTACCTACC CCGGGACTGA GGTGGGCTAT CAAAATATCA TCGAAGCATG GCAGCAGTAT
GCCGCTGATA GACCTAACGT AATTTTGAAG AAAAGCTTGG GTTCCAGGGG CTATTTAGGA
GTAATGCGCT TAGCGTCCGT AGTGGTAGGC AATTCCTCGA GTGGAATTAT TGAAGCACCG
AGTTTTCATG TTCCTACTGT AAACATCGGT GAACGACAAA AAGGAAGAAT TAGGGCCGAT
AGTATAATAG ATGTCCCTTG CGAAGAAAAA GAAATTGTTA AAGGGATAAA AAAAGCCCTG
GAAGATCGAG ATTTTCGGCA AAGTTTAAAA GATGTCTTCA ATCCTTATGA TCCTTATGGA
GACAGCAACG CTAGTGGCAG GATTGTTAAG GTGCTAGAAG AAATACCCAT TAACCGTAAG
CTTTTAGAAA AACGACTGGA CTTTCCTTCA CCTGAAGAAA AGAGGAATTA CCATGTTCAA
TAG
 
Protein sequence
MRKIMAFTAT RAEYGLLKPI LRGIKQSPKL ELSLVVAGEH LVQWKGNTIN EIKRDGFNIS 
EIINSSLASD RTEAIIKSVG LTTILLSDTI RKVEPDFLLL LGDRYELFAP AIAALIQRIP
IAHIAGGETT YGVLDEQVRH AITKMAHLHF PTTWEYGWRI RQMGEEAWRI HVVGSPGIEN
INNCDYMLPN ELEEDFGINL EKPIILVTYH PETLEKGYQA PKDIEQLVRA LKHFSGHQQV
ITYPGTEVGY QNIIEAWQQY AADRPNVILK KSLGSRGYLG VMRLASVVVG NSSSGIIEAP
SFHVPTVNIG ERQKGRIRAD SIIDVPCEEK EIVKGIKKAL EDRDFRQSLK DVFNPYDPYG
DSNASGRIVK VLEEIPINRK LLEKRLDFPS PEEKRNYHVQ
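
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: it builds the standard codon assignments (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in its permitted start codons), strips the spacing between 10-base blocks, and stops at the first stop codon. The helper name `translate` is hypothetical.

```python
# Codon table in the conventional TCAG ordering of the standard genetic code.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(dna: str) -> str:
    """Translate a coding sequence, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base blocks of the gene sequence from this record.
print(translate("ATGAGAAAAA TCATGGCTTT TACAGCCACA"))  # MRKIMAFTAT
```

The first ten residues match the start of the listed protein sequence; passing the full gene sequence to the same function should, under the assumptions stated, reproduce the whole protein.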