Gene Moth_2345 details

Gene Information

Locus tag: Moth_2345
Symbol:
ID: 3832063
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 2465846
End bp: 2467186
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 43%
IMG OID: 637830268
Product: major facilitator transporter
Protein accession: YP_431174
Protein GI: 83591165
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
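
The start, end, gene length and protein length values above are consistent with one another. A minimal sketch of that arithmetic follows, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon excluded from the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon is 3 bp; the terminal stop codon does not encode a residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(2465846, 2467186)
print(length)                     # 1341
print(protein_length_aa(length))  # 446
```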


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.205861
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAAGG AAAACAGTAA CAGTAGCCTG ATTACATGGT GGCGTTACCG GTACCTTATT 
GCGAGCATTC TATTTTTTGC ATATAGCATC CAGTACCTGG ATCGGATTAA AACAACAGCC
CTTATTCCCC TGATTATGGA TAGTATTCAT CTTAGTCACG CCGATGTAGG TAACGGTATT
TTCTTAATGC TAATTTTTTA TGGACCGTCA CAATTCATTT CAGGGATTAT ATGCGACAAA
TACGGCGCTA AGAAAGTACT TATCTTTTCA TTGATTGGCT GGAGCCTTCT AACTTTTTGG
ATGGCATTCT TACAATCCAG GGACGAGTGG TACATCCGGA ACGCCCTTTT TGGAATTTTT
ATTGGAACTG AATTTATACC TAGTGCCCGC CTTCTTTCAC GGTGGTTCCC ATCACGGCAG
CGAGCACGGG CTCAAAGCAG TCTTTCCTGG GCTTGGATCC TTACACCGGC ATGGGCTACT
ATTGTAGCAA CACAGCTTGC TTCATTTTTT GGCAGCTGGC GTCCCGTATT TATAGTGGTT
GCAATTATTG GCTTAGTTCC CTTGGCATTA ATAATCTGGC TAATTAAAGA CCGTCCAGAA
CAGGTTAAAC ATCTTTCATT AGCAGAAATA AAGGAAAGTT ACGAGGATGA GATTTCCTCG
GGCGTAATCT CCAGTGATGA AATTAATAGG AGAGAAGTAT CTACCCAAAC TATCAAAAAG
GCACAGATCC CCCTCCGTAA TATACTTACT TATCGTGGCT TTTGGGCGAT AGCTTTTGTT
GATATTGCCT CCCAGATGAT GTACTGGGGA GTTGTATCTT GGTCGCCAAC CTATCTTAAA
GACGTATTTA AGTTTAGTAT AACAGGGATG GGTTTCTGGG CTTCTATTTA TTTTGCTGCG
GGTGTATTGG GTGCCTATTT GAGTTCTATT ATCAGTGATA AAGTGATGAA ATCAAAAAGA
AAACCCATGA TTGTTATTTC CTTTTTAGGC ACCCTTCCTT TTATTGTCAT ATTATCGCAG
TTGCATTCAG GAGTCAGTCA TGCTGTCATT TTGCTTGTGC TCTCTTGTGC GGGATTCTTT
GCTAATATGG CCTGGGGCCC CTTCCTTTCC TGGCCGGCGG ACGTTTTTTC TCCTGAGGTT
TACGGTACTG CCATGGGCTT CGTAAACATG CTGGCATATA TCGGAGGAGC ATTTGCACCC
TTAATTATGA GCCGCCTAAT CCGTGTAGGC CAGGTTGGCC CCGACTATAC CTATGCTTGG
ATTTTTATCG CTTGTGCTGC TTTCGTCGGA TTCATTGCTT CCTGCCTTGT AAAGGATAAA
AAATATAGTC AGGCTAATTA G
 
Protein sequence
MNKENSNSSL ITWWRYRYLI ASILFFAYSI QYLDRIKTTA LIPLIMDSIH LSHADVGNGI 
FLMLIFYGPS QFISGIICDK YGAKKVLIFS LIGWSLLTFW MAFLQSRDEW YIRNALFGIF
IGTEFIPSAR LLSRWFPSRQ RARAQSSLSW AWILTPAWAT IVATQLASFF GSWRPVFIVV
AIIGLVPLAL IIWLIKDRPE QVKHLSLAEI KESYEDEISS GVISSDEINR REVSTQTIKK
AQIPLRNILT YRGFWAIAFV DIASQMMYWG VVSWSPTYLK DVFKFSITGM GFWASIYFAA
GVLGAYLSSI ISDKVMKSKR KPMIVISFLG TLPFIVILSQ LHSGVSHAVI LLVLSCAGFF
ANMAWGPFLS WPADVFSPEV YGTAMGFVNM LAYIGGAFAP LIMSRLIRVG QVGPDYTYAW
IFIACAAFVG FIASCLVKDK KYSQAN
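
The protein sequence can be checked against the gene sequence by translating codon by codon. The sketch below is illustrative only: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which codons may act as starts). It does not handle alternative start codons, which is not an issue here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and the final line of the gene sequence above.
print(translate("ATGAATAAGG AAAACAGTAA CAGTAGCCTG"))  # MNKENSNSSL
print(translate("AAATATAGTC AGGCTAATTA G"))           # KYSQAN
```

Both fragments reproduce the corresponding ends of the protein sequence listed above; the final line of the gene sequence is in frame because the 22 preceding lines hold 1320 bases, a multiple of 3.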