Gene Moth_0700 details


Gene Information

Locus tag: Moth_0700
Symbol:
ID: 3832701
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Moorella thermoacetica ATCC 39073
Kingdom: Bacteria
Replicon accession: NC_007644
Strand:
Start bp: 730282
End bp: 731760
Gene Length: 1479 bp
Protein Length: 492 aa
Translation table: 11
GC content: 42%
IMG OID: 637828632
Product: ABC transporter related
Protein accession: YP_429562
Protein GI: 83589553
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1129] ABC-type sugar transport system, ATPase component
TIGRFAM ID:
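
The start, end, gene length, and protein length values in this record are mutually consistent. A minimal sketch of the arithmetic follows, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon (neither is stated in the record). The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(730282, 731760)
print(length)                  # 1479
print(protein_length(length))  # 492
```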


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000766615
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.000015345
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGAAAAAG TTTTGGAATT AAAAAATATC GTTAAAAGAT ATGGCCAGGT GCCGGTTTTA 
AAAGGTGTAG ATTTTGATCT TTATGCCGGA GAGGTACATG CCATTGTTGG TCAGAATGGG
GCTGGAAAGA GTACTTTAAT GAAAATTTTA GCTGGCGTTA TCACTGATTA TGAAGGAACT
GAGGTTTTAA AAGGGAAACC TGTCAGGTTT CGTTCAGCTG GCGAAGCTCA GGTCCATGGC
ATTGGCATGG TACATCAGGA ACTGAGCGTT ATTCTAAAAC TTTCTGTAGC TGAGAATTTA
TTTATTGGTA CACCCGGTAG TAAAAAAACC TTTGTAAACT GGTGGGAAAT GGAGAATAAA
GCTCGACAAT TACTAAAGGA TTTTGGCCTG GAAAGGATTA ATGTAAAACG TCCCCTGGGT
AGTTACCTCT TGGGTATTCA ACAAATGATT GAAATCATTC GTACTATCCA TTCTGGCGCT
AAAATCATCA TAATGGATGA ACCGACCTCG GCTCTATCAC CACCCGAGGT AAAGCGGTTG
TTTGAACTTA TCGGCCGGTT GAAACAGGCC GGTACTAGCA TTATCTTTAT TTCCCATTTT
CTTGATGACG TCTTGGACAT CGCTGATCGG ATTACCGTTC TCCGGGATGG ACGTAAGATA
ACAACCTTGG AGAATAAGGA TATTAATAAG GCGGAACTTA TCAGATTAAT GCTTGGTAGC
AGTGAGGGTA TAAGTGAAAA CACGGAGATC GAGCTATCAG CTAGTGAAAA GGAACCTGTT
TTGGAGATCA AGGATCTGAG TTGTAGGCGT TTATTTAGGG ACGTAGCCTT TAGCGTTGGG
AAAGGTGAAG TGGTAGGACT TTTTGGCTAT ATGGGTGCCG GCCATATGGA ATTACCACGA
GTGTTATTTG GTCTTGAAGT ACCGGAAAAA GGACGGGTGA TTCTACAAGG AAAAGAAGTC
AAAATTAAAT CTCCGGGTCA TGCCAGAAGT TTGGGGTTAG CTTATGCTCC GGAAAGCAGA
AAAAAGGCCC TGTGCCTTAC AAAACCTATT TACGCTAATA TAACTCTACC TTTCCTGGCA
ACTATCGGCA GGTTTGTTAA TAATCGTACG CGGGAACTAG AGATCAGCCG CCAATTAATC
GAACGCACTG CCCTTAGACC ACCAAAACCC CTTTTAAATG TCGGTAATCT CAGTGGAGGT
AATCAGCAGA AAGTCTCAGT CTCCCGTTGG TTACCTACTC ATCCTATCGT TTTTATTCTC
AGCGAACCTA CCAGGGGCAT GGATGTCGGA GCCAAAGAAG AGATAATTAA TCTAGTCCGT
GACCTTAAGG CCCAAGGTAT GGGAATCATT GTTGCTTCCT CAGAACCAGA AACGATTTTT
GCTCTGGCCG ATCGCATATT GGTGTTCTCG AAAGGTAAAA TTGTGCATGA GTTTAAGCAA
GGTAAAGTCA ATAAAGAGCT TTTATTTCAG TATGCTTAA
 
Protein sequence
MEKVLELKNI VKRYGQVPVL KGVDFDLYAG EVHAIVGQNG AGKSTLMKIL AGVITDYEGT 
EVLKGKPVRF RSAGEAQVHG IGMVHQELSV ILKLSVAENL FIGTPGSKKT FVNWWEMENK
ARQLLKDFGL ERINVKRPLG SYLLGIQQMI EIIRTIHSGA KIIIMDEPTS ALSPPEVKRL
FELIGRLKQA GTSIIFISHF LDDVLDIADR ITVLRDGRKI TTLENKDINK AELIRLMLGS
SEGISENTEI ELSASEKEPV LEIKDLSCRR LFRDVAFSVG KGEVVGLFGY MGAGHMELPR
VLFGLEVPEK GRVILQGKEV KIKSPGHARS LGLAYAPESR KKALCLTKPI YANITLPFLA
TIGRFVNNRT RELEISRQLI ERTALRPPKP LLNVGNLSGG NQQKVSVSRW LPTHPIVFIL
SEPTRGMDVG AKEEIINLVR DLKAQGMGII VASSEPETIF ALADRILVFS KGKIVHEFKQ
GKVNKELLFQ YA
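
The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch for joining the blocks into one contiguous string and computing a GC fraction is given below. The helper names `join_blocks` and `gc_fraction` are hypothetical and not part of any database tool, and the sample input is only the first two blocks of the gene sequence.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, as displayed in the record.
sample = join_blocks("ATGGAAAAAG TTTTGGAATT")
print(sample)               # ATGGAAAAAGTTTTGGAATT
print(len(sample))          # 20
print(gc_fraction(sample))  # 0.25
```

Applying the same helpers to the full gene sequence should give a length that matches the Gene Length field.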