Gene Moth_0613 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0613 
Symbol 
ID3832588 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp638460 
End bp639953 
Gene Length1494 bp 
Protein Length497 aa 
Translation table11 
GC content45% 
IMG OID637828554 
ProductABC transporter related 
Protein accessionYP_429486 
Protein GI83589477 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1129] ABC-type sugar transport system, ATPase component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCTCCC CACTCTTACA ACTAAGGGGA ATATCAAAAT CCTTTTCGGG GATTAAGGTC 
CTCGATAATA TCGACCTGGA TATTTATCCC GGGGAAGTTC ATGCTCTCCT GGGAGAAAAC
GGCGCCGGGA AATCAACACT AATAAAAATA ATCTCCGGAG TCTACCAGAG AGACTGTGGT
ACCATAACAT TTAAACAAAA ACCGGTGGAG TTTACCAATA CTCGCCAGGC CCTGGATGCA
GGGATTAGTG TTATACACCA GGAACTCAGT CTTATTCAGG ATTTAAGTGT AGCTGAAAAT
ATTTTTTTAG GGCGAGAACC CATTAAATCG AGAGTTTTTA TTGATAAAGA AAAAATGGTT
AGCGAGACCA TAGCTATCGC CCGTTCCCTG GGTATTGATC TGAAACCATG GGCGATGGTC
CGGGACCTCA ATGTAGCCGA GCAACAGATG GTAGAAATAG CCAAGGCTGT GTCCTGCAAT
GCCTCACTGG TTATTATGGA TGAGCCAACC TCCTCCCTCT CTGATCGTGA AACTGAGACC
TTGTTTAAGA TTATCAAGCG CTTAAAAAAG GATAACGTGG CTGTTATCTA TATTTCCCAC
CGTTTAAAGG AACTCGAGGA ATTGGCTGAC CGGCTCACTA TTTTACGTGA TGGAAAACTT
GTAAAAACCA TGGTCGGGGA GGAAATGAAG AAATATAACT GGGTTTCTTT AATGGTAGGC
CGGGAGATCA AAGATTTTAG CCGGAAAGCC CAGAAACCCG GGGAAGTAAT TTTGCAGGTA
AAGGATTTCA CTGATCCCCC GAAATACTGG GATATCAATT TTGAATTGCG GCAGGGTGAG
ATTTTAGGCA TTGCCGGGTT GGTAGGGGCT GGCCGGACAG AAGTATTACA GGGCATATTT
GGAGTTAAAA AGCCAAAACA CGGTTCCCTA TACCTTAACG GGCAGAAGGT ACTTTTCAAT
TCGCCGGCTG AGGCCATCAG CAATGGAATC GGTTTCGTCC CTGAAGACCG TCGGCTCCAG
GGGGTAATCT TGGCACAATC TGTCAAAGAT AATATCTCCC TCCCCAGTCT ATATGATAAA
TCGAACTATG GTTTTATAAA TTTCCTTTGG GAAAACCAGG TGAGTGAAGA TTATATAAAG
AAAATGCGTA TCCGTACCCC CTCGGCTAAA ACTATTGTTA AAAATTTGAG CGGTGGCAAC
CAGCAGAAGG TAGCCCTGGC CAGATGGCTG GCGGCCCATG CCAAAATTCT GTTCCTGGAT
GAGCCAACCC GCGGTATTGA CGTTAACGCC AAGGCCGAGA TTTATAACTT GATGAATTCC
TTTACCAGTG AGGGAGGAAG CATCATCATG GTATCCTCGG AGTTACCGGA AATTTTAAGT
ATGAGTGATC GAATTGTAGT CATGCATGAG GGCCGTGTGG CCGGTATACT GGACCGGCAC
GAAGCCAGCG AAGAAAAAAT TATGGAACTG GCATGTGGAA AGATAGCCGG TTGA
 
Protein sequence
MASPLLQLRG ISKSFSGIKV LDNIDLDIYP GEVHALLGEN GAGKSTLIKI ISGVYQRDCG 
TITFKQKPVE FTNTRQALDA GISVIHQELS LIQDLSVAEN IFLGREPIKS RVFIDKEKMV
SETIAIARSL GIDLKPWAMV RDLNVAEQQM VEIAKAVSCN ASLVIMDEPT SSLSDRETET
LFKIIKRLKK DNVAVIYISH RLKELEELAD RLTILRDGKL VKTMVGEEMK KYNWVSLMVG
REIKDFSRKA QKPGEVILQV KDFTDPPKYW DINFELRQGE ILGIAGLVGA GRTEVLQGIF
GVKKPKHGSL YLNGQKVLFN SPAEAISNGI GFVPEDRRLQ GVILAQSVKD NISLPSLYDK
SNYGFINFLW ENQVSEDYIK KMRIRTPSAK TIVKNLSGGN QQKVALARWL AAHAKILFLD
EPTRGIDVNA KAEIYNLMNS FTSEGGSIIM VSSELPEILS MSDRIVVMHE GRVAGILDRH
EASEEKIMEL ACGKIAG