Gene Cthe_2306 details


Gene Information

Locus tag: Cthe_2306
Symbol:
ID: 4809233
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2751534
End bp: 2752943
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 45%
IMG OID: 640107712
Product: MATE efflux family protein
Protein accession: YP_001038701
Protein GI: 125974791
COG category: [V] Defense mechanisms
COG ID: [COG0534] Na+-driven multidrug efflux pump
TIGRFAM ID: [TIGR00797] putative efflux protein, MATE family
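
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function name cds_lengths is hypothetical and not part of any database API.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a terminal stop codon,
    which contributes three bases but no amino acid.
    """
    gene_length = end_bp - start_bp + 1
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein_length = gene_length // 3 - 1  # drop the stop codon
    return gene_length, protein_length


# Coordinates copied from the record above.
gene_length, protein_length = cds_lengths(2751534, 2752943)
print(gene_length, "bp;", protein_length, "aa")
```

With the coordinates from this record, the sketch gives 1410 bp and 469 aa, in agreement with the Gene Length and Protein Length fields.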


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.30452
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGTTGCAAT TTTTGCGTAA ACGGAAATGG TCTCAAAAAT ATTCAAAACA AATGATATGG 
GAAGTGATAT CTCTCTCGTG GCCGGCGTTT ATAGAACTGG TTATGTCAAC CCTATTTAGC
ATGATTGACA TGATTATGGT GGGGCGGCTC AACTCAGCAG CAATAACTGC GGTTGGACTT
ACCACCCAGC CTTTTATGGT TTTACTGGCG ATATTTTCGG CTGTCAATGT GGGTACAACC
ACGATTGTGG CATGGAATAT TGGCGCCGGC AACACTAAAA AAGCTAATGA GGTTGCACGG
CAGTCCATTA TTCTGAACTT TATCATGGGA ATTATCATAA GCACGATAGG AGTTTTTATG
GCTCATGACA TTGTAGTCTT TATGGGAGCC GAAGCGGATA CTGTGAAGGA TGCCACCGTA
TACTTTCAAA TAGTATCTGC CGGACTGGTT TTCCAGGCAG TTAACATGGG GGTTACGGCT
GCTCTGAGGG GAGCGGGGGA AACAACGATT CCCATGATAT ACAATGTAGG TAGTAATCTT
TTCAATGTAC TGGGAAACTA TTTGCTTATA TTCGGAAAGC TCGGTTTGCC CAAACTTGGC
GTGGCAGGAG CTGCAATTTC CACGTCCGTA TCGAGATTTT TAGCGTGTGT GGTTGGTTTG
TGTGTAGTAT TTTTCTTAAA ATGGTCTGCT ATCTCAATTA GGCTTAAGGG CAGCTACCGG
ATAAATTTTG ATATTGCCAG AGAAATTTTT TCAATAGGTT TGCCGTCGGC AATGGAACAA
TTTGTGGTTC AGGGCGGACT TATGATGTTT GCCCGTACGG TTTCAAGCCT GGGTACCGTA
ACATTTGCCG CTCATCAGAT AGGGCTCAGT ATTTGCGGAC TTACTTTTTC ACCCAGCATG
GCTTTTGGTG TTGCCGGCAC AACTCTGGTG GGACAAAGTC TTGGAGCAAA TGACGAGGAA
CGGGCTAAAA GGTATGCCGA TATCATACAT CATATGGCTA TTGCAGTTGC CTGCTTTATG
GGATTGATGT TTATCTTGTT CTCATATCCC CTGGCCTGTC TGTATACAGA AGACCTTAAA
GTTGCCGCAA TGGCCAGTAT TGTGCTTAAA ATAATGGCTT TGGCCCAGCC CGGACAATCG
ACGCAGCTTT CCCTTGCCGG TGTGCTCAGG GGAGCGGGAG ATACTATGTT TCCACTATAT
TCATCTATTG CCGGCATTTG GGGTTTTCGT GTGGTAGTTG CTTATATTTT TGTAAGTGTT
TTCCGCTGGG GGCTTATAGG AGCATGGGTT GCTCTCGTGC TGGACCAATA TACAAGGGCT
GCTATTGTGT ATTTTAGGTA TGCTTCGGGA AAATGGAAGT ATGTTAAGGC AAGAAACCAA
GAGGTTGAGA AGATGAGAGC ATGTTCATGA
 
Protein sequence
MLQFLRKRKW SQKYSKQMIW EVISLSWPAF IELVMSTLFS MIDMIMVGRL NSAAITAVGL 
TTQPFMVLLA IFSAVNVGTT TIVAWNIGAG NTKKANEVAR QSIILNFIMG IIISTIGVFM
AHDIVVFMGA EADTVKDATV YFQIVSAGLV FQAVNMGVTA ALRGAGETTI PMIYNVGSNL
FNVLGNYLLI FGKLGLPKLG VAGAAISTSV SRFLACVVGL CVVFFLKWSA ISIRLKGSYR
INFDIAREIF SIGLPSAMEQ FVVQGGLMMF ARTVSSLGTV TFAAHQIGLS ICGLTFSPSM
AFGVAGTTLV GQSLGANDEE RAKRYADIIH HMAIAVACFM GLMFILFSYP LACLYTEDLK
VAAMASIVLK IMALAQPGQS TQLSLAGVLR GAGDTMFPLY SSIAGIWGFR VVVAYIFVSV
FRWGLIGAWV ALVLDQYTRA AIVYFRYASG KWKYVKARNQ EVEKMRACS
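
The gene sequence starts with TTG, not ATG, while the protein starts with M. This is consistent with translation table 11, where TTG can serve as an initiation codon and is then read as methionine. The following is a minimal sketch of that translation rule using only the Python standard library; the function name translate_cds is hypothetical, and the start-codon set is written out from the NCBI definition of table 11 as an assumption that should be checked against the current NCBI listing.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with the table 11 amino acids
# (identical to the standard code for amino acid assignments).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: initiation codons permitted by translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS fragment, ignoring whitespace between 10-base groups.

    A permitted start codon in the first position is read as M.
    Translation stops at the first stop codon.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last 30 bases of the gene sequence in this record.
first_residues = translate_cds("TTGTTGCAAT TTTTGCGTAA ACGGAAATGG")
last_residues = translate_cds("GAGGTTGAGA AGATGAGAGC ATGTTCATGA")
print(first_residues, last_residues)
```

The two fragments translate to MLQFLRKRKW and EVEKMRACS, matching the first and last residues of the protein sequence above. For real work, an established library such as Biopython is a safer choice than a hand-built codon table.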