Gene Cthe_0299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_0299 
Symbol 
ID4808517 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp374726 
End bp376096 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content43% 
IMG OID640105710 
ProductMATE efflux family protein 
Protein accessionYP_001036730 
Protein GI125972820 
COG category[V] Defense mechanisms 
COG ID[COG0534] Na+-driven multidrug efflux pump 
TIGRFAM ID[TIGR00797] putative efflux protein, MATE family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000180901 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAAAG AAACGGCGTC AATAAAAAAG ATGAGTGTCT TTGCTCTCAC ATGGCCGATA 
TTTATTGAAA CGCTGCTGAG AACAATGCTG GGAAATGTGG ATACTTTTAT GTTGAGTACA
TATTCCGACG ATGCCGTGGG GGCAGTAGGG GTTGTAAGTC AAATAAGTTA TATACTTATC
ATGCTGTACA ACGTTGTTTC GTCAGGAACA CTGGTGCTTA TATCCCAGTA TCTGGGCGCA
AAAAAGAAAA AGGAAGCTTC AGTGGTGGCT GTCACTTCAA TTGCAGGCAG TTTGATATTT
GGTTTGTTTG TCGGATTGGC TGTATTTCTG TTCAGAAGCC AGATATTAAC ATTTCTTAAT
TTGCCGCCCG AACTTATGGG ATATGCTATG ACATTTTTGG GAATTGTCGG AGGATTTTCT
TTTACCCAGG CATTGATAGC CACTTTGTCT GCAATAATCA GAAGCTATGG CAACACCAGG
ATAACCATGT ACATTTCTGT CGGCATGAAT ATCCTTAATA TTATTGGAAA CAGTATTTTC
CTGTATGGAC TGCTGGGGGC GCCGAAAATG GGAGTGACCG GTGTTGCCAT TGCAACTGTA
ATAAGCCAGG CTGTCGGTGT TGTTGCTATG CTGATTGTAA TGCTGACAGG ACTTAATCAA
AAATTTTCTT TCCGGGACCT TGTGCCGCTG CCGTGGGAGA TTTTAAGGGA TATATTGAAA
ATCGGACTTC CTTCTGCGGG TGAAGGAATT GCTTACGAAG CATCTCAGCT TACCATTACC
CGTATTATAA CGGTATTGGG AAAGGTTGCC CTTACAACCA GGGTATACAC TTTAAACATT
ATGTATTTTG TAATGGTTTT TTCAGTAGCG GTTGGTCAGG GAACTCAAAT TGTTGTAGGC
CATCTTGTGG GGGCGGGCGA TAATGAAAAA GCATACAAAA CATGTATTAA AAGTCTGAGA
TATGCTGTTG TGGTGGCAAT CATTCTTGCG GGAATTGTTT CGTTCTTTTC GGAGCAGCTT
CTTGGAATCT TTACGGATGA CCGGGCTATA ATTGAAATGG GGAGCAAACT CTTGCTGATT
GCAGTTATTT TGGAGCCGGG AAGAGTTTTC AATATTGTTA TAATAAACTC TCTGAGAGCG
GCGGGTGATG CCAGATTTCC CGTTATTATG GGTATTATAT CCATGTGGGG AATAGGAGTG
TTGCTGTCAT ATTTCCTGGG TGTGGCCTGC GGCTTGGGAT TGATAGGTGT ATGGATAGCC
TTTGCCAGTG ATGAATGGTT CAGAGGGATT GCCATGCTTC TGCGCTGGAG ATCCCGCGTC
TGGTATAAAA TGGCACTTGT AAAAAATCAG AATATTGAAA TGCCGGCTTA G
 
Protein sequence
MQKETASIKK MSVFALTWPI FIETLLRTML GNVDTFMLST YSDDAVGAVG VVSQISYILI 
MLYNVVSSGT LVLISQYLGA KKKKEASVVA VTSIAGSLIF GLFVGLAVFL FRSQILTFLN
LPPELMGYAM TFLGIVGGFS FTQALIATLS AIIRSYGNTR ITMYISVGMN ILNIIGNSIF
LYGLLGAPKM GVTGVAIATV ISQAVGVVAM LIVMLTGLNQ KFSFRDLVPL PWEILRDILK
IGLPSAGEGI AYEASQLTIT RIITVLGKVA LTTRVYTLNI MYFVMVFSVA VGQGTQIVVG
HLVGAGDNEK AYKTCIKSLR YAVVVAIILA GIVSFFSEQL LGIFTDDRAI IEMGSKLLLI
AVILEPGRVF NIVIINSLRA AGDARFPVIM GIISMWGIGV LLSYFLGVAC GLGLIGVWIA
FASDEWFRGI AMLLRWRSRV WYKMALVKNQ NIEMPA