Gene Cthe_1762 details

Gene Information

Locus tag: Cthe_1762
Symbol:
ID: 4810006
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2083795
End bp: 2085060
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 42%
IMG OID: 640107175
Product: RND family efflux transporter MFP subunit
Protein accession: YP_001038176
Protein GI: 125974266
COG category: [V] Defense mechanisms
COG ID: [COG1566] Multidrug resistance efflux pump
TIGRFAM ID: [TIGR01730] RND family efflux transporter, MFP subunit
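
The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends with a single stop codon; the function names are hypothetical and not part of the database.

```python
def gene_length_bp(start_bp, end_bp):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp, has_stop_codon=True):
    """Protein length in residues for an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The terminal stop codon (assumed present) does not encode a residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(2083795, 2085060)
print(length)                     # 1266
print(protein_length_aa(length))  # 421
```

Both values agree with the Gene Length and Protein Length fields of this record.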


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0464275
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGAAAG TTGTGAAAAT TATAATAGTA ATAGCAGTTC TTGGAGGACT TGTAGGGTTA 
ACCTATCTTT TGAGCGGCAA TATGGAGCAG GTGTTTAGCA GCAGTACCGT TTACAGTGTA
AAAACCGTCG AAATCGGGAA GGGCAGCATA TCTTCGTCGG TTTCGGCTTC GGGAAAAATT
GAGGAAGTTG ATTCCTATGA TGTCTATATA GACAATCCTG TAAAGGTTAA GAAGCTTTTG
GTTGAAAAAT ACCAAAAGGT GTCAAAGGGA CAACAGCTTG TGGAGTTCGA TATTGAGGAT
ATGGAAACGG AGCTTGAAAA GCTTAAAATA AACAAAAAAG TTCAGGAGCT TTCTCAGAAC
TCTCCGACGG TTGATGCCGA GATAAAAAGG GCCGAGTCGG CGGTAAAAAG CGCGGAGCAG
GCTTTAAGCG ACGCTGAAAA GAAATACGAG GACAGCAAAA AACTTTTTGA AGCACAGGCA
ATATCAAAAA GTGAACTGGA TATGGCTGAA AATGCCGTAA GAGATGCAAG AACCGCGCTG
GAGAATGCAA CAGTTGCCTA TAATGCCGCC GTAGAAAGCA AAGACGTGGA CCAAAAGGTA
AAAAAAGAGA ACCTCAACGC CCTTATACTT AGTATAAGCG ACCTTGAGAA GAAAATTCAA
AAGACAAAGG AATCAATGTT TTGCCCGTTT GACGGGATTG TTACCGAGAT AAATATTCAG
GAGGGAGCTT ATACAAGCAA TATGCAGCCT GCCTTCAGGA TTGTCAATCT TGACAAGCTA
AAGGTAAAGG CCATGGTAAA CGAATACAAT ATAAAAGATG TAAAAGTGGG ACAGAAAGTC
AGAATCACCG GAGATGCCAT CAGGGAAGAT GTCGAGGTTT TCGGAGTGGT GGAAAGTATT
TCACCGGTTG CAAAAACAAA TATGACAGCC TCGGGCGAAG AAGTTGTAAT TGAAGTGGAC
ATATCGATAG ACAATACTGC TTTGAACCTG AAACCGGGAC TTAGTGTCGA TTGTGAAATA
TTAACCGATG AGAAAAAAGA TGTAGTTGTG GTGCCCATGG GAGTCTTAAA GACTGACAAG
GACGGAAACG AATATGTGCT GCTGGTGGAC AAAGAGAAAG GTGTCCTTAT GCAGCGAAAT
GTAAAGCTTG GTATTATCTC CGACATGACT GCCGAGGTGT TGGAAGGGCT TGCGGAAGGA
GATATTGTTG TCGATGATCC ACAGTCGTTC CACAAGGATG GATCTAAAGT CAGGATAATT
GAATAG
 
Protein sequence
MKKVVKIIIV IAVLGGLVGL TYLLSGNMEQ VFSSSTVYSV KTVEIGKGSI SSSVSASGKI 
EEVDSYDVYI DNPVKVKKLL VEKYQKVSKG QQLVEFDIED METELEKLKI NKKVQELSQN
SPTVDAEIKR AESAVKSAEQ ALSDAEKKYE DSKKLFEAQA ISKSELDMAE NAVRDARTAL
ENATVAYNAA VESKDVDQKV KKENLNALIL SISDLEKKIQ KTKESMFCPF DGIVTEINIQ
EGAYTSNMQP AFRIVNLDKL KVKAMVNEYN IKDVKVGQKV RITGDAIRED VEVFGVVESI
SPVAKTNMTA SGEEVVIEVD ISIDNTALNL KPGLSVDCEI LTDEKKDVVV VPMGVLKTDK
DGNEYVLLVD KEKGVLMQRN VKLGIISDMT AEVLEGLAEG DIVVDDPQSF HKDGSKVRII
E
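
The sequences above are printed in blocks of 10 characters, 60 per line, so the spacing has to be removed before they are analysed. The following is a minimal sketch of that clean-up plus a GC-content calculation; the helper names `clean_sequence` and `gc_percent` are hypothetical, and the example uses only the first line of the gene sequence rather than the full record.

```python
def clean_sequence(block_text):
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(block_text.split()).upper()


def gc_percent(dna):
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# First line of the gene sequence, copied as printed in the record.
first_line = "ATGAAGAAAG TTGTGAAAAT TATAATAGTA ATAGCAGTTC TTGGAGGACT TGTAGGGTTA "
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Applying the same two functions to the full gene sequence should give values comparable to the Gene Length and GC content fields, assuming the database counts only G and C bases over the whole CDS.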