Gene Cthe_3147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_3147 
Symbol 
ID4809710 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp3717291 
End bp3719036 
Gene Length1746 bp 
Protein Length581 aa 
Translation table11 
GC content41% 
IMG OID640108580 
ProductABC transporter related protein 
Protein accessionYP_001039535 
Protein GI125975625 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000208385 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGTTAAAG ATCTGGCACG AAGCATTCGT GAGTACAAAA AAGTATCAAT TATAACTCCC 
ATACTGATAA GCCTGGAAGT GGTGATTGAA TGCATAATCC CATTCATCAC TGCGACTTTG
GTTAACAAAA TCAAAAGCGG ATGTGAATTA AACACAATTA TCAACTATGG TATAGTCTTG
ATTATCATGG CATTTCTGTC GTTGATGTTC GGTGCGATTG CAGGCTCAAC CGCTGCCACT
GCGTCCTGCG GGTTTGCAAA GAATTTAAGA AAAGATATGT TCTATAGTAT TCAGAACTAT
TCTTTTGAAA ATATCGACAA ATTCATGACA TCTTCACTGA TTACCCGTAT GACCACCGAT
GTTACCAATG TGCAGCATGC GTATATGATG CTCATTCGAG TGGCGGTTCG CGCTCCTTTA
ATGTTGATTT TTGCATTTGT AATGGCATTT GTAATGGGCG GGCGCATGGC ATGGATTTTC
CTGGTTGTCG ATCCGTTCCT TGCAATTGGC CTTAGTGTTA TAATTTACAA AGCATTGCCT
TTGTTCAGAA AAGTGTTTAA AAAATATGAC GCTTTAAACC GTTCCATTCA GGAAAACATT
AAAGGTATGC GTGTTGTAAA ATCTTTTGTC CGCGAGGACT ATGAGCAAAA GAAATTTGAT
GCGGCGGCGG AAGATGTGTG CGCGGACTTT ACAAGAGCGG AACGTATTTT GGCTTTTAAC
GGCCCCTTGA TGCAGTTTTG CATGTATGTA GTCATGGTTT TCGTTTTGTC CTTTGGTTCC
TATACGATTA TCACCAGCCG GGGATTGGAT TTTGATATCG GGCAGTTTTC AGCCATATTG
ACATACAACT TTATGATTTT AATGAGCCTT ATGATGCTTT CCATGGTGTT TGTAATGATT
ACCATGGCCG GTGAATCCGC AAAGCGTATT GTTGAAGTAA TTAATGAGAA AAGTACGATG
ACAAATCCGG AAAACCCGAT TTATACGGTA AAAGATGGTT CAATATCCTT TGAGAATGTA
AGTTTCAAAT ATTCCGAAAA GGCAGAAAGA ATGACGCTGG AAAATATAAA TTTGGAGATT
AAATCCGGGG AAACCATCGG AATTATAGGG GGCACAGGTT CTTCAAAGAC GACTCTTGTC
CAGCTGATTC CACGTTTGTA CGACGCTACC GAAGGAGTTG TGAAAGTAGG GGGCGTGGAT
GTAAGAAATT ATGATTTGGA AACTTTGCGC AATGAAGTTG CCATGGTGCT GCAGAAAAAT
GTCCTGTTTT CCGGAACCAT CAAGGAAAAT CTTCGCTGGG GAAACAAAGA TGCCACGGAT
GAAGAATTGA TAGAGGCTTG CAAGCTTGCT TGTGCCCATG AATTTATCAG TCAATTTCCC
AAAGGCTATG ATACCTATAT TGAGCAGGGC GGTACCAATG TGTCGGGCGG ACAGAGGCAA
AGACTCTGTA TTGCAAGGGC GCTTCTGAAA AAACCCAAAA TATTGATTCT GGATGATTCC
ACCAGTGCCG TGGATACCAA GACTGATGCA AAAATTCGCA AGGCACTTAA AAATTACATG
CCCGAGACAA CCAAGATAAT TATCGCCCAG AGAACGGCTT CTGTTGAAGA TGCAGACAGA
ATTATAGTAA TGGATGGCGG AACCATAAAC GGAATCGGAA CCCATGAGCA GTTGCTGGCT
GAGAATACAA TCTACAGGGA AATATATTTT TCTCAAAACA AGGCAGGTGT GGAAAGTGGA
AAATAA
 
Protein sequence
MVKDLARSIR EYKKVSIITP ILISLEVVIE CIIPFITATL VNKIKSGCEL NTIINYGIVL 
IIMAFLSLMF GAIAGSTAAT ASCGFAKNLR KDMFYSIQNY SFENIDKFMT SSLITRMTTD
VTNVQHAYMM LIRVAVRAPL MLIFAFVMAF VMGGRMAWIF LVVDPFLAIG LSVIIYKALP
LFRKVFKKYD ALNRSIQENI KGMRVVKSFV REDYEQKKFD AAAEDVCADF TRAERILAFN
GPLMQFCMYV VMVFVLSFGS YTIITSRGLD FDIGQFSAIL TYNFMILMSL MMLSMVFVMI
TMAGESAKRI VEVINEKSTM TNPENPIYTV KDGSISFENV SFKYSEKAER MTLENINLEI
KSGETIGIIG GTGSSKTTLV QLIPRLYDAT EGVVKVGGVD VRNYDLETLR NEVAMVLQKN
VLFSGTIKEN LRWGNKDATD EELIEACKLA CAHEFISQFP KGYDTYIEQG GTNVSGGQRQ
RLCIARALLK KPKILILDDS TSAVDTKTDA KIRKALKNYM PETTKIIIAQ RTASVEDADR
IIVMDGGTIN GIGTHEQLLA ENTIYREIYF SQNKAGVESG K