Gene Cthe_1685 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1685 
Symbol 
ID4808935 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2009036 
End bp2010760 
Gene Length1725 bp 
Protein Length574 aa 
Translation table11 
GC content28% 
IMG OID640107099 
ProductABC transporter related protein 
Protein accessionYP_001038100 
Protein GI125974190 
COG category[V] Defense mechanisms 
COG ID[COG1132] ABC-type multidrug transport system, ATPase and permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGGATT ACATAAAATT TTTAAAAGAA AATGTTTCTA AATATAGAAA TAGAATTATA 
GCAGCAAGTG TGTTATCTCT AGTATCTATG TTCTTTATAT TATTGAACCC ATTAATAACC
CGCGTATTAA TAGATAGTGT TTTAACCAGC AGAAATTTCT CAGTATTAAA TAAAGTGATT
ATCTTTATTT TTACTTTTAC CATAGTGAAT TCTATCGTAC ATTTTATTTA TAGTTATTCA
CTAAATAGAA TATTCTTAAG TTTGGGTTTA GATATAAAGA AAAAGATATT CAAACATATC
GTAAAACTAG ATTTATTAGT TACTAAAAAA ATGAGTGTCG GAGAAATAAA CTTTAGAATT
TTTAATGATT CAGATTTATT GAAACAGTCA TTTGGTCAAA TTATGTTTGG AGGGTTATTT
AATGTTATTT TATTAATAAT AATTTTTATT TATATGTTTA CAATTAGTTC TGTAATGACC
ATTTTTGTCG GGGTCATGAA TTTAATACAG GTACCAATAA TTATATACTT TACAAAAAGG
ATAAGAGCTA TAACATATGA GAGAAAGGTT GTCACAGAAG CGACCCTAAA CCGAACAATA
GAAGTAATAA GTTCATTTCA CCTATTAAAA GGTTGTAATA ATGAAAAAAG CGAAATTGAT
AAATTTGATA AGAATAATGA AAAAGTTTTG GACAAGCAAA TAAAAGAAGC AAACCTGAAC
CTATTGTTTA CGGAGATATC TAGTGTAATT ATGTCATGTA TAGGTTTTGG ATCTATCTGG
ATTGGTGGAA ATTTCGTGAT AAATGGGATA ATTACAATGG GTGAGCTTAT ATCCTTTTTA
TTAGTTGCAA ATATAATAAA TTCTCCAATA AATTCAATTG TAAGTGTTAT AACAGGGCTT
CAAGATGCAC TATCTAGTAC CAAACGAATA AGTGATGTTA TGAAGTTGGA GAATAGTATT
ATTCAAAGTA ATGATTGTAT AAAGGTGATG CATAAAATTG TTGATGAAAT CACTATAAAA
GATTTGAACT TTAGCTATGA TGATGGAAAA GAAGTGTTAA AAGATATTAA CCTAACAGTT
AAAAAGAATA CGATATGCTC AATAATAGGA AGAAGTGGGG CAGGTAAAAC AACCTTGTGC
TTATTAATAG CAAGATTTTT CGATCCTTCA AGTGGAGAAA TTTTGTTAGA TGATGTAAAT
ATTAAAGATA TAGATGTAGA TACTTATAGA AAGAGTGTAG GAATTGTTTT ACAAAATAGC
TTTTTATTTA GTGGTTCAAT AAGAGAAAAT ATATTACTAG GAAAAAGTGA TGCTACCGAA
GAGGAAATTA TTGAAGCGGC TAAATTAGCA AATGCTCATG AATTCATATC AGAACTTGAA
GATGGTTATT GGACTCAAAT AGGAAGCAAA GGGAGAAACC TATCGGGGGG TCAATTACAA
AGAATAGCAT TGGCTAGAAT ATTTTTGCAA AGACCTCAAA TTGTAATACT TGATGAACCA
ACCTCCTTTA TTGATTCAGA AAGTGAGGAG CTTATTCAAG AATCTATAAA TAAGCTTAAA
GAATACTCAA CAATATTTGT TATAAGCCAT AAATTAAGCA CTGTAAAAGG ATCAAACAAG
ATAGTTGTTC TTAATAATGG AAGGATTGAA GAATCTGGAA CACATCTTCA GTTATTAGAG
AAAGAGGGAG AATACAGCAA GCTTTATAGG AAAATATTAG CATAG
 
Protein sequence
MKDYIKFLKE NVSKYRNRII AASVLSLVSM FFILLNPLIT RVLIDSVLTS RNFSVLNKVI 
IFIFTFTIVN SIVHFIYSYS LNRIFLSLGL DIKKKIFKHI VKLDLLVTKK MSVGEINFRI
FNDSDLLKQS FGQIMFGGLF NVILLIIIFI YMFTISSVMT IFVGVMNLIQ VPIIIYFTKR
IRAITYERKV VTEATLNRTI EVISSFHLLK GCNNEKSEID KFDKNNEKVL DKQIKEANLN
LLFTEISSVI MSCIGFGSIW IGGNFVINGI ITMGELISFL LVANIINSPI NSIVSVITGL
QDALSSTKRI SDVMKLENSI IQSNDCIKVM HKIVDEITIK DLNFSYDDGK EVLKDINLTV
KKNTICSIIG RSGAGKTTLC LLIARFFDPS SGEILLDDVN IKDIDVDTYR KSVGIVLQNS
FLFSGSIREN ILLGKSDATE EEIIEAAKLA NAHEFISELE DGYWTQIGSK GRNLSGGQLQ
RIALARIFLQ RPQIVILDEP TSFIDSESEE LIQESINKLK EYSTIFVISH KLSTVKGSNK
IVVLNNGRIE ESGTHLQLLE KEGEYSKLYR KILA