Gene Cthe_1500 details


Gene Information

Locus tag: Cthe_1500
Symbol:
ID: 4810538
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 1822430
End bp: 1824322
Gene Length: 1893 bp
Protein Length: 630 aa
Translation table: 11
GC content: 45%
IMG OID: 640106920
Product: ABC transporter related protein
Protein accession: YP_001037921
Protein GI: 125974011
COG category: [V] Defense mechanisms
COG ID: [COG1132] ABC-type multidrug transport system, ATPase and permease components
TIGRFAM ID:
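
The start and end coordinates, gene length, and protein length listed above are related by simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. The record does not state either assumption, but both are consistent with its numbers. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1822430
END_BP = 1824322
STOP_CODON_LENGTH = 3  # assumption: one terminal stop codon, not translated


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp):
    """Residue count of an unspliced CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (cds_length_bp - STOP_CODON_LENGTH) // 3


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1893
print(protein_length(length_bp))  # 630
```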


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000079767
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGCCGG GCAGAGGGAA AATGTCAGGA GGCTTTGGCG GACCCGGCGG GAAAAGACCG 
CTTGGATTTG GTGGTCCAAG AGGTATGCGC GGTGCTGTGG GCGGTGCCAA ACCGAAAAAT
GCATCGGCTA CAATTAACAG GCTGCTGGCT TATATCGGAC GGGACAAGAT TAAAATTCTG
TTTGTTTTTG CGTGTGTACT GGGCAGCAGC CTGGCAAGCC TTGCCGGAAG CTATATTTTA
CGTCCTGTCA TAAACAATCT GGTTTATTCC GACGGAACTG CAAAAGAAAA AATAAACAAT
CTTGTAATCG GTATACTTAC CATGGCATGC ATTTATTTGG CCGGAGTGGT ATGTTCTTAC
CTGCAGCAGA GAATCATGAT AGGAGTTTCG CAGAATGCGC TGATTAGAAT CAGGGAAGAG
TTGTTTCGCA AGATTCAAAA GCTGCCTTTA AAATATCATG ACACTCACAC TCACGGTGAT
ATAATGAGCC GTTTTACCAA TGACCTTGAT GCGGTTGGAG AAATGCTCAA TAACACCATG
TCCCAGATTT TTTCAGGTAT TATTACCCTG GTTGGCACTG TTGCTCTTAT GTTTTTTACA
AACTGGATAC TTGCAATCAT TATCATCGTG ACGGTGCCTT TGATGGTCTA CGTGGGAGGC
ATGATCGGAA AGCAGAGCAG AAAATATTTC ATAGGTCAGC AGCAGGCCCT CGGTGCGGTT
AACGGATATA TTGAGGAAAC TGTTACGGGT CAAAAGGTGA TAAAAGTGTT TTGCCATGAA
GAAACAGTAG TGGAAGAATT CGAGTTTTTA AGTGACAATT TGCGTGAAAA ACAGAGAAAG
GCACAGTTTT TTGGAAGTAT TATGGCACCT GTTATGGGTA ATTTAAGCCA AATAAGCTAT
GCGTTGTCTG CCACAATCGG AGGTGTTCTC TGCCTGACGC TGAATTTTGA TATTGGCGGC
CTTGCCATAT TTACTAACTA TTCAAGACAT TTTGCAAGGC CGATTAGTGA ATTGTCCATG
CAGATGAATA CGATTTACGC GGCATTGGCC GGAGCGGAGA GAGTGTTTGA AGTGATGGAT
GAAACACCTG AGAAGGGTGA TTCCCCGGAC GCTATAGAAA TCTGTTCCGA AGCAAACCGG
GTGGAAGGAG CACAGCCAAT TAAAGGAGAA GTGGTGTTGA AAAACGTAAC CTTTGGTTAT
GTACCGGGAA AAACCGTGCT GAAAAATATT AATGTAACAG CCAAACCCGG CCAGAAAATC
GCTTTTGTGG GTTCCACGGG TGCCGGGAAA ACGACCGTTA CGAACCTTAT AAACAGGTTT
TATGAGATTG AAGAAGGAGA AATACTGATT GACGGTATCA ATATCAAGAA TATTAAAAAG
GACTCATTAA GAAGCAATAT AGCAATAGTG CTTCAGGATA CCCATTTGTT TTCCGGAACG
GTCAGAGAGA ATATTCGGTA TGGCCGTCTT GACGCAACCG ATGAAGAAGT CGTTGCAGCG
GCCAAGGTGG CCTGCGCCCA TTCCTTTATC GAAAGGCTGC CCCAGGGATA TGATACGGTA
TTGGATGGGG ACGGAGCGAA TTTGAGCCAG GGTGAACGGC AACTTCTCAA TATTGCGCGG
GCTGCCATAT CAAAAGCTCC CATTCTGATA CTGGACGAAG CCACCAGTTC GGTTGACACG
AGGACCGAAA AATACATTGA GCGTGGTATG GACAGGTTGA TGAAAAACCG GACTACTTTT
GTAATCGCGC ACAGACTGTC CACCGTGCGA AACGCAGATC TGATTATAGT GCTGGAACAC
GGTGAAATTA TAGAACAGGG AACCCATGAA GAACTCCTTG GGATGGGCGG AAGATATTAT
CAGCTTTATA CAGGAGCGGT TGAACTGGAC TAG
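
A GC content figure such as the one in this record can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the value is the percentage of G and C bases over the full gene length. How the database rounds the figure is not stated, so no rounding rule is applied here. The helper names are hypothetical.

```python
def clean_sequence(text):
    """Strip the display spaces and line breaks and upper-case the bases."""
    return "".join(text.split()).upper()


def gc_content(sequence):
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First display line of the gene sequence, used only as a short demonstration.
# Paste the complete sequence to recompute the gene-wide figure.
first_line = "ATGAAGCCGG GCAGAGGGAA AATGTCAGGA GGCTTTGGCG GACCCGGCGG GAAAAGACCG"
print(len(clean_sequence(first_line)))  # 60
print(gc_content(first_line))
```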
 
Protein sequence
MKPGRGKMSG GFGGPGGKRP LGFGGPRGMR GAVGGAKPKN ASATINRLLA YIGRDKIKIL 
FVFACVLGSS LASLAGSYIL RPVINNLVYS DGTAKEKINN LVIGILTMAC IYLAGVVCSY
LQQRIMIGVS QNALIRIREE LFRKIQKLPL KYHDTHTHGD IMSRFTNDLD AVGEMLNNTM
SQIFSGIITL VGTVALMFFT NWILAIIIIV TVPLMVYVGG MIGKQSRKYF IGQQQALGAV
NGYIEETVTG QKVIKVFCHE ETVVEEFEFL SDNLREKQRK AQFFGSIMAP VMGNLSQISY
ALSATIGGVL CLTLNFDIGG LAIFTNYSRH FARPISELSM QMNTIYAALA GAERVFEVMD
ETPEKGDSPD AIEICSEANR VEGAQPIKGE VVLKNVTFGY VPGKTVLKNI NVTAKPGQKI
AFVGSTGAGK TTVTNLINRF YEIEEGEILI DGINIKNIKK DSLRSNIAIV LQDTHLFSGT
VRENIRYGRL DATDEEVVAA AKVACAHSFI ERLPQGYDTV LDGDGANLSQ GERQLLNIAR
AAISKAPILI LDEATSSVDT RTEKYIERGM DRLMKNRTTF VIAHRLSTVR NADLIIVLEH
GEIIEQGTHE ELLGMGGRYY QLYTGAVELD
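
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the standard NCBI base ordering (TCAG) and amino acid string, which translation table 11 shares with the standard code for non-initiating codons. Table 11 also allows alternative start codons; they are not handled here, on the assumption that it does not matter for a gene beginning with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids in NCBI order (first base varies slowest, third base fastest).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(sequence):
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(sequence.split()).upper()
    residues = []
    # Ignore any trailing bases that do not form a complete codon.
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First and last display lines of the gene sequence in this record.
print(translate("ATGAAGCCGG GCAGAGGGAA AATGTCAGGA GGCTTTGGCG GACCCGGCGG GAAAAGACCG"))
# MKPGRGKMSGGFGGPGGKRP
print(translate("CAGCTTTATA CAGGAGCGGT TGAACTGGAC TAG"))
# QLYTGAVELD
```

Both outputs match the corresponding start and end of the protein sequence listed above.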