Gene Cthe_1113 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_1113 
Symbol 
ID4811411 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp1325435 
End bp1327291 
Gene Length1857 bp 
Protein Length618 aa 
Translation table11 
GC content25% 
IMG OID640106535 
ProductSecC motif-containing protein 
Protein accessionYP_001037538 
Protein GI125973628 
COG category[S] Function unknown 
COG ID[COG3012] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAGT TAATTATGGA ATTAAAAGAA AAATTAAGCC ACTATAAAAC AGAGGAAATT 
CTGGGTTATG TTGCAAATGA TTTTTTATAT ACACCTTTTG GAGCCAATAC TATACATGAA
AAAACAAATT TGGATTCACC TGCTAAACAG TTAACTTATT TAATAGGACT ATTAATGTCT
ACTGAGTATT TAGATAATCA GGATTATAAA GAAGAAAAAA AATTTAAGAA TGATGATTTA
AAAGAATATA AAAATGAAAT GGATAATTTA TATAAATCAA TTCAAAAAAT AACATTCAAA
TACCTAGATT CTTTTTTTCC AAAAGATAGT ATAGAAAACT ATGATGAAGA GTGGAAGAAA
AAACGAGAAG CAATGTTACC TGTTTTTTTA AGTTATTTTA ATAATATTAC ACTCTGCAAT
GAAGAACAAA TTATTAGAAG GATACATAAT TGGTTCTCAA GATTTGATGA TAAATTAAAA
GAAAACTATA ATTTTGATAC TATACTGTTA ATAGAATTCT ATAAATTTGT TAAAAACAAG
TTAGAAAATG TGTTTGATAG TTTTCAAGAA GTTGTTAAAA AGTGTCAAGA TGAATTACTC
AAGTTAGAAG ATGAGTTTCT AAGAAATACG AATTATACAA GAGATACAGA GTCAAATAAT
TGGAACTTTG AAGAATTATT TAGAAAACAA TCTAATTTTG AAAAGACGAA AAACCAATTT
TTCAAAATGG CTAAAGAAAT TCAAGAATTG TTTATAATAA AAAAACAAGA TATTGAAGAT
GTTTTTGGAA AAGAGAATGC AATGTTATTT GAAGAATTAT TTGTTCTAGA GAGGATAAAT
AGAGAGTTTA AGTACTATAC TGAAAATAAT CCGGTTTTAG AGAAACCTTT ATGCAAAGTA
GACAATGATA AATATTTTTT AATTCAGCCA CGGTTCTTAC TAGATGCTAT TTATAATCTA
TGTTATTCAA AACTTGAAGA ACTTCATAGA GAAAATAAAG TGAATTTTTA TAAAACTCGT
GGAGAAGTAG TTGAAAAAGA AGTATTACAT TTGCTTAATC AGGTATTTAA AGATAAGGCT
AAATACTATA CTGCAGTTTG TGAGACACCT AACTATAATG AGCATGACAT AATTGTTTTA
TATAAAGAAA ATATTTTAAT TATTGAAATT AAAAGTTCTA AAACTAAAGA ACCTCTCAGA
AATCCTGACA ATGGGTTTGA AAGAATGAAA GAACACTTCA ATTCCAAAAA AGGCATTGGA
GGAGGTTTTG TTCAAGCCAA CAATTTAAAA AAATATATTT TAGAAAATGA AGAAGTTACA
CTCTATAATA ATAAAGTTGA ACCATTTAAA ATATCAAGAA AAGATTATAA AAATATATTT
TGTATAGTAA TTACAGCGGA ACAATTTTTT TCACTCAATG TAAATACATC TATGTTTATA
GAAAAGGATG ATAGAGATGA ATATCCTTGG ACATGTGATT TATATAACTT AGAAGTTTTA
ATTGAGGGAT TCCAATACTG TAATAAAACT GTTGATGATT TTATTGAATA TATAAAGCAA
AGAATATGTT ATCATAAAAA ATTTATTACT GATGACGAAT TAGAAATTGC TGAATACTTT
CTTATAAAAG GGGATTTTAA TGATGAAAGA ATAAAGAAAA GCATTTTTAT AGGTTTTTTG
CCTACTACAT CAAACTTATT TGATAAAATC TATATGGAGA AAAAGAATAT ACCATATAAT
TATAATAGTG AAGAACAGTC ATTATTTTTT ATGATACCAG GTGGTAAGAT TGGTAGAAAT
GATAAATGTC CATGTGGAAG CGGAAAGAAA TATAAGAAGT GTTGTGGTCA ATATTAG
 
Protein sequence
MDELIMELKE KLSHYKTEEI LGYVANDFLY TPFGANTIHE KTNLDSPAKQ LTYLIGLLMS 
TEYLDNQDYK EEKKFKNDDL KEYKNEMDNL YKSIQKITFK YLDSFFPKDS IENYDEEWKK
KREAMLPVFL SYFNNITLCN EEQIIRRIHN WFSRFDDKLK ENYNFDTILL IEFYKFVKNK
LENVFDSFQE VVKKCQDELL KLEDEFLRNT NYTRDTESNN WNFEELFRKQ SNFEKTKNQF
FKMAKEIQEL FIIKKQDIED VFGKENAMLF EELFVLERIN REFKYYTENN PVLEKPLCKV
DNDKYFLIQP RFLLDAIYNL CYSKLEELHR ENKVNFYKTR GEVVEKEVLH LLNQVFKDKA
KYYTAVCETP NYNEHDIIVL YKENILIIEI KSSKTKEPLR NPDNGFERMK EHFNSKKGIG
GGFVQANNLK KYILENEEVT LYNNKVEPFK ISRKDYKNIF CIVITAEQFF SLNVNTSMFI
EKDDRDEYPW TCDLYNLEVL IEGFQYCNKT VDDFIEYIKQ RICYHKKFIT DDELEIAEYF
LIKGDFNDER IKKSIFIGFL PTTSNLFDKI YMEKKNIPYN YNSEEQSLFF MIPGGKIGRN
DKCPCGSGKK YKKCCGQY