Gene Cthe_2553 details

Gene Information

Locus tag: Cthe_2553
Symbol:
ID: 4809160
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 3022259
End bp: 3023284
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 36%
IMG OID: 640107968
Product: radical SAM family protein
Protein accession: YP_001038947
Protein GI: 125975037
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID:
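
The gene and protein lengths listed above can be cross-checked against the start and end coordinates. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 3022259
end_bp = 3023284


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1026 341
```

Both values agree with the Gene Length and Protein Length fields.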


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAGCCG AATTTAAGCC GAATTATGAC ACAAACAGAA AGAAGCTGGC CGATATTATA 
CCTCTTGCTG CACCGTTTAC GGTGTATATA GAACAAACCA GGTACTGTAA TTTCAAATGT
TTTTATTGTA TACATGCCAC AAGAGATAAA GAGGACGGAG AATTTCGCAA GCTGGGATTT
TCGGTAAAGC ATATGGATTT TGAAATGTAC AAAACAATCG TCAGTCAGCT TAAAGAATTT
ACAGAACCAA TAAAGAGAAT AGTATTTTCA GGTCTGGGCG AACCTCTAAT GAATCCCAGG
CTGCCTGAAA TGGTCAAGCT CGCTGTGGAA GCAAAAATAG CCGATAGAGT GGAAATAATT
ACAAATGGAT TATTATTAAC ACCGGAAACA TCAAGAAAAC TGATTGATGC AGGAATCACA
AACATTAACA TATCCGTCCA GGGTGTAAGC AAGGAGAGAT ACAAAGAAAC TTGCGGAGTA
GAAATTGACT TTGATGAGTA CGTTAAGAAC CTTAGTTATC TATACAGTAT TAAAGGTAAT
ACACAAATAT ATATAAAAGC AATAGATGCT ACACTAAAAT CCAAGGAAGA AGAAGAAAAG
TTTTTCAATA TTTTTGGAAA TATATGCGAC AAAATTTATA TAGAACATTT AATTGTCATG
CAGCAGCAAA TGGGTGAACT CAAAAAAATA GTGGACGGCA CTAAAAACTT TTACAATGAG
GAATTGGATT TAAACAGAAA AGTATGTGCC CAATCTTTCT ATTTCTTACA AATTGGATGC
GATTACGATA CCTTCCCCTG CCCTGTACCG GGTTTGCCAA AAAGTTTATC CATGGGAAAT
ATAAAGGATA ATACTATAAA AGAAATTTGG AACGGTGAAA AAAGAAGAGA ACATTTAAGA
ACAATGCTCA GCTATCAGAA AGACAGCATA CCTGAGTGCA ACAATTGTAC ATGCTTTAAT
GCGATTAACA ATCCACTGGA GAATTTGGAC CCGGATGCGC CCAGACTTTT AAAACTGTTT
GAATAA
 
Protein sequence
MKAEFKPNYD TNRKKLADII PLAAPFTVYI EQTRYCNFKC FYCIHATRDK EDGEFRKLGF 
SVKHMDFEMY KTIVSQLKEF TEPIKRIVFS GLGEPLMNPR LPEMVKLAVE AKIADRVEII
TNGLLLTPET SRKLIDAGIT NINISVQGVS KERYKETCGV EIDFDEYVKN LSYLYSIKGN
TQIYIKAIDA TLKSKEEEEK FFNIFGNICD KIYIEHLIVM QQQMGELKKI VDGTKNFYNE
ELDLNRKVCA QSFYFLQIGC DYDTFPCPVP GLPKSLSMGN IKDNTIKEIW NGEKRREHLR
TMLSYQKDSI PECNNCTCFN AINNPLENLD PDAPRLLKLF E
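
The relationship between the gene sequence, the protein sequence, and the GC content field can be explored with a short script. This is a sketch only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, the codon table is built from the standard NCBI amino-acid ordering (which translation table 11 shares with table 1 for non-initiating codons), and only the first 30 bases of the gene are used as input.

```python
from itertools import product

# NCBI codon ordering: bases T, C, A, G with the first base varying slowest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
excerpt = "ATGAAAGCCG AATTTAAGCC GAATTATGAC"
print(translate(excerpt))  # MKAEFKPNYD
```

The translated excerpt matches the first ten residues of the protein sequence. Note that table 11 also permits alternative start codons, which this sketch does not special-case; that does not matter here because the gene begins with ATG.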