Gene Cthe_2398 details

Gene Information

Locus tag: Cthe_2398
Symbol:
ID: 4811050
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Clostridium thermocellum ATCC 27405
Kingdom: Bacteria
Replicon accession: NC_009012
Strand:
Start bp: 2863682
End bp: 2864671
Gene Length: 990 bp
Protein Length: 329 aa
Translation table: 11
GC content: 34%
IMG OID: 640107811
Product: putative spore coat protein
Protein accession: YP_001038793
Protein GI: 125974883
COG category: [R] General function prediction only
COG ID: [COG2334] Putative homoserine kinase type II (protein kinase fold)
TIGRFAM ID: [TIGR02906] spore coat protein, CotS family
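
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields above.
start_bp = 2863682
end_bp = 2864671
gene_length_bp = 990
protein_length_aa = 329


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Number of codons minus one, assuming a single terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 990
print(expected_protein_length(gene_length_bp))  # 329
```

Under these assumptions the record is internally consistent: the span is 990 bp, and 330 codons minus the stop codon give 329 amino acids.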


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAAGATT TAGACAAGGA ATTGTCCAGG ATATATGATT TTGACATAGA CAAGATTTAT 
CCGTTAAAGA ACTATTATGT TATTGAAACA TCGGAAGGAA AGAGAATTTT AAAAAGCGTC
AACTGTTCAC CAGAACGCAT AATGTTTGTG CATGGAGCAA AAGAACACCT GTATAGTAAT
GGCTTTAAAA ACATAGACAG GTATGTGTGT AACAAATCCA AAAGCCCGGC ATCATTTATA
AACGGGATTC TTTATACAGT TTCGGAGTCG GTGGAGGGAA GAGAATGTGA TTTCAACAAC
AGAGATGATG TGATAAGGGC TTCGAAAACA CTGGCAATGC TGCACAAGAC TTCAAAAGGA
TATATTCCTC CTCAAAACAG CATAATAAGG AGTGATTTGG GCAAACTTCC CGAGTATTTC
AGCAAGAGAC TGGAAGAAAT AAAAAGGACA AAAAAGATGG CGCAAAGGGA AAGAAATGAA
TTTGATTATC TTGTTTTGGA ATATATTGAC TATTTTTATG AGCTGGGAGA GAATGCATTG
GAGAAAATAC ACAATTCAAA ATATTATGAT GTGGTAAAAA AAAGCCAGGA AGAAAGATTG
TTTTGCCATC ATGATTATAC TCATTGTAAT ATAATCTGCA AGGATTTGGA AACATCAGTT
ATAAATTTTG AACATTGCAC TTTTGATCTG AAAGTATATG ATGTGGCCAA TTTATTGAGA
AGAAAAATGA GAAAATGTAA CTGGGATATA AATGAAGCGA TGGTCATAAT TGATGCCTAT
ACATCCATAG AACCAATTTC AAAAGAAGAG TTTGAGATTT TGGAAATCAT GCTTCAGTTT
CCCCAGAAAT TCTGGAGAGT GGTTAACAGA TACTACAACA GCAGACGCAT AAAAAGGGAA
AAGAACTTTA TTGCAAGGTT TAACGAAGTA ATTGAGGAAA TTGAGTATCA TAAAAGATTT
TTAAATGAAT TCAATAAAAT TGTTCAATAA
 
Protein sequence
MQDLDKELSR IYDFDIDKIY PLKNYYVIET SEGKRILKSV NCSPERIMFV HGAKEHLYSN 
GFKNIDRYVC NKSKSPASFI NGILYTVSES VEGRECDFNN RDDVIRASKT LAMLHKTSKG
YIPPQNSIIR SDLGKLPEYF SKRLEEIKRT KKMAQRERNE FDYLVLEYID YFYELGENAL
EKIHNSKYYD VVKKSQEERL FCHHDYTHCN IICKDLETSV INFEHCTFDL KVYDVANLLR
RKMRKCNWDI NEAMVIIDAY TSIEPISKEE FEILEIMLQF PQKFWRVVNR YYNSRRIKRE
KNFIARFNEV IEEIEYHKRF LNEFNKIVQ
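
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines of Python. This is an illustrative sketch only, not the method used to produce the record. It assumes the sequence is given on the coding strand in the 10-base blocks shown above, and it relies on the fact that translation table 11 assigns sense and stop codons the same way as the standard code (the tables differ in permitted start codons). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
# Codon table in the conventional TCAG ordering (first, second, third base).
# Sense and stop codon assignments are shared by the standard code and table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        index = (16 * BASES.index(codon[0])
                 + 4 * BASES.index(codon[1])
                 + BASES.index(codon[2]))
        amino_acid = AMINO_ACIDS[index]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First line of the gene sequence listed above.
first_line = "ATGCAAGATT TAGACAAGGA ATTGTCCAGG ATATATGATT TTGACATAGA CAAGATTTAT"
print(translate(first_line))   # MQDLDKELSRIYDFDIDKIY
print(gc_content(first_line))  # 0.3
```

The translated fragment matches the first 20 residues of the protein sequence above. Passing the full gene sequence to the same functions would allow the Protein Length and GC content fields to be checked in the same way.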