Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cthe_2398 |
Symbol | |
ID | 4811050 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Clostridium thermocellum ATCC 27405 |
Kingdom | Bacteria |
Replicon accession | NC_009012 |
Strand | + |
Start bp | 2863682 |
End bp | 2864671 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 640107811 |
Product | putative spore coat protein |
Protein accession | YP_001038793 |
Protein GI | 125974883 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR02906] spore coat protein, CotS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAAGATT TAGACAAGGA ATTGTCCAGG ATATATGATT TTGACATAGA CAAGATTTAT CCGTTAAAGA ACTATTATGT TATTGAAACA TCGGAAGGAA AGAGAATTTT AAAAAGCGTC AACTGTTCAC CAGAACGCAT AATGTTTGTG CATGGAGCAA AAGAACACCT GTATAGTAAT GGCTTTAAAA ACATAGACAG GTATGTGTGT AACAAATCCA AAAGCCCGGC ATCATTTATA AACGGGATTC TTTATACAGT TTCGGAGTCG GTGGAGGGAA GAGAATGTGA TTTCAACAAC AGAGATGATG TGATAAGGGC TTCGAAAACA CTGGCAATGC TGCACAAGAC TTCAAAAGGA TATATTCCTC CTCAAAACAG CATAATAAGG AGTGATTTGG GCAAACTTCC CGAGTATTTC AGCAAGAGAC TGGAAGAAAT AAAAAGGACA AAAAAGATGG CGCAAAGGGA AAGAAATGAA TTTGATTATC TTGTTTTGGA ATATATTGAC TATTTTTATG AGCTGGGAGA GAATGCATTG GAGAAAATAC ACAATTCAAA ATATTATGAT GTGGTAAAAA AAAGCCAGGA AGAAAGATTG TTTTGCCATC ATGATTATAC TCATTGTAAT ATAATCTGCA AGGATTTGGA AACATCAGTT ATAAATTTTG AACATTGCAC TTTTGATCTG AAAGTATATG ATGTGGCCAA TTTATTGAGA AGAAAAATGA GAAAATGTAA CTGGGATATA AATGAAGCGA TGGTCATAAT TGATGCCTAT ACATCCATAG AACCAATTTC AAAAGAAGAG TTTGAGATTT TGGAAATCAT GCTTCAGTTT CCCCAGAAAT TCTGGAGAGT GGTTAACAGA TACTACAACA GCAGACGCAT AAAAAGGGAA AAGAACTTTA TTGCAAGGTT TAACGAAGTA ATTGAGGAAA TTGAGTATCA TAAAAGATTT TTAAATGAAT TCAATAAAAT TGTTCAATAA
|
Protein sequence | MQDLDKELSR IYDFDIDKIY PLKNYYVIET SEGKRILKSV NCSPERIMFV HGAKEHLYSN GFKNIDRYVC NKSKSPASFI NGILYTVSES VEGRECDFNN RDDVIRASKT LAMLHKTSKG YIPPQNSIIR SDLGKLPEYF SKRLEEIKRT KKMAQRERNE FDYLVLEYID YFYELGENAL EKIHNSKYYD VVKKSQEERL FCHHDYTHCN IICKDLETSV INFEHCTFDL KVYDVANLLR RKMRKCNWDI NEAMVIIDAY TSIEPISKEE FEILEIMLQF PQKFWRVVNR YYNSRRIKRE KNFIARFNEV IEEIEYHKRF LNEFNKIVQ
|
| |