Gene Information |
Locus tag | Athe_0543 |
Symbol | |
ID | 7408669 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 613353 |
End bp | 614345 |
Gene Length | 993 bp |
Protein Length | 330 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 643714926 |
Product | spore coat protein, CotS family |
Protein accession | YP_002572442 |
Protein GI | 222528560 |
COG category | [R] General function prediction only |
COG ID | [COG2334] Putative homoserine kinase type II (protein kinase fold) |
TIGRFAM ID | [TIGR02906] spore coat protein, CotS family |
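The coordinate and length fields above can be cross-checked against one another. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon (the gene sequence in this record ends in TAA); the function names are hypothetical and not part of any database API.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a closed, 1-based interval [start_bp, end_bp] (assumed convention)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length_bp // 3 - 1


# Values copied from the record above.
gene_length = span_length(613353, 614345)
protein_length = expected_protein_length(gene_length)
print(gene_length, protein_length)  # 993 330
```

Under these assumptions the computed values agree with the listed gene length of 993 bp and protein length of 330 aa.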
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGCTC TGAGAGAGGA GATAACTCAA TTTTTTGATA TAGAGGTTTT TTATTTTATG CCTATCCGTG ACATTTTGGT TCTTTCAACA GACAGAGGTC TTAAGTGTTT TAAAAGAGTA GATTATTCGA TAGAAACGCT TTTGTTTATT CACGGGGGCA AAGAACACCT TGTTTCAAGA GGATTTATAG ACATTGACAG GTTTAATTTA AGCAAGGAAG GCTTGCCTTA TGTTATGTTG GGTGATGAAA TCTATGTTCT GACAGACTGG ATTGACGCGA GAGAGTGTGA ACTTGAAAAC CCGATAGAAT TGAAAGCTGC GACAGAAAAA CTTGCTATGA TGCACGAGGC ATCTATAGGT TATACAAATG TTCCCGAAGG TGCAAGGGTC AGAGATGATT TGGGAAAACT TTTGACTAAG TTTGAAAAGC GCTGCAATGA ATTTTTGCGT ATGAGAAAGA TGGCAGAGAA AAAAAAGAGC ATGTTTGATT ACGAGTATTT ATTTACATAC TCATATTATT TTGATCTTGC AAAGGAAGCG CTTGAAAAAC TTAAAAATTC AAATTACTTA AAACTTTGTG ATGAGGCAAG AGAAAAAAGA GGATTTATTC ACAGAGATTA CTCTTACCAC AATATTCTCT ACACTCACGA TGGTGATGTG TATATAATAG ACTTTGATTA TCTTACCTAT GACCTTCGAA TAGTTGACCT CACAAGCTTT ATGCAAAAGG TGTTAAAAAG GATTCACTGG GACATAAAAA CAGGTGAGAG CATCTTGAAC TGGTATTCAA ATGTATCGCC GCTGAACAAA GACGAGCTTG AACTTGTCTA TATAATCCTG CTTTTTCCCT ACAGATATTG GAAAACATGC AACAGATATT ATAATGGTAA AAAGAGCTGG TCTGAAAAAG CATTTACAAA CAAGCTTCAT GAAGTTATTG CAGAAAAAGA ATTTCACTAT GATTTTATTA GGTGGCTTGA AAAACTGATA TAA
|
Protein sequence | MIALREEITQ FFDIEVFYFM PIRDILVLST DRGLKCFKRV DYSIETLLFI HGGKEHLVSR GFIDIDRFNL SKEGLPYVML GDEIYVLTDW IDARECELEN PIELKAATEK LAMMHEASIG YTNVPEGARV RDDLGKLLTK FEKRCNEFLR MRKMAEKKKS MFDYEYLFTY SYYFDLAKEA LEKLKNSNYL KLCDEAREKR GFIHRDYSYH NILYTHDGDV YIIDFDYLTY DLRIVDLTSF MQKVLKRIHW DIKTGESILN WYSNVSPLNK DELELVYIIL LFPYRYWKTC NRYYNGKKSW SEKAFTNKLH EVIAEKEFHY DFIRWLEKLI
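The sequences above are displayed in space-separated blocks of ten characters, so they need cleaning before any downstream use. The sketch below is one possible way to do that and to compute a GC fraction comparable to the GC content field; the helper names are hypothetical, and only the first two blocks of the gene sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove the display whitespace between blocks and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two display blocks of the gene sequence in this record.
sample = clean_sequence("ATGATTGCTC TGAGAGAGGA")
print(len(sample), gc_fraction(sample))
```

Passing the full gene sequence string through the same two functions gives a value that can be compared with the listed GC content.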