Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Athe_0541 |
Symbol | |
ID | 7408667 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Anaerocellum thermophilum DSM 6725 |
Kingdom | Bacteria |
Replicon accession | NC_012034 |
Strand | + |
Start bp | 611335 |
End bp | 612348 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 30% |
IMG OID | 643714924 |
Product | spore coat protein, CotS family |
Protein accession | YP_002572440 |
Protein GI | 222528558 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0510] Predicted choline kinase involved in LPS biosynthesis |
TIGRFAM ID | [TIGR02906] spore coat protein, CotS family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 41 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTATATTT TCATGGGAAC ACCTGAACTT AAGTTAGTTG AAGAGAATTA TTATATAAAG ATAGATGAAA TAAAACAAAT AAAATCTAAT GCCTATTTTG TGAAGACAAA GGATGGTAAA GAATTTTTTT TGAAGGTCAG TAGAGTTGCC AAAGACTATG TTGATTTTAT TATCAAGATT TTTTCACACC TGAAAAATAC AAGTTTTAAA AGCCACCTGA TTGATTTTCA GAAGACCATT GACGGTGGCT TTTATTTTTT AGATGAAAAC AAAAAGGTTT ATCTTCTATG TAAGTGGATA GATGGCAGAA GTGCAGATTT CAGAAATGTT TATGACTTGA GAAGAGTAGT TTCAATCTTG TATCACCTTC ATTTAGCCTC ACTTTCTTTT GCTGAGGAAA TAAAAGATAG TTTTTATCCA TCTTATCAAG AAGTGTTTTG TAGAAAGTAT TCACAAGTTA TCCAAATGAA GAATATTATA CATCAAAAAG ATAATCTTAG CTATTTTGAT GAGATATTTT TGAATGTTCT AAGTAGATTT GAAGATAGAT TTGTGGAAAG CATACATATG ATAAAAAAAA TTGAAGACTA CTTCAAAGAA GAGAATCAAA AGGTATTAAT TCATCATGAT CCTGCTCATC ACAATTTTAT ATTTTCTGAA AAAGGTGTAT ACCTTATCGA CTTCGATTAT GCGATGGTAG ATTATAGTGT ACATGACTTT GCAAACCTTG GTGTGAGGGT TTTGAAAACA AATGATTGGG ACAGAAATAT GTTTAGAATT TATTTAAAAT TTCTACAGGA TAAAAATATC TTAAATAAAT TCTGGTTGCA AACATTTTGG ATTTTGATGT ATTTCCCGCA AGAGATTTGG CAAATTGGAC TTCAGTACTA TTTCGAAAAA CAACCGTGGA CAGAAGAGTA TTTTCTCAAA AGACTTAAAG GATCAGAAAG AATACAAGAA GAAAAGGAGA TGATAATTAA GGAATTTGCG GGAGGGATTT TTAAATGGCA TTGA
|
Protein sequence | MYIFMGTPEL KLVEENYYIK IDEIKQIKSN AYFVKTKDGK EFFLKVSRVA KDYVDFIIKI FSHLKNTSFK SHLIDFQKTI DGGFYFLDEN KKVYLLCKWI DGRSADFRNV YDLRRVVSIL YHLHLASLSF AEEIKDSFYP SYQEVFCRKY SQVIQMKNII HQKDNLSYFD EIFLNVLSRF EDRFVESIHM IKKIEDYFKE ENQKVLIHHD PAHHNFIFSE KGVYLIDFDY AMVDYSVHDF ANLGVRVLKT NDWDRNMFRI YLKFLQDKNI LNKFWLQTFW ILMYFPQEIW QIGLQYYFEK QPWTEEYFLK RLKGSERIQE EKEMIIKEFA GGIFKWH
|
| |