Gene Athe_0543 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0543 
Symbol 
ID7408669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp613353 
End bp614345 
Gene Length993 bp 
Protein Length330 aa 
Translation table11 
GC content35% 
IMG OID643714926 
Productspore coat protein, CotS family 
Protein accessionYP_002572442 
Protein GI222528560 
COG category[R] General function prediction only 
COG ID[COG2334] Putative homoserine kinase type II (protein kinase fold) 
TIGRFAM ID[TIGR02906] spore coat protein, CotS family 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTGCTC TGAGAGAGGA GATAACTCAA TTTTTTGATA TAGAGGTTTT TTATTTTATG 
CCTATCCGTG ACATTTTGGT TCTTTCAACA GACAGAGGTC TTAAGTGTTT TAAAAGAGTA
GATTATTCGA TAGAAACGCT TTTGTTTATT CACGGGGGCA AAGAACACCT TGTTTCAAGA
GGATTTATAG ACATTGACAG GTTTAATTTA AGCAAGGAAG GCTTGCCTTA TGTTATGTTG
GGTGATGAAA TCTATGTTCT GACAGACTGG ATTGACGCGA GAGAGTGTGA ACTTGAAAAC
CCGATAGAAT TGAAAGCTGC GACAGAAAAA CTTGCTATGA TGCACGAGGC ATCTATAGGT
TATACAAATG TTCCCGAAGG TGCAAGGGTC AGAGATGATT TGGGAAAACT TTTGACTAAG
TTTGAAAAGC GCTGCAATGA ATTTTTGCGT ATGAGAAAGA TGGCAGAGAA AAAAAAGAGC
ATGTTTGATT ACGAGTATTT ATTTACATAC TCATATTATT TTGATCTTGC AAAGGAAGCG
CTTGAAAAAC TTAAAAATTC AAATTACTTA AAACTTTGTG ATGAGGCAAG AGAAAAAAGA
GGATTTATTC ACAGAGATTA CTCTTACCAC AATATTCTCT ACACTCACGA TGGTGATGTG
TATATAATAG ACTTTGATTA TCTTACCTAT GACCTTCGAA TAGTTGACCT CACAAGCTTT
ATGCAAAAGG TGTTAAAAAG GATTCACTGG GACATAAAAA CAGGTGAGAG CATCTTGAAC
TGGTATTCAA ATGTATCGCC GCTGAACAAA GACGAGCTTG AACTTGTCTA TATAATCCTG
CTTTTTCCCT ACAGATATTG GAAAACATGC AACAGATATT ATAATGGTAA AAAGAGCTGG
TCTGAAAAAG CATTTACAAA CAAGCTTCAT GAAGTTATTG CAGAAAAAGA ATTTCACTAT
GATTTTATTA GGTGGCTTGA AAAACTGATA TAA
 
Protein sequence
MIALREEITQ FFDIEVFYFM PIRDILVLST DRGLKCFKRV DYSIETLLFI HGGKEHLVSR 
GFIDIDRFNL SKEGLPYVML GDEIYVLTDW IDARECELEN PIELKAATEK LAMMHEASIG
YTNVPEGARV RDDLGKLLTK FEKRCNEFLR MRKMAEKKKS MFDYEYLFTY SYYFDLAKEA
LEKLKNSNYL KLCDEAREKR GFIHRDYSYH NILYTHDGDV YIIDFDYLTY DLRIVDLTSF
MQKVLKRIHW DIKTGESILN WYSNVSPLNK DELELVYIIL LFPYRYWKTC NRYYNGKKSW
SEKAFTNKLH EVIAEKEFHY DFIRWLEKLI