Gene Athe_0541 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0541 
Symbol 
ID7408667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp611335 
End bp612348 
Gene Length1014 bp 
Protein Length337 aa 
Translation table11 
GC content30% 
IMG OID643714924 
Productspore coat protein, CotS family 
Protein accessionYP_002572440 
Protein GI222528558 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0510] Predicted choline kinase involved in LPS biosynthesis 
TIGRFAM ID[TIGR02906] spore coat protein, CotS family 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTATATTT TCATGGGAAC ACCTGAACTT AAGTTAGTTG AAGAGAATTA TTATATAAAG 
ATAGATGAAA TAAAACAAAT AAAATCTAAT GCCTATTTTG TGAAGACAAA GGATGGTAAA
GAATTTTTTT TGAAGGTCAG TAGAGTTGCC AAAGACTATG TTGATTTTAT TATCAAGATT
TTTTCACACC TGAAAAATAC AAGTTTTAAA AGCCACCTGA TTGATTTTCA GAAGACCATT
GACGGTGGCT TTTATTTTTT AGATGAAAAC AAAAAGGTTT ATCTTCTATG TAAGTGGATA
GATGGCAGAA GTGCAGATTT CAGAAATGTT TATGACTTGA GAAGAGTAGT TTCAATCTTG
TATCACCTTC ATTTAGCCTC ACTTTCTTTT GCTGAGGAAA TAAAAGATAG TTTTTATCCA
TCTTATCAAG AAGTGTTTTG TAGAAAGTAT TCACAAGTTA TCCAAATGAA GAATATTATA
CATCAAAAAG ATAATCTTAG CTATTTTGAT GAGATATTTT TGAATGTTCT AAGTAGATTT
GAAGATAGAT TTGTGGAAAG CATACATATG ATAAAAAAAA TTGAAGACTA CTTCAAAGAA
GAGAATCAAA AGGTATTAAT TCATCATGAT CCTGCTCATC ACAATTTTAT ATTTTCTGAA
AAAGGTGTAT ACCTTATCGA CTTCGATTAT GCGATGGTAG ATTATAGTGT ACATGACTTT
GCAAACCTTG GTGTGAGGGT TTTGAAAACA AATGATTGGG ACAGAAATAT GTTTAGAATT
TATTTAAAAT TTCTACAGGA TAAAAATATC TTAAATAAAT TCTGGTTGCA AACATTTTGG
ATTTTGATGT ATTTCCCGCA AGAGATTTGG CAAATTGGAC TTCAGTACTA TTTCGAAAAA
CAACCGTGGA CAGAAGAGTA TTTTCTCAAA AGACTTAAAG GATCAGAAAG AATACAAGAA
GAAAAGGAGA TGATAATTAA GGAATTTGCG GGAGGGATTT TTAAATGGCA TTGA
 
Protein sequence
MYIFMGTPEL KLVEENYYIK IDEIKQIKSN AYFVKTKDGK EFFLKVSRVA KDYVDFIIKI 
FSHLKNTSFK SHLIDFQKTI DGGFYFLDEN KKVYLLCKWI DGRSADFRNV YDLRRVVSIL
YHLHLASLSF AEEIKDSFYP SYQEVFCRKY SQVIQMKNII HQKDNLSYFD EIFLNVLSRF
EDRFVESIHM IKKIEDYFKE ENQKVLIHHD PAHHNFIFSE KGVYLIDFDY AMVDYSVHDF
ANLGVRVLKT NDWDRNMFRI YLKFLQDKNI LNKFWLQTFW ILMYFPQEIW QIGLQYYFEK
QPWTEEYFLK RLKGSERIQE EKEMIIKEFA GGIFKWH