Gene Athe_0218 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_0218 
Symbol 
ID7407209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp265756 
End bp266832 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content37% 
IMG OID643714619 
Productstage II sporulation protein D 
Protein accessionYP_002572142 
Protein GI222528260 
COG category[D] Cell cycle control, cell division, chromosome partitioning 
COG ID[COG2385] Sporulation protein and related proteins 
TIGRFAM ID[TIGR02669] SpoIID/LytB domain
[TIGR02870] stage II sporulation protein D 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGAAAAAAG AGGTGAGTAA AAGAAGGTAT AATTTCTTTT GGCTTGACAA AACTTTGATG 
CTGCTTGCTC TTGTTGTGCT TTTGATTGCA AATATTGTTA CAAAACCAGA ACAAAATAAT
TACAAAAATA TGCAAAAAGA GAAAGACAAG GTTACACAAG AAAGTGGTAT TCAATCTACA
CCGACTTTGC CACAGAACAA AAATCAAAAA GAGGATATTT ATATAAATCT CCTAAGAAAA
AATAAGAACA GAATAGAAAG AATTTCGCTT GAAGAATATG TTATAGGCGT TGTAGCAGCA
GAAATGCCTG CCGAGTTTAA TTTGGAGGCT TTGAAAGCCC AGACAGTTGC TTCAAGAACA
TATGCTGCGC GAAAGATTGT AGGGAAAGCT TTGCACAAAG GATATGAAGA AAAAAAGGTC
TATCTTTGCG ATGACTTTTC CCACTGCCAG GCATACATTG ACAAGGATGA GATGAAGAGA
AGGTGGGGTA AAAACTTTGA AAAGTACTAT AAGAAGATAC GCATGGCTGT GGAGGAAACA
AAAGGACAAG TGTTGGTTTA CAAAGGTCAG GTTATAGATA GTCTGTTTCA TGCCGCATCA
GGTGGGAGAA CTGAGGATGC TAAAGAGGTG TTTAAAGAAG AAATTCCATA CTTAAAAAGT
GTTGTGAGCC GTGGCGAGGA AAGCTGTCCT AAGTTTTCTG GTGAGTTTTA TTTTACCTAT
AACGATTTGC TAAAAAGATT AAAAAAATAT TTTCCCGGAT TAAAAGTGAA CACACAAAAT
ATATCTTCTC AAATAAAGGT GGTTGAAAGG ACAGCCACAC AAAGAGTGAA AACTGTAAAA
ATTGGAAATA CAATTTTGAG TGGGAATCAG TTCAGGAGTA TCTTTGGACT GTATTCCACA
GAGTTTTGGC TTTACCCTGA CAGAAGGGGG TTGAGAATCC AAACAAGAGG ATATGGGCAT
GGGCTTGGGA TGAGCCAGTG GGGAGCAAAC CATCTTGCAC GACAGGGAAA AAATTATAAG
CAAATTCTTC TTTATTACTA TCAAAATGTC AAGATATGTA GGTTAAAATA TAAATAG
 
Protein sequence
MKKEVSKRRY NFFWLDKTLM LLALVVLLIA NIVTKPEQNN YKNMQKEKDK VTQESGIQST 
PTLPQNKNQK EDIYINLLRK NKNRIERISL EEYVIGVVAA EMPAEFNLEA LKAQTVASRT
YAARKIVGKA LHKGYEEKKV YLCDDFSHCQ AYIDKDEMKR RWGKNFEKYY KKIRMAVEET
KGQVLVYKGQ VIDSLFHAAS GGRTEDAKEV FKEEIPYLKS VVSRGEESCP KFSGEFYFTY
NDLLKRLKKY FPGLKVNTQN ISSQIKVVER TATQRVKTVK IGNTILSGNQ FRSIFGLYST
EFWLYPDRRG LRIQTRGYGH GLGMSQWGAN HLARQGKNYK QILLYYYQNV KICRLKYK