Gene Athe_1219 details


Gene Information

Locus tag: Athe_1219
Symbol:
ID: 7409693
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Anaerocellum thermophilum DSM 6725
Kingdom: Bacteria
Replicon accession: NC_012034
Strand:
Start bp: 1309271
End bp: 1310638
Gene Length: 1368 bp
Protein Length: 455 aa
Translation table: 11
GC content: 36%
IMG OID: 643715584
Product: RNA methylase, NOL1/NOP2/sun family
Protein accession: YP_002573092
Protein GI: 222529210
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00446] NOL1/NOP2/sun family putative RNA methylase
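
The coordinate and length fields in this record are related by simple arithmetic. The following is a minimal Python sketch of that relationship. It assumes the start and end positions are 1-based and inclusive (an assumption, although it is consistent with the values listed) and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # the stop codon encodes no residue


# Values taken from the record for Athe_1219.
length_bp = gene_length(1309271, 1310638)
print(length_bp)                  # 1368
print(protein_length(length_bp))  # 455
```

Both results agree with the Gene Length and Protein Length fields, which supports the inclusive-coordinate assumption for this record.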


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAATTTGC CAGAAGAATT TTTGTCAAAA ATGAGAGAGA TTTTAAATGA TGAATTTGAC 
CAGTTTATAA AAATATATGA CTTTGACAGT TATAAAGGTT TTAGGGTCAA CACTGCCAAA
GTTTCAGTCA AAGAGTTTAT AGATAAAATG GGAATTGAAT TTGAAAGAAT TCCATGGTGT
AAAGATGGTT TTTTCTACAC TGAAGAGTTA AGACTGAGCA AACACCCATA CTATTTTGCT
GGGCTTGTAT ATATTCAGGA ACCATCAGCC ATGTTTCCGG TTGAGGCTTT GGATGTAAAG
GAAGGCGAAA AAGTTTTGGA CTTGTGTGCT GCACCTGGGG GAAAGAGCAT TCAGATAGCA
GCAAGACTTG GTCAAAATGG ATTGCTTATA TCCAATGATG TAAAACCATC AAGAATCAAG
GCGCTTGTAA AGAATGTTGA AAATCTTGGG CTTACAAATG TTGTCATTCT GAACAACAAA
CCAAAAGAGA TAGCGGAAAG CTATGGTGCA TATTTTGACA AGATTTTAGT TGACGCACCT
TGTTCTGGTG AGGGAATGTT TCGCAAAGAC CCAACGGCAG CCAAAAAGTG GACTTCCAAT
CATCCTGAAA AGTATGTCAA TCTGCAGAGA AGTATAATGA CAGAGGTGGA TGAACTTTTG
AAAGTGGGTG GTGAGATAGT ATATTCCACC TGTACGTTTG AAGTAGAAGA GAATGAAGGA
ATTATTGACT GGTTCTTAAA AAAACATAAA AACTATGAGG TTGTTGAGAT AAAAAAATAT
GAGGGTTTTT CGGATGGAAT TGAGATAAAT GGCAATGAAA ATTTGAAAAA AGCGGTGAGA
ATTTACCCGC ATAGGGTCAG AGGCGAGGGA CATTTTATTT GCAAGTTAAG AAAGGTGCGA
GAAAGTGGAT TTGAGTGGAC TTTTCAGCCA CAAAGATTAG AGGTGGACAG TGAGGATTTA
AAAATTTTCG AAAAGTTTTG TAATAAATAC TTGAACATAG ACTTAAGTAA CTTTAAAGAT
AGGGTGTTTT ATAAAAAAGC AAACAAACTG TATTTGGGTT TTGACGGACC TTTTGATAAA
ATAACACCGC TTCGAAATGG TCTTTTGCTT GGAGAAGTTT ATAAAGGAAG ATTTTATCCT
TCTGCTCATT TGATTGCGAG TTTAAAGTGC GAAAATCTCA AGGTTGCTAT TAACTTTTCT
CAAGAAGATG AAAGGTTGTG GAGGTATTTA AAAGGCGAGA CGATAGAGAA CAAAGAAAAT
CTGAATGGAT TTGTAGGGAT ATGTGTAGAT GGCTTTACTC TTGGTTGGGG TAAAGCAGAG
GGACATATAA TAAAAAATTA TTTCCCAAAA GGATGGAGAT TAGAATAA
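
The GC content reported under Gene Information can be recomputed from the gene sequence. The sketch below assumes GC content means the percentage of G and C bases among all bases, and that the spaces and line breaks in the listing are only display grouping. `clean_sequence` and `gc_percent` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a DNA sequence."""
    dna = clean_sequence(raw)
    if not dna:
        raise ValueError("empty sequence")
    gc_count = dna.count("G") + dna.count("C")
    return 100.0 * gc_count / len(dna)


# First row of the gene sequence; paste the full listing to check the record.
first_row = "TTGAATTTGC CAGAAGAATT TTTGTCAAAA ATGAGAGAGA TTTTAAATGA TGAATTTGAC"
print(len(clean_sequence(first_row)))  # 60
print(round(gc_percent(first_row), 1))
```

Running `gc_percent` on the complete 1368 bp listing and rounding to a whole number should give a value comparable to the GC content field, provided the database uses the same definition.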
 
Protein sequence
MNLPEEFLSK MREILNDEFD QFIKIYDFDS YKGFRVNTAK VSVKEFIDKM GIEFERIPWC 
KDGFFYTEEL RLSKHPYYFA GLVYIQEPSA MFPVEALDVK EGEKVLDLCA APGGKSIQIA
ARLGQNGLLI SNDVKPSRIK ALVKNVENLG LTNVVILNNK PKEIAESYGA YFDKILVDAP
CSGEGMFRKD PTAAKKWTSN HPEKYVNLQR SIMTEVDELL KVGGEIVYST CTFEVEENEG
IIDWFLKKHK NYEVVEIKKY EGFSDGIEIN GNENLKKAVR IYPHRVRGEG HFICKLRKVR
ESGFEWTFQP QRLEVDSEDL KIFEKFCNKY LNIDLSNFKD RVFYKKANKL YLGFDGPFDK
ITPLRNGLLL GEVYKGRFYP SAHLIASLKC ENLKVAINFS QEDERLWRYL KGETIENKEN
LNGFVGICVD GFTLGWGKAE GHIIKNYFPK GWRLE
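
The protein sequence follows from the gene sequence by codon translation. The sketch below assumes that the Translation table value of 11 refers to the NCBI bacterial, archaeal and plant plastid code, whose codon assignments match the standard code and which allows alternative start codons such as TTG. The gene here begins with TTG, which is read as methionine at the initiation position. The helper names are illustrative.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for NCBI table 11 (assumed to apply here).
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(raw: str) -> str:
    """Translate a complete CDS; whitespace in the listing is ignored."""
    dna = "".join(raw.split()).upper()
    if len(dna) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # an initiator codon is read as methionine
    protein = "".join(residues)
    # Drop a single terminal stop; internal stops would remain visible as "*".
    return protein[:-1] if protein.endswith("*") else protein


# First row of the gene sequence from this record.
first_row = "TTGAATTTGC CAGAAGAATT TTTGTCAAAA ATGAGAGAGA TTTTAAATGA TGAATTTGAC"
print(translate_cds(first_row))  # MNLPEEFLSKMREILNDEFD
```

The printed residues match the first two 10-residue groups of the protein sequence, including the leading M that comes from the TTG start codon.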