Gene Athe_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1042 
Symbol 
ID7409599 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1136087 
End bp1137307 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content32% 
IMG OID643715408 
ProductFmu (Sun) domain protein 
Protein accessionYP_002572916 
Protein GI222529034 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0144] tRNA and rRNA cytosine-C5-methylases 
TIGRFAM ID[TIGR00563] ribosomal RNA small subunit methyltransferase RsmB 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAGAAAT TCCATCAAAA GTTAAAAAAT GAAAAGGACA GGGCTTTGTT TGTTGAGCTT 
GTTCATGGTG TTTTGAGATA CAAAAGCCTT ATTGACTACT ATATTAATTT TGTTGCTAAA
AAAGGAGTAA AGGATAAAAG GATATTAAAC ATTTTAAGGG TTGCCACATA TGAACTTCTT
TTTCTTGAAA AGATTCCAGA GTATGCAACA GTAAATGAGG CATGTGAGGT TGCAAGCAAA
ATAAATCCAC ATTTAAAGGC TTTTGTGAAT GCAATTTTGA GAAATATAAT CAGAAACAAA
AATCAAATAG AAGAGTCATT GGAAAGAATT AAGGACGTGG ACTATAAAAG TTATTTATCA
ATAAAGCTTT CTTATCCCAG ATTTTTAATA GATTATTTAG AAGAGAGTTA TGGACTTGAA
AAAACTATAA AAATTTTAGA ATTTTTAAAT ACAAAACCTC CTCAGAGTAT AAAGATAAAT
ACTAAAAAAA CCAATGTAAA TACATTAACA CAAGAGCTTG AGAAAAACGG ATTTAAGTAT
GAGATTAATT CTCGTAACAA TGAAATAGTC CTTATTTTGA AGGGCAACAT AAAGGAAACA
GAACTTTATA AGGAAGGCTA TTTCTATTTT CAGGATTTGG CATCTTCTCT TGTTGTAAAG
TTTAACCAAG AAGATTTTAA AAGAGCAAAG AAAGTGATAG ACCTGTGTGC CGCACCAGGT
GGAAAGACTT TTAACTGCGC AGAGGTTATA GATGGGTTTG TTGTTGCATG TGATATAAAC
GAACATAAGC TTGATATATT GCGAGAAAAC ATTTTGCGGC TTGGTTTTGA TAATATCATT
GTTGCAAAAA GTAACGCTGA GGTTTTTAAC CCTGATTTTG CCGAAAAATT TGACATTGTG
ATTGCCGACC TTCCATGTAC TGGTTTTGGC GCAATTAGGA AAAAGCCTGA TATCAAATGG
AATAAAAGTT ATCAGGACAT TGAGAATCTT CATGAACTGC AGGTAAGAAT ACTTGACAAT
TCAGCAGGTT ACCTAAAAAG AGGAGGAATA CTTTTTTATT CTACATGTAC GCTTGGGAAA
AAAGAAAATG AAGAAACAGT TATAGAGTTT TTAGAGAAGC ACAAAGATTT TTCGTTGGTA
TCCCTAACTA CTATTTTTCC CGATGAGTTT GAATGTGATG GATTTTTTAT AGCTAAACTT
AGAAAAGAGG GCGAAAGATA G
 
Protein sequence
MEKFHQKLKN EKDRALFVEL VHGVLRYKSL IDYYINFVAK KGVKDKRILN ILRVATYELL 
FLEKIPEYAT VNEACEVASK INPHLKAFVN AILRNIIRNK NQIEESLERI KDVDYKSYLS
IKLSYPRFLI DYLEESYGLE KTIKILEFLN TKPPQSIKIN TKKTNVNTLT QELEKNGFKY
EINSRNNEIV LILKGNIKET ELYKEGYFYF QDLASSLVVK FNQEDFKRAK KVIDLCAAPG
GKTFNCAEVI DGFVVACDIN EHKLDILREN ILRLGFDNII VAKSNAEVFN PDFAEKFDIV
IADLPCTGFG AIRKKPDIKW NKSYQDIENL HELQVRILDN SAGYLKRGGI LFYSTCTLGK
KENEETVIEF LEKHKDFSLV SLTTIFPDEF ECDGFFIAKL RKEGER