Gene Athe_1043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAthe_1043 
Symbol 
ID7409600 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAnaerocellum thermophilum DSM 6725 
KingdomBacteria 
Replicon accessionNC_012034 
Strand
Start bp1137313 
End bp1138338 
Gene Length1026 bp 
Protein Length341 aa 
Translation table11 
GC content36% 
IMG OID643715409 
Productribosomal RNA large subunit methyltransferase N 
Protein accessionYP_002572917 
Protein GI222529035 
COG category[R] General function prediction only 
COG ID[COG0820] Predicted Fe-S-cluster redox enzyme 
TIGRFAM ID[TIGR00048] radical SAM enzyme, Cfr family 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAGGC TTATAAAAGA TTTGACGTTT GATGAGCTAA AAAAGTGGCT CGAAAATATT 
GGTGAAAAAC CTTTTAGAAC AAGCCAGATT TTTGAGTGGC TTTACAAGAA AAATGCTACT
GATGTAATGC AGTTTACCAA TCTACCACTC GAACTTCGAG AAAAGATTGA GGATGAGTTT
TTGATAAACT CTTTACAGAT TTTGAAACAT CAAAGTGATG GAGAGAGTAT AAAATTCCTG
TTTGAACTTT GCGATAAAAA TGGAGTTGAA AGTGTGTTTT TACCTTATCG GTATGGGAAT
GCAATATGCG TCTCAACACA AGTTGGATGC AAAATGAACT GCAGGTTTTG TGCCTCTGCC
ATAGGCGGAT TTGTAAGAAA CCTTTCGGCA GGGGAGATGG TTGACCAGAT AATCAACGTA
GAAAACTTTA CAGGCAAAAG AATAACAAAT GTGGTTCTGA TGGGAAGTGG CGAGCCATTT
GACAACATTG AAAATGTGTT TAAATTTATT GAGATAATAA ACTCAAAAGA GGGGAAAAAC
ATAGGGGCAA GGCATATCAC CATTTCCACA GTTGGCATAG TTGAAGGAAT TTATAGGCTC
TGTGATTTTC CAAAACAAGT AAACCTTGCA ATATCTCTGC ATGCCCCAAA TAATAGCCTG
AGAGACAAGC TTGTTCCGAT AAACAAAAAG TATCCTGTTG AAGATATTAT GAAAGCAGTT
GATTACTACA TTAAAAGGAC TAATAGAAGA GTTACTTTTG AGTACGCCCT GATAGATGGG
GTAAATGATT CTATTGAATG TGCTCAAGAG CTTGGCAAGA TGCTAAAAGG TAAGCTTGTA
CATGTAAATT TGATACCTGT TAACCCAGTT GAAGAAAAAG GGTTTAGAAG ACCTTCAAAA
GAAAAAATAA AAGTATTTTT TGAAACCTTA AAATCATATC AAATTAATGT TACAATTAGA
AGAGAGCTTG GCAGCAGTAT ATCTGCAGCG TGTGGACAGC TGAGAAAACG ATATTTTAAC
ATATAA
 
Protein sequence
MKRLIKDLTF DELKKWLENI GEKPFRTSQI FEWLYKKNAT DVMQFTNLPL ELREKIEDEF 
LINSLQILKH QSDGESIKFL FELCDKNGVE SVFLPYRYGN AICVSTQVGC KMNCRFCASA
IGGFVRNLSA GEMVDQIINV ENFTGKRITN VVLMGSGEPF DNIENVFKFI EIINSKEGKN
IGARHITIST VGIVEGIYRL CDFPKQVNLA ISLHAPNNSL RDKLVPINKK YPVEDIMKAV
DYYIKRTNRR VTFEYALIDG VNDSIECAQE LGKMLKGKLV HVNLIPVNPV EEKGFRRPSK
EKIKVFFETL KSYQINVTIR RELGSSISAA CGQLRKRYFN I