Gene GWCH70_1591 details


Gene Information

Locus tag: GWCH70_1591
Symbol:
ID: 7976241
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 1661811
End bp: 1663175
Gene Length: 1365 bp
Protein Length: 454 aa
Translation table: 11
GC content: 46%
IMG OID: 644798480
Product: RNA methylase, NOL1/NOP2/sun family
Protein accession: YP_002949652
Protein GI: 239827028
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0144] tRNA and rRNA cytosine-C5-methylases
TIGRFAM ID: [TIGR00446] NOL1/NOP2/sun family putative RNA methylase
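
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1661811
end_bp = 1663175
gene_length_bp = 1365
protein_length_aa = 454


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(length_bp: int) -> int:
    """Amino-acid codons in a CDS, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


# Both checks print True for this record.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_codons(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 1663175 - 1661811 + 1 = 1365 bp, and 1365 / 3 - 1 = 454 aa, which agrees with the listed gene and protein lengths.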


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value: 0.558771
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TTGAAATTAC CAAGCGAATT CATCGAAAAA ATGGAGAGGC TTTTGGAAGA CGAAGCTTCT 
CGTTTTTTTT CCACCTATCA TGAAGAAAAA GCAAACGGTT TGCGGTTCAA TCCATTGAAA
ATCGACCGCG AGACGTTTTT AACGCTCGTC CCGTTTGCAC TTTCCCCTGT GCCGTTTTGC
CCAACTGGCT TTTATTATGA CGCACACGAA CAACCTGGAA AGCATCCATA TCATGCGGCA
GGGCTCTATT ATATCCAAGA GCCGAGTGCG ATGTTTGTAG CGGAAGTGTT AAAGCCAAAT
CCAGGGGAGT TTGTTCTTGA CCTTTGTGCC GCACCTGGCG GAAAAACGAC TCAGCTTGCG
GCAATGATGA AAAATCAAGG GCTGATTATC GCCAATGAAA TTCATCCGAA ACGCGTCAAA
GCACTATCGG AAAATATCGA GCGGTTTGGC ATTACGAATG CGCTTGTTAC GAATGAAACA
CCGGAAAAGC TCGCAAAATA TTTCCCTGGT TTTTTTGACA AAATTTTAGT AGACGCTCCG
TGTTCGGGGG AAGGCATGTT TCGAAAAGAC GAAGAAGCCG TGCAATTTTG GAGCCAAGCA
CACGTCGAAC AATGCGCCAT CAAACAACGG CATATTTTAG ACTGTGCATA CGAGATGTTA
AAAGAAGGCG GCATTCTCGT CTATTCCACT TGCACGTTTT CTCCGGAAGA AAACGAACAG
ACCATAGAAG CTTTTTTACA AACCTATGAT GATCTTGAAT TGCTGTCGAT TGAAAAAGTT
CATGGCATTC AGCCGGGAAG ACGGGAATGG ACGAACACGA ACTTCGAGGA AATGGAGAGA
ACGGCTCGGC TATGGCCGCA TTCGTTAAAA GGGGAAGGCC ATTTTGTCGC GAAAATAAAA
AAAACAGGCC CGTCCCCTTC ATGGAATGGA CGCTATGCCA AGCCAAACGC CTCCAAACAA
ATGGTTCGCG AGTATCGGCA GTTTGAACAA GAAGTATTGC AAACAGAAAT CGAAAAACCG
ATGTATGCCT TTCAACACCA TCTATTCGCC CTACCTGACC ACTGCCCGAA TTTCGATGGC
CTGAAAGTCG TGCGGGCAGG TCTTCACTTA GGAGAAGCGA AAAAGCAGCG GTTTGAGCCG
AACCATGCAC TTGCCTTATC ACTAAAGCCG CAAGACGTTC GTTACTCCCT TGACTTGTCA
AGCGACAGCG TAGAATGTCT AAAATATTTG CGCGGAGAAA CGATTCAGAC GGGAGAAGAC
CGCGGCTGGC TGCTTGTGAC CGTTGATGGT TATCCGCTCG GGTGGGGAAA AGAAGTAAAA
GGTATGGTGA AAAACTTTTA TCCGAAAGGA CTGCGAATCA ACTAA
 
Protein sequence
MKLPSEFIEK MERLLEDEAS RFFSTYHEEK ANGLRFNPLK IDRETFLTLV PFALSPVPFC 
PTGFYYDAHE QPGKHPYHAA GLYYIQEPSA MFVAEVLKPN PGEFVLDLCA APGGKTTQLA
AMMKNQGLII ANEIHPKRVK ALSENIERFG ITNALVTNET PEKLAKYFPG FFDKILVDAP
CSGEGMFRKD EEAVQFWSQA HVEQCAIKQR HILDCAYEML KEGGILVYST CTFSPEENEQ
TIEAFLQTYD DLELLSIEKV HGIQPGRREW TNTNFEEMER TARLWPHSLK GEGHFVAKIK
KTGPSPSWNG RYAKPNASKQ MVREYRQFEQ EVLQTEIEKP MYAFQHHLFA LPDHCPNFDG
LKVVRAGLHL GEAKKQRFEP NHALALSLKP QDVRYSLDLS SDSVECLKYL RGETIQTGED
RGWLLVTVDG YPLGWGKEVK GMVKNFYPKG LRIN
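
The gene sequence starts with TTG while the protein starts with M, which is consistent with TTG serving as an alternative initiation codon under translation table 11. The sketch below illustrates this on the first 60-base row of the gene sequence. It is a minimal illustration, not the pipeline that produced this record: the codon table is the standard one, the `START_CODONS` set is an assumed subset of the table 11 initiation codons, and `translate_cds` is a hypothetical helper name.

```python
from itertools import product

# Standard codon table in TCAG order (last base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a subset of the initiation codons used with translation table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS; an initiation codon in first position is read as M."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    usable = len(seq) - len(seq) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":           # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First row of the gene sequence in this record.
first_row = "TTGAAATTAC CAAGCGAATT CATCGAAAAA ATGGAGAGGC TTTTGGAAGA CGAAGCTTCT"
print(translate_cds(first_row))  # MKLPSEFIEKMERLLEDEAS
```

The output matches the first 20 residues of the protein sequence listed above (MKLPSEFIEK MERLLEDEAS).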