Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_3321 |
Symbol | |
ID | 7979220 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 3343775 |
End bp | 3344641 |
Gene Length | 867 bp |
Protein Length | 288 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 644800088 |
Product | modification methylase, HemK family |
Protein accession | YP_002951227 |
Protein GI | 239828603 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.000192098 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCTATA AAATTTACGA AGTCCTCACA TGGGCTTCTT CTTTTTTAAA ACAACATGGC AAAGAAGAGC GGGCCGCGGA ACTATTGCTT TGCCACCATT TGCAAATCAC AAGAGCGCAG CTGTTTGCCC GGCTGCGCGA TCCAATCGAT GAAAACGTGC GCCAATCATT TGAAGCAGAT GTTCGTAAGC ATGTGTATGA ACATGTTCCG GTGCAGCATC TGATCGGTTT GGAACAATTT TACGGCCGGC CGTTTCTCGT CAATCGCAAT GTGCTGATCC CCCGCCCGGA AACAGAGGAA TTAGTGGAAG GCGTGCTAAC CAGAATCACA CAATTATTCC CGGGAAATAA AACGATCGAT GTGGTCGACG TCGGGACAGG AAGCGGGGCG ATTGCGATTA CGCTGGCATT AGAAAACAAA TCGCTTCGCG TTGCGGCCAT CGATATTGCT CCCGAGGCTC TGGAAGTGGC AAAGCGAAAT GCGGAACGTC TTGGAGCGGA TGTTGCATTC ATTTGCGGCG ACTTATTGCA GCCTCTTGTA GAGGCCAGTC GCAAAGTAGA CGTTGTCGTT TCCAATCCGC CATATATTCC GGAGAATGAA ATTGCTTCGC TTTCCCCTGT CGTAAAAGAT TATGAGCCTT TACGGGCGCT TTCCGGCGGC AAAGACGGCC TTGATTTTTA CCGCCGTTTC GCCCGCGAGC TGCCATTCGT ATTAAAAGAA CGCGCGCTTG TCGCGTTCGA AGTAGGAGCT GGGCAGGGAG AGGCGGTCGC GGCAATATTG CGGCAGACGT TTCCACAAGC AGAGGTGGAA GTTGTTTTCG ACATTAACGG AAAAGACCGA ATGGTATATG CATCATTAGG AAAATAA
|
Protein sequence | MSYKIYEVLT WASSFLKQHG KEERAAELLL CHHLQITRAQ LFARLRDPID ENVRQSFEAD VRKHVYEHVP VQHLIGLEQF YGRPFLVNRN VLIPRPETEE LVEGVLTRIT QLFPGNKTID VVDVGTGSGA IAITLALENK SLRVAAIDIA PEALEVAKRN AERLGADVAF ICGDLLQPLV EASRKVDVVV SNPPYIPENE IASLSPVVKD YEPLRALSGG KDGLDFYRRF ARELPFVLKE RALVAFEVGA GQGEAVAAIL RQTFPQAEVE VVFDINGKDR MVYASLGK
|
| |