Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Cpha266_1875 |
Symbol | |
ID | 4571217 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | - |
Start bp | 2172627 |
End bp | 2173532 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 639766457 |
Product | HemK family modification methylase |
Protein accession | YP_912315 |
Protein GI | 119357671 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.210428 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTGAAC AGGGAAGAGG GACAAAAGAG TGGCAGGTTG TCGAACTGTT GAAAACCACC ACGGCTTTTT TTGTGCAGAA ACAGGTTGAC GAGGCCCGGA TCAGCGCAGA GCTTCTGCTC GCCTCTGTTC TCGGTCTTGA CAGGCTTGGC CTCTATCTTA ATCACAATCG CCCTGTTTAT CCGGGCGAAC TCGAAGCTTT CAGGGCACTT TGTCGTCAGC GTCTTGAGGG TAAGCCGGTG CAGTATATAA CCGGTGAACA GTTTTTTTAC GGATTGCCTT TTTTTGTCGA TAAGCGGGTT CTGATTCCAC GTCCCGAAAC AGAGCTGCTT GTTGAACACG CACTTGAGTT TCTTGGTCAC GTTTCTGCAG CAGACGTTTC AGAAGCTGCT CTGCATCTGC TGGATATTGG TACAGGGAGT GGCTGTATTG CTGTTACTCT TGCCAGCAGG TTGCCCTGCC TGATGGTTAC AGCCATCGAT ATTTCTACGG AAGCACTCGT TGTCGCCCGC AATAATGCTG AAAGGCATGG TGTTGCAGAT CGGATACGTT TTCTGCATGC CGACCTGTTC TCTCTCCCGG ACGAAAGAGG GCTGTCTGCT CCTTTTGATG TTATTGTCTC CAATCCGCCG TACATTGCTG AAGATGAGTG GGCTGGTCTG CAGCCGGAAG TCCGACTTTT CGAGCCACAG CTTGCGCTGA CCACCAGAGA TGGGATTGAG TGCTATCATG CGGTGGCAGA AGTCGCGCCC TCTCTGTTGA AATCAGGAGG GATGCTCTGT TTTGAATCCC ATGCTGATGC GGCTTTGAAG GTTGCCGGGA TCATGGAGCG TTGGGGGTTC TCATCGGTTG CGGTGATGAA AGATTACTCG GGACTTGACA GGGTCGTTTC GGGAAAGATC GGTTGA
|
Protein sequence | MIEQGRGTKE WQVVELLKTT TAFFVQKQVD EARISAELLL ASVLGLDRLG LYLNHNRPVY PGELEAFRAL CRQRLEGKPV QYITGEQFFY GLPFFVDKRV LIPRPETELL VEHALEFLGH VSAADVSEAA LHLLDIGTGS GCIAVTLASR LPCLMVTAID ISTEALVVAR NNAERHGVAD RIRFLHADLF SLPDERGLSA PFDVIVSNPP YIAEDEWAGL QPEVRLFEPQ LALTTRDGIE CYHAVAEVAP SLLKSGGMLC FESHADAALK VAGIMERWGF SSVAVMKDYS GLDRVVSGKI G
|
| |