Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_22051 |
Symbol | hemK |
ID | 4778022 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1959454 |
End bp | 1960374 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087721 |
Product | putative protein methyltransferase |
Protein accession | YP_001018205 |
Protein GI | 124023898 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG2890] Methylase of polypeptide chain release factors |
TIGRFAM ID | [TIGR00536] HemK family putative methylases [TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCATTGC TGGAGATTTG CGCAGCAGAG CTATTGGCCT GGCGTCGTTT GCAGTTGGCT GAAGGAGGCC GTGCTGTTGA TTTTGACTGG TTGCTTGATT TGGGTGGAGG TTTGCGCTGG AGCGATCTTC AGCAGCTTTA TCTCGATCCA CGACGTTCAG TGCTGCTTGA GCGATCCCTT GATCAGCTGG CAATGATCTG GAAGCAACAT CTTGATCATC ACATTCCTCT TCAGCATTTG ATCGGTTGCT GCCCTTGGAG GGACGTTGAG CTGGAGGTCA GTGCGGCGGC ATTGATTCCC CGTCAGGAGA CTGAGCTGTT GGTGGATTTT GCTTTGCAAG CATTTGCTCG GAAACCTTTT GGTTGCTGGG CCGATTTGGG GACAGGTTCA GGAGCACTAG CTGTGGCATT GGCTCGCGCG CTCCCTGTTT GGCGTGGACA TGCAGTGGAT TGCAGCATCG AGGCCTTGGC TTTGGCGAAA CGGAACTTGC AAAGGCTTGC GCCCCATGCC TTATGGCAGC TGCATCAGGG CAGTTGGTGG GAGCCATTGC GGCCTTGGTG GGGAGAGTTC AGCTTGGTGC TGGTCAATCC TCCCTACATC CCAGAAGCTG TAATGGCTCA ACTTGAACCT GTGGTCCGTG ATCATGAGCC GCATTTGGCT CTATGCGGGG GAGCTGATGG GTTGGTGGCC ACACGTCAAA TCATTGTTGG TGCCATGCAG GCCTTAGAGC CTGGTGGTTG GCTGTTCTTG GAACACCACC ATGACCAGAG CGATGCCGTG CTGGCTTTGA TGCGTCAGCA GGGTTTGGAG AATGTGGAGT ACAAGTCAGA TTTACTGGGG GTAAGGCGCT TTGCCATTGC TCGGCATCCT GAACATCACG ACTCTTTGAA CCATGGCACT TCTGCCTTCG GCTGTTCTTG A
|
Protein sequence | MALLEICAAE LLAWRRLQLA EGGRAVDFDW LLDLGGGLRW SDLQQLYLDP RRSVLLERSL DQLAMIWKQH LDHHIPLQHL IGCCPWRDVE LEVSAAALIP RQETELLVDF ALQAFARKPF GCWADLGTGS GALAVALARA LPVWRGHAVD CSIEALALAK RNLQRLAPHA LWQLHQGSWW EPLRPWWGEF SLVLVNPPYI PEAVMAQLEP VVRDHEPHLA LCGGADGLVA TRQIIVGAMQ ALEPGGWLFL EHHHDQSDAV LALMRQQGLE NVEYKSDLLG VRRFAIARHP EHHDSLNHGT SAFGCS
|
| |