Gene P9303_22051 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_22051 
SymbolhemK 
ID4778022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp1959454 
End bp1960374 
Gene Length921 bp 
Protein Length306 aa 
Translation table11 
GC content55% 
IMG OID640087721 
Productputative protein methyltransferase 
Protein accessionYP_001018205 
Protein GI124023898 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG2890] Methylase of polypeptide chain release factors 
TIGRFAM ID[TIGR00536] HemK family putative methylases
[TIGR03534] protein-(glutamine-N5) methyltransferase, release factor-specific 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATTGC TGGAGATTTG CGCAGCAGAG CTATTGGCCT GGCGTCGTTT GCAGTTGGCT 
GAAGGAGGCC GTGCTGTTGA TTTTGACTGG TTGCTTGATT TGGGTGGAGG TTTGCGCTGG
AGCGATCTTC AGCAGCTTTA TCTCGATCCA CGACGTTCAG TGCTGCTTGA GCGATCCCTT
GATCAGCTGG CAATGATCTG GAAGCAACAT CTTGATCATC ACATTCCTCT TCAGCATTTG
ATCGGTTGCT GCCCTTGGAG GGACGTTGAG CTGGAGGTCA GTGCGGCGGC ATTGATTCCC
CGTCAGGAGA CTGAGCTGTT GGTGGATTTT GCTTTGCAAG CATTTGCTCG GAAACCTTTT
GGTTGCTGGG CCGATTTGGG GACAGGTTCA GGAGCACTAG CTGTGGCATT GGCTCGCGCG
CTCCCTGTTT GGCGTGGACA TGCAGTGGAT TGCAGCATCG AGGCCTTGGC TTTGGCGAAA
CGGAACTTGC AAAGGCTTGC GCCCCATGCC TTATGGCAGC TGCATCAGGG CAGTTGGTGG
GAGCCATTGC GGCCTTGGTG GGGAGAGTTC AGCTTGGTGC TGGTCAATCC TCCCTACATC
CCAGAAGCTG TAATGGCTCA ACTTGAACCT GTGGTCCGTG ATCATGAGCC GCATTTGGCT
CTATGCGGGG GAGCTGATGG GTTGGTGGCC ACACGTCAAA TCATTGTTGG TGCCATGCAG
GCCTTAGAGC CTGGTGGTTG GCTGTTCTTG GAACACCACC ATGACCAGAG CGATGCCGTG
CTGGCTTTGA TGCGTCAGCA GGGTTTGGAG AATGTGGAGT ACAAGTCAGA TTTACTGGGG
GTAAGGCGCT TTGCCATTGC TCGGCATCCT GAACATCACG ACTCTTTGAA CCATGGCACT
TCTGCCTTCG GCTGTTCTTG A
 
Protein sequence
MALLEICAAE LLAWRRLQLA EGGRAVDFDW LLDLGGGLRW SDLQQLYLDP RRSVLLERSL 
DQLAMIWKQH LDHHIPLQHL IGCCPWRDVE LEVSAAALIP RQETELLVDF ALQAFARKPF
GCWADLGTGS GALAVALARA LPVWRGHAVD CSIEALALAK RNLQRLAPHA LWQLHQGSWW
EPLRPWWGEF SLVLVNPPYI PEAVMAQLEP VVRDHEPHLA LCGGADGLVA TRQIIVGAMQ
ALEPGGWLFL EHHHDQSDAV LALMRQQGLE NVEYKSDLLG VRRFAIARHP EHHDSLNHGT
SAFGCS