Gene P9211_00591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_00591 
Symbol 
ID5731700 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp65307 
End bp66431 
Gene Length1125 bp 
Protein Length374 aa 
Translation table11 
GC content37% 
IMG OID641284401 
ProductRNA methylase family protein 
Protein accessionYP_001549944 
Protein GI159902600 
COG category[L] Replication, recombination and repair 
COG ID[COG0116] Predicted N6-adenine-specific DNA methylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.950833 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACTTG TAGCGATAAT TTCTCCAGGC CTTGAAGCTG AGGCAGCAAA AGAATTGTAT 
GAATTGGGTG CAGCAGAAAT TCAACCTTTA CCACGATGCG TAGAATTTCA AGTTGATTTG
CAATGTTTTT ATAGGATCCA TTTGAGAAGT AGATTACCCT TTCGCTTCTT AAGAGAAATA
GCCCGATTTC ATTGTAATAA TCCAGAATCT TTATATACAA ATGCACAGCA AGCTTTTGAT
TGGATTAGAT GGCTTCCTCC ATCCAAAACG TTTAAGGTTG ATGTCTCAGG AACTAGTTTT
GGCTTAACGC ATAGTCATTT CACAGCCTTA CAAGTAAAAA ATGCCATTAT TGATTTACAA
CGAAGTTCTT GTGGAAAAAG ATCCGATATA AGCGTTCAGG ATCCAGACAT ATGTATTCAT
TTACATTTGC ACAACAATCA GGCTGTTTTG AGTCTTGATT CATCCGCTCA TAGCTTGCAT
AGAAGAGGTT TTCGTCCAGC GATGGGAGTT GCACCTTTAA AAGAAAACCT TGCTGCTGGC
TTATTGCGTC TGACTAATTG GGACTTTTCT ATGCCCTTAG TAGATCCATT GTGTGGCTCT
GGGACTTTCT TAATCGAGGG AGCTGCACTA GCGCTTGGCT TAGCTCCAGG CTTACATCAA
AAGTTTCTTT TTACAAATTG GCCCGATTTT GATACTTCTT TATGGGAGCA GGAAAAGCAT
TTAGCTCAAG TTAGTCAATT ACCTAAGCAA CAGTTACCAA AAATTATCGG ATGTGAGAAA
AATAGTGAAA TAGCTAATCA GGCAAAAAGT AATATAATTG AGTCAGGTTT AGGCTTAGAA
ATAAAGATTC AGAATAGTCA TTTTTTTGAT CTTGAATTGC CTAATGATAA AGGACTGATT
GTTTGTAATC CCCCTTATGG AAAAAGATCT GGAAAAGAAG AAGATTTGGA AACTTTATAT
AATGAACTTG GCTCTTTTTG TAAAAAGAAA GCTTCTGGAT GGAATCTATG GCTACTTAAT
GGTAATCCAA ACTTAAGTAA GTTTCTTAGA TTAAAAGCTA AAAGGCGTAT ACCAGTCAGT
AACGGAGGTA TAGATTGTCG ATGGCTGCAT TATGAGATTA ATTGA
 
Protein sequence
MKLVAIISPG LEAEAAKELY ELGAAEIQPL PRCVEFQVDL QCFYRIHLRS RLPFRFLREI 
ARFHCNNPES LYTNAQQAFD WIRWLPPSKT FKVDVSGTSF GLTHSHFTAL QVKNAIIDLQ
RSSCGKRSDI SVQDPDICIH LHLHNNQAVL SLDSSAHSLH RRGFRPAMGV APLKENLAAG
LLRLTNWDFS MPLVDPLCGS GTFLIEGAAL ALGLAPGLHQ KFLFTNWPDF DTSLWEQEKH
LAQVSQLPKQ QLPKIIGCEK NSEIANQAKS NIIESGLGLE IKIQNSHFFD LELPNDKGLI
VCNPPYGKRS GKEEDLETLY NELGSFCKKK ASGWNLWLLN GNPNLSKFLR LKAKRRIPVS
NGGIDCRWLH YEIN