Gene P9211_03591 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_03591 
Symbol 
ID5730610 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp336304 
End bp337560 
Gene Length1257 bp 
Protein Length418 aa 
Translation table11 
GC content36% 
IMG OID641284708 
ProductHD superfamily phosphohydrolase 
Protein accessionYP_001550244 
Protein GI159902900 
COG category[R] General function prediction only 
COG ID[COG1078] HD superfamily phosphohydrolases 
TIGRFAM ID[TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.603218 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.796242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAAATA GAACTTTTTA TGATCCCCTT CACAAGGGGA TAAGACTAGA TAGCAAAGTT 
CCCGAGGAAG GAATGGTAAT TAAATTAATT GATTCTGCAC CCTTCCAAAG GCTCAGAAGA
ATTAAACAAC TTGGTCCTGC CTACTTAACT TTTCATGGGG CAGAGTCAAG CCGATTTACT
CATTCTTTAG GTGTATTTCA TATAGCTCGT AGAGCACTAA AAAAACTAAT TGAATTAAAC
CCAAGCCTTA TAGATTTTAG AGGTTTGTTA TATGGTTCTG CTCTATTGCA TGATATTGGA
CATGGCCCCT TAAGTCATAC TAGTGAAGAA ATGTTTGGAA TGAAGCATGA GAATTGGACT
TCAAAATTAA TACGTGAACA TCCACAAATT AGCAATGCAT TAAATGAATT CAAGTCTGGG
CTAGGGGAAC AAGTTGCAAG CCTAATCGAT GGAAGCGAAA CACCCTGTAA AGTCATAAAG
ACTTTGGTTA GCAGTCAACT AGACTGCGAT AGACTCGATT ACTTAATGCG CGATAGCTAC
AGTAGTGGCG CAGCATATGG TCAACTAGAT TTAGAAAGAA TTTTGTCAGC TCTTACTTTG
TCTCCAGATG GGGATCTTGC GATAAATCCA AAAGGGTTGT TAGCCGTAGA GCATTATTTA
ATTGTCCGCA ATTTAATGTA TAGAAGTATC TATAATCATC GTTTAAATGA AGTTTGCAAT
TGGTTATTAG AGCAAATTAT TCGAATAGCA AAGGAGTTAG GGCCTAAGAG AGTTTGGGCT
GATAACATTA TGGGGAAATG GCTTTGGAGA AATTCAGAAA TAGATCTTTA TGATTTTCTA
GGGAACGATG ATAATCGAAC TTCATATCAT CTTTTACGTT GGAGCGAAAA TTCTCAAGAA
CCTCTAAATA TACTTTGCAG GAATTTTCTA AATAGAAACC TTTTAAAGGC AATAGATATA
GAAGATCTAA AGAAAGAGTC TCAATTAGAA GCCCTTGCTA TAGCAAGAAA ACTTTCTGAG
AAAGCCAGTA AAGACCCTGC TATCTATTGT GGACTCCGTC ATAATAAAAT TTTTGGTTAT
CATCCCTATA AGAGTGGTCT AAGATTATGG GATGGAAAGA ACCTTAAAGC GCTTGAACGA
GAATCATCCC TAGTAGAAAA TCTAATCAGT CCATCAGAGA CAGCTTGGCT AATCCACCCA
AAAGAAATTC ATAATGAACT CAAGCAAGAA CTTACTAAAA TAAGAGATAC TTACTAA
 
Protein sequence
MPNRTFYDPL HKGIRLDSKV PEEGMVIKLI DSAPFQRLRR IKQLGPAYLT FHGAESSRFT 
HSLGVFHIAR RALKKLIELN PSLIDFRGLL YGSALLHDIG HGPLSHTSEE MFGMKHENWT
SKLIREHPQI SNALNEFKSG LGEQVASLID GSETPCKVIK TLVSSQLDCD RLDYLMRDSY
SSGAAYGQLD LERILSALTL SPDGDLAINP KGLLAVEHYL IVRNLMYRSI YNHRLNEVCN
WLLEQIIRIA KELGPKRVWA DNIMGKWLWR NSEIDLYDFL GNDDNRTSYH LLRWSENSQE
PLNILCRNFL NRNLLKAIDI EDLKKESQLE ALAIARKLSE KASKDPAIYC GLRHNKIFGY
HPYKSGLRLW DGKNLKALER ESSLVENLIS PSETAWLIHP KEIHNELKQE LTKIRDTY