Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_03591 |
Symbol | |
ID | 5730610 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 336304 |
End bp | 337560 |
Gene Length | 1257 bp |
Protein Length | 418 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 641284708 |
Product | HD superfamily phosphohydrolase |
Protein accession | YP_001550244 |
Protein GI | 159902900 |
COG category | [R] General function prediction only |
COG ID | [COG1078] HD superfamily phosphohydrolases |
TIGRFAM ID | [TIGR01353] deoxyguanosinetriphosphate triphosphohydrolase, putative |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.603218 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.796242 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAAATA GAACTTTTTA TGATCCCCTT CACAAGGGGA TAAGACTAGA TAGCAAAGTT CCCGAGGAAG GAATGGTAAT TAAATTAATT GATTCTGCAC CCTTCCAAAG GCTCAGAAGA ATTAAACAAC TTGGTCCTGC CTACTTAACT TTTCATGGGG CAGAGTCAAG CCGATTTACT CATTCTTTAG GTGTATTTCA TATAGCTCGT AGAGCACTAA AAAAACTAAT TGAATTAAAC CCAAGCCTTA TAGATTTTAG AGGTTTGTTA TATGGTTCTG CTCTATTGCA TGATATTGGA CATGGCCCCT TAAGTCATAC TAGTGAAGAA ATGTTTGGAA TGAAGCATGA GAATTGGACT TCAAAATTAA TACGTGAACA TCCACAAATT AGCAATGCAT TAAATGAATT CAAGTCTGGG CTAGGGGAAC AAGTTGCAAG CCTAATCGAT GGAAGCGAAA CACCCTGTAA AGTCATAAAG ACTTTGGTTA GCAGTCAACT AGACTGCGAT AGACTCGATT ACTTAATGCG CGATAGCTAC AGTAGTGGCG CAGCATATGG TCAACTAGAT TTAGAAAGAA TTTTGTCAGC TCTTACTTTG TCTCCAGATG GGGATCTTGC GATAAATCCA AAAGGGTTGT TAGCCGTAGA GCATTATTTA ATTGTCCGCA ATTTAATGTA TAGAAGTATC TATAATCATC GTTTAAATGA AGTTTGCAAT TGGTTATTAG AGCAAATTAT TCGAATAGCA AAGGAGTTAG GGCCTAAGAG AGTTTGGGCT GATAACATTA TGGGGAAATG GCTTTGGAGA AATTCAGAAA TAGATCTTTA TGATTTTCTA GGGAACGATG ATAATCGAAC TTCATATCAT CTTTTACGTT GGAGCGAAAA TTCTCAAGAA CCTCTAAATA TACTTTGCAG GAATTTTCTA AATAGAAACC TTTTAAAGGC AATAGATATA GAAGATCTAA AGAAAGAGTC TCAATTAGAA GCCCTTGCTA TAGCAAGAAA ACTTTCTGAG AAAGCCAGTA AAGACCCTGC TATCTATTGT GGACTCCGTC ATAATAAAAT TTTTGGTTAT CATCCCTATA AGAGTGGTCT AAGATTATGG GATGGAAAGA ACCTTAAAGC GCTTGAACGA GAATCATCCC TAGTAGAAAA TCTAATCAGT CCATCAGAGA CAGCTTGGCT AATCCACCCA AAAGAAATTC ATAATGAACT CAAGCAAGAA CTTACTAAAA TAAGAGATAC TTACTAA
|
Protein sequence | MPNRTFYDPL HKGIRLDSKV PEEGMVIKLI DSAPFQRLRR IKQLGPAYLT FHGAESSRFT HSLGVFHIAR RALKKLIELN PSLIDFRGLL YGSALLHDIG HGPLSHTSEE MFGMKHENWT SKLIREHPQI SNALNEFKSG LGEQVASLID GSETPCKVIK TLVSSQLDCD RLDYLMRDSY SSGAAYGQLD LERILSALTL SPDGDLAINP KGLLAVEHYL IVRNLMYRSI YNHRLNEVCN WLLEQIIRIA KELGPKRVWA DNIMGKWLWR NSEIDLYDFL GNDDNRTSYH LLRWSENSQE PLNILCRNFL NRNLLKAIDI EDLKKESQLE ALAIARKLSE KASKDPAIYC GLRHNKIFGY HPYKSGLRLW DGKNLKALER ESSLVENLIS PSETAWLIHP KEIHNELKQE LTKIRDTY
|
| |