Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_25851 |
Symbol | |
ID | 4776183 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2279612 |
End bp | 2282977 |
Gene Length | 3366 bp |
Protein Length | 1121 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640088106 |
Product | hypothetical protein |
Protein accession | YP_001018581 |
Protein GI | 124024274 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATC CGCTGTTAAT GTATGAAATG ATCACCAATC CATTTAACTT TTTTGGGGAT TTTGCAGGTG CACACGACGG CAATATAGAT TTCAAAGAAG AATGGAAACA ATCAGACTTT GACCTCAACG AAGGAAACTC GATATGGAAT AAATATACCA ACAATGACCT GATCAATTTA CTAGCAGACA TTCAAAGTCT GGGTTCCAAT GAATTCACAC CAATAGCCGG CAATTTTGGA ATTCAAAAAC ATCTAACAAC TTCAACTTAT CAGATAACAG ACGCGACCGA TGTGATAGAC AAAAACTCAC CCATCAACAG CAATTATTCC GCCTCGCAGG TGGGGGATCA ACCATCCAGA GGATACAGAG GGAACCCTGC CGGAATAGCC CAAGAGGGAG TATGGGGAGT GCTCTGTGAC GATATCTGGA GTCAAATCAT TGCAACTGCA AACTCACTTC AAACAGAGGA AGACAAGCAA AAGCTGATCA ACACCTATGC ATTCAATAAA GAAGATGGCA AACAATACAA TATCATCATT CCTGACTTGC GGAAGGGGGT AACATTCGGA AACAAAGATA TTATCACGGA TCAATATGTA ATTTCAAACC CAATAAGCTT TTCAAACCTA CGATCCAACG TAAATGCGAG GGCAAGAGCT TTTGATACCG TTAATCATTC CGATCATTTC ATAAAACAAC AACCTGAACC GCCAGAGACC GAACTCCGCG ATAACTGGAA AAAAGGATTC CTGGCAGGAC ATGGCCACCT CGGCTTACCC GGGGTAAACC AAGACGGCAA TTTAAGTCAA GGAATAGAAT GGAGAAAAGT TATGTTTCGT GGAGAGGGAG ACAATATTAT CTCAACACGA TATTTTTATC CGATAGGCAA CGGATTTGAT AGCGCGATAC ATCAAAAAAA CAATATAAAT ACTGGGGGTG GTTCAGATTA TGTCATTTAC GATAATTCGC AGCATGAGGT CAAGCTTGGT GATGGTGACG ACTTAGCCTT TCCCTCTATT AAAGCTTTCG CTCCTTCCAT TGGCTTCGGG CAGCACGCCC AAAGTAAGTT AGATAGGAGT CCCAAGTACA ACTGGGATCC AGTACGTTAC AAAGACCATT GGAACTGGTT CGATGGGAGT GTTTTCTATG AATCCGTTGA CCTGGGATGG CCTTTTTACG ACTCAGGAAC AAACCCAAAC AGAAAAGGGT TGATAAAATA TCAAAATAAT CGCTTATTGC CACAACCAGA TCAAAATAAA TTGGCGCCAA CGGTTATTGA AAACCCTCTG ACTAGCGATC CAATAAAAAG CATTGATAAT GGTTGGTCAA AGGAGAAAAA TGTGTGGTAC TACAGCGATA GAGCACAGAT CCATGAAGGT GTTCAGCCAA GACAGGCGAT CGAGATTGGT GGCCAAAAAG TTTATGGTGG TAAAGGCCAC GACACATTGC ACGGCTTTGA CCCACTGATA TACGCCTCAG AAGAAGCAAA AGAATATAAT TACGTGCAAA AAATGAGAGA TAATCCTTGG AAAGACGGAA GGCCATTGCC GCATGTAATT GAAAATAAAC TCAACTTTTT GGGAGACAAA GATATCGATT TCAATTGGGA TCCCATTCTC TTATCCGGAG GAGAAGGATC AGACAGAATT AATCTGGGCG ATCTCAAACG CATTAATCTT GGCAATGGTC AAATCATCGA CAACAACACC GCCGGCACAT TATATTTGGT TTTTGGTGAC AAAGAAAAAT CTGCTGAATG CACGGAAATC GCGAGAAAGA GTAAGAAGTG GGCTGACAAT ATGAGTCCTG ATGTCTTTTC TCTCGATGCT TCCTATGATT TCAGAGAAGA AATCATTGTC GAAGGATTGA ATATTGACAA CCGCATAGCT GGAGATGATC CAAAATCAGA TTGGACGACA CAGGCTGCGA CAGTTCAAAA ATCAGTGACG GCAGCCGCTT TGACTGCTGC AGCCTATCTT GAAACGGCAT TTCCGGTTAT TGGAGCAGCT TCGGCAATAG CTGCCGTAGG AATAGATATT GCCAAACAAT TACAACAACA CGATTCGTCT GCCTCACAAA GTGAAGCCAC GGACTTTTAC GAACGAGATG AGGTGAAGGA AAAAATTGTT CCCTTGGGTT CTTGGACTAA AGCCGTCACC ATTCCTGATT TTGATGCAAG CGACAATATA ACAATCAATT TGATTCCAAT TGAGGATCCT AGCGTTCGTC AGAGTGAACA CAAATGGAGC AATATTAATT TCAGCATGTC ATATGGACAA GACCAAATGC ATAGAACAAC TACCTATGGA CATACAGTAT ATGTTCAAAC ACCTACTGAC CCTCAACCGA ATCCCATTGC TTATCTTTCC GGGCTGTCGA ATGATGGTCA AGGTGCGGAT TATGGCTGGA AGACCTGGGA TTTCATGAGT GGGAACCAGA GTATTCTCGA CCCTACAAAA GACATGGCTT GGTTCGGTGT TTTATCCAAT ACCGAAAACA CCCAAAACAT GAAATTTGAC TCATACGAGG AAGCGGCTTG GAACAACCTG ACAATCGAAA AAGACTCCCC GTATTCAGAC ATATTCCTAT GGGGTAGTGT TTCTCTCGGG CTTGCTGAGA AAGATGGCGA TAAAGGCTGG TTGGACAACT ACAGATCAGG CTCGTCCTCC GTGAGATTGA TGTATGATAA TTTCAAACAA GGCTGGTACT GGGATACACG ATTTTACGGG GAAGGGGAGA GCAAAGGTGA TGTAAAAGTC ATAGATCCAA AATCTTCATG TCTGCACTAT TACAACAAAG CCAATAAAGC TTGGGATAAA ATTTCTTATC AAGACCTTCT AGACAATCCC ACTACTAAAG ATGAAAATGG AATGGAATAC CAACAGATCG CGAAAAGAGC CCAATTTGAA TACTGGGCAG TAGATGAAGA TCACATTGTT GGAGGACCAG ATGATGACTG GTTAACAGGA GGTGATGGAG GCGACTATAT GCATGCAGGT CATGGTCGGG ATACGCTTCT GGGTGGTGAT GGTGATGATG TTTTGATTGG TGGCGAAGGC CGTGATCTCC TTAAAGGAGG GCAAGGCTCA GATGTTTTTA TGTATAAAGA TGCATCTCAC TCCGGCTATG GGATAAAACG TGATGTGATT GGAGACTTTA GATCACATCA AAAGGACAAA ATAGATTTAT CTGGAATCCA AGCTGGCCTG ATCTTTATTG GTTCAGACGG CTTTAGTGGC CAGGCAGGCC AAGTGCGATT TGAGAATGGT CTGCTTCAGG TCAAGATAGA TCGAGGTTGG CGAGCAGAAT TTGAGATCCA GTTGCTTGGT GTTGATAGCC TTGATCTTGA TGATCTAATC TTGTAG
|
Protein sequence | MNNPLLMYEM ITNPFNFFGD FAGAHDGNID FKEEWKQSDF DLNEGNSIWN KYTNNDLINL LADIQSLGSN EFTPIAGNFG IQKHLTTSTY QITDATDVID KNSPINSNYS ASQVGDQPSR GYRGNPAGIA QEGVWGVLCD DIWSQIIATA NSLQTEEDKQ KLINTYAFNK EDGKQYNIII PDLRKGVTFG NKDIITDQYV ISNPISFSNL RSNVNARARA FDTVNHSDHF IKQQPEPPET ELRDNWKKGF LAGHGHLGLP GVNQDGNLSQ GIEWRKVMFR GEGDNIISTR YFYPIGNGFD SAIHQKNNIN TGGGSDYVIY DNSQHEVKLG DGDDLAFPSI KAFAPSIGFG QHAQSKLDRS PKYNWDPVRY KDHWNWFDGS VFYESVDLGW PFYDSGTNPN RKGLIKYQNN RLLPQPDQNK LAPTVIENPL TSDPIKSIDN GWSKEKNVWY YSDRAQIHEG VQPRQAIEIG GQKVYGGKGH DTLHGFDPLI YASEEAKEYN YVQKMRDNPW KDGRPLPHVI ENKLNFLGDK DIDFNWDPIL LSGGEGSDRI NLGDLKRINL GNGQIIDNNT AGTLYLVFGD KEKSAECTEI ARKSKKWADN MSPDVFSLDA SYDFREEIIV EGLNIDNRIA GDDPKSDWTT QAATVQKSVT AAALTAAAYL ETAFPVIGAA SAIAAVGIDI AKQLQQHDSS ASQSEATDFY ERDEVKEKIV PLGSWTKAVT IPDFDASDNI TINLIPIEDP SVRQSEHKWS NINFSMSYGQ DQMHRTTTYG HTVYVQTPTD PQPNPIAYLS GLSNDGQGAD YGWKTWDFMS GNQSILDPTK DMAWFGVLSN TENTQNMKFD SYEEAAWNNL TIEKDSPYSD IFLWGSVSLG LAEKDGDKGW LDNYRSGSSS VRLMYDNFKQ GWYWDTRFYG EGESKGDVKV IDPKSSCLHY YNKANKAWDK ISYQDLLDNP TTKDENGMEY QQIAKRAQFE YWAVDEDHIV GGPDDDWLTG GDGGDYMHAG HGRDTLLGGD GDDVLIGGEG RDLLKGGQGS DVFMYKDASH SGYGIKRDVI GDFRSHQKDK IDLSGIQAGL IFIGSDGFSG QAGQVRFENG LLQVKIDRGW RAEFEIQLLG VDSLDLDDLI L
|
| |