Gene P9303_25851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_25851 
Symbol 
ID4776183 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2279612 
End bp2282977 
Gene Length3366 bp 
Protein Length1121 aa 
Translation table11 
GC content43% 
IMG OID640088106 
Producthypothetical protein 
Protein accessionYP_001018581 
Protein GI124024274 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2931] RTX toxins and related Ca2+-binding proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATC CGCTGTTAAT GTATGAAATG ATCACCAATC CATTTAACTT TTTTGGGGAT 
TTTGCAGGTG CACACGACGG CAATATAGAT TTCAAAGAAG AATGGAAACA ATCAGACTTT
GACCTCAACG AAGGAAACTC GATATGGAAT AAATATACCA ACAATGACCT GATCAATTTA
CTAGCAGACA TTCAAAGTCT GGGTTCCAAT GAATTCACAC CAATAGCCGG CAATTTTGGA
ATTCAAAAAC ATCTAACAAC TTCAACTTAT CAGATAACAG ACGCGACCGA TGTGATAGAC
AAAAACTCAC CCATCAACAG CAATTATTCC GCCTCGCAGG TGGGGGATCA ACCATCCAGA
GGATACAGAG GGAACCCTGC CGGAATAGCC CAAGAGGGAG TATGGGGAGT GCTCTGTGAC
GATATCTGGA GTCAAATCAT TGCAACTGCA AACTCACTTC AAACAGAGGA AGACAAGCAA
AAGCTGATCA ACACCTATGC ATTCAATAAA GAAGATGGCA AACAATACAA TATCATCATT
CCTGACTTGC GGAAGGGGGT AACATTCGGA AACAAAGATA TTATCACGGA TCAATATGTA
ATTTCAAACC CAATAAGCTT TTCAAACCTA CGATCCAACG TAAATGCGAG GGCAAGAGCT
TTTGATACCG TTAATCATTC CGATCATTTC ATAAAACAAC AACCTGAACC GCCAGAGACC
GAACTCCGCG ATAACTGGAA AAAAGGATTC CTGGCAGGAC ATGGCCACCT CGGCTTACCC
GGGGTAAACC AAGACGGCAA TTTAAGTCAA GGAATAGAAT GGAGAAAAGT TATGTTTCGT
GGAGAGGGAG ACAATATTAT CTCAACACGA TATTTTTATC CGATAGGCAA CGGATTTGAT
AGCGCGATAC ATCAAAAAAA CAATATAAAT ACTGGGGGTG GTTCAGATTA TGTCATTTAC
GATAATTCGC AGCATGAGGT CAAGCTTGGT GATGGTGACG ACTTAGCCTT TCCCTCTATT
AAAGCTTTCG CTCCTTCCAT TGGCTTCGGG CAGCACGCCC AAAGTAAGTT AGATAGGAGT
CCCAAGTACA ACTGGGATCC AGTACGTTAC AAAGACCATT GGAACTGGTT CGATGGGAGT
GTTTTCTATG AATCCGTTGA CCTGGGATGG CCTTTTTACG ACTCAGGAAC AAACCCAAAC
AGAAAAGGGT TGATAAAATA TCAAAATAAT CGCTTATTGC CACAACCAGA TCAAAATAAA
TTGGCGCCAA CGGTTATTGA AAACCCTCTG ACTAGCGATC CAATAAAAAG CATTGATAAT
GGTTGGTCAA AGGAGAAAAA TGTGTGGTAC TACAGCGATA GAGCACAGAT CCATGAAGGT
GTTCAGCCAA GACAGGCGAT CGAGATTGGT GGCCAAAAAG TTTATGGTGG TAAAGGCCAC
GACACATTGC ACGGCTTTGA CCCACTGATA TACGCCTCAG AAGAAGCAAA AGAATATAAT
TACGTGCAAA AAATGAGAGA TAATCCTTGG AAAGACGGAA GGCCATTGCC GCATGTAATT
GAAAATAAAC TCAACTTTTT GGGAGACAAA GATATCGATT TCAATTGGGA TCCCATTCTC
TTATCCGGAG GAGAAGGATC AGACAGAATT AATCTGGGCG ATCTCAAACG CATTAATCTT
GGCAATGGTC AAATCATCGA CAACAACACC GCCGGCACAT TATATTTGGT TTTTGGTGAC
AAAGAAAAAT CTGCTGAATG CACGGAAATC GCGAGAAAGA GTAAGAAGTG GGCTGACAAT
ATGAGTCCTG ATGTCTTTTC TCTCGATGCT TCCTATGATT TCAGAGAAGA AATCATTGTC
GAAGGATTGA ATATTGACAA CCGCATAGCT GGAGATGATC CAAAATCAGA TTGGACGACA
CAGGCTGCGA CAGTTCAAAA ATCAGTGACG GCAGCCGCTT TGACTGCTGC AGCCTATCTT
GAAACGGCAT TTCCGGTTAT TGGAGCAGCT TCGGCAATAG CTGCCGTAGG AATAGATATT
GCCAAACAAT TACAACAACA CGATTCGTCT GCCTCACAAA GTGAAGCCAC GGACTTTTAC
GAACGAGATG AGGTGAAGGA AAAAATTGTT CCCTTGGGTT CTTGGACTAA AGCCGTCACC
ATTCCTGATT TTGATGCAAG CGACAATATA ACAATCAATT TGATTCCAAT TGAGGATCCT
AGCGTTCGTC AGAGTGAACA CAAATGGAGC AATATTAATT TCAGCATGTC ATATGGACAA
GACCAAATGC ATAGAACAAC TACCTATGGA CATACAGTAT ATGTTCAAAC ACCTACTGAC
CCTCAACCGA ATCCCATTGC TTATCTTTCC GGGCTGTCGA ATGATGGTCA AGGTGCGGAT
TATGGCTGGA AGACCTGGGA TTTCATGAGT GGGAACCAGA GTATTCTCGA CCCTACAAAA
GACATGGCTT GGTTCGGTGT TTTATCCAAT ACCGAAAACA CCCAAAACAT GAAATTTGAC
TCATACGAGG AAGCGGCTTG GAACAACCTG ACAATCGAAA AAGACTCCCC GTATTCAGAC
ATATTCCTAT GGGGTAGTGT TTCTCTCGGG CTTGCTGAGA AAGATGGCGA TAAAGGCTGG
TTGGACAACT ACAGATCAGG CTCGTCCTCC GTGAGATTGA TGTATGATAA TTTCAAACAA
GGCTGGTACT GGGATACACG ATTTTACGGG GAAGGGGAGA GCAAAGGTGA TGTAAAAGTC
ATAGATCCAA AATCTTCATG TCTGCACTAT TACAACAAAG CCAATAAAGC TTGGGATAAA
ATTTCTTATC AAGACCTTCT AGACAATCCC ACTACTAAAG ATGAAAATGG AATGGAATAC
CAACAGATCG CGAAAAGAGC CCAATTTGAA TACTGGGCAG TAGATGAAGA TCACATTGTT
GGAGGACCAG ATGATGACTG GTTAACAGGA GGTGATGGAG GCGACTATAT GCATGCAGGT
CATGGTCGGG ATACGCTTCT GGGTGGTGAT GGTGATGATG TTTTGATTGG TGGCGAAGGC
CGTGATCTCC TTAAAGGAGG GCAAGGCTCA GATGTTTTTA TGTATAAAGA TGCATCTCAC
TCCGGCTATG GGATAAAACG TGATGTGATT GGAGACTTTA GATCACATCA AAAGGACAAA
ATAGATTTAT CTGGAATCCA AGCTGGCCTG ATCTTTATTG GTTCAGACGG CTTTAGTGGC
CAGGCAGGCC AAGTGCGATT TGAGAATGGT CTGCTTCAGG TCAAGATAGA TCGAGGTTGG
CGAGCAGAAT TTGAGATCCA GTTGCTTGGT GTTGATAGCC TTGATCTTGA TGATCTAATC
TTGTAG
 
Protein sequence
MNNPLLMYEM ITNPFNFFGD FAGAHDGNID FKEEWKQSDF DLNEGNSIWN KYTNNDLINL 
LADIQSLGSN EFTPIAGNFG IQKHLTTSTY QITDATDVID KNSPINSNYS ASQVGDQPSR
GYRGNPAGIA QEGVWGVLCD DIWSQIIATA NSLQTEEDKQ KLINTYAFNK EDGKQYNIII
PDLRKGVTFG NKDIITDQYV ISNPISFSNL RSNVNARARA FDTVNHSDHF IKQQPEPPET
ELRDNWKKGF LAGHGHLGLP GVNQDGNLSQ GIEWRKVMFR GEGDNIISTR YFYPIGNGFD
SAIHQKNNIN TGGGSDYVIY DNSQHEVKLG DGDDLAFPSI KAFAPSIGFG QHAQSKLDRS
PKYNWDPVRY KDHWNWFDGS VFYESVDLGW PFYDSGTNPN RKGLIKYQNN RLLPQPDQNK
LAPTVIENPL TSDPIKSIDN GWSKEKNVWY YSDRAQIHEG VQPRQAIEIG GQKVYGGKGH
DTLHGFDPLI YASEEAKEYN YVQKMRDNPW KDGRPLPHVI ENKLNFLGDK DIDFNWDPIL
LSGGEGSDRI NLGDLKRINL GNGQIIDNNT AGTLYLVFGD KEKSAECTEI ARKSKKWADN
MSPDVFSLDA SYDFREEIIV EGLNIDNRIA GDDPKSDWTT QAATVQKSVT AAALTAAAYL
ETAFPVIGAA SAIAAVGIDI AKQLQQHDSS ASQSEATDFY ERDEVKEKIV PLGSWTKAVT
IPDFDASDNI TINLIPIEDP SVRQSEHKWS NINFSMSYGQ DQMHRTTTYG HTVYVQTPTD
PQPNPIAYLS GLSNDGQGAD YGWKTWDFMS GNQSILDPTK DMAWFGVLSN TENTQNMKFD
SYEEAAWNNL TIEKDSPYSD IFLWGSVSLG LAEKDGDKGW LDNYRSGSSS VRLMYDNFKQ
GWYWDTRFYG EGESKGDVKV IDPKSSCLHY YNKANKAWDK ISYQDLLDNP TTKDENGMEY
QQIAKRAQFE YWAVDEDHIV GGPDDDWLTG GDGGDYMHAG HGRDTLLGGD GDDVLIGGEG
RDLLKGGQGS DVFMYKDASH SGYGIKRDVI GDFRSHQKDK IDLSGIQAGL IFIGSDGFSG
QAGQVRFENG LLQVKIDRGW RAEFEIQLLG VDSLDLDDLI L