Gene P9303_19061 details

Gene Information

Locus tag: P9303_19061
Symbol:
ID: 4775930
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 1667500
End bp: 1669713
Gene Length: 2214 bp
Protein Length: 737 aa
Translation table: 11
GC content: 37%
IMG OID: 640087415
Product: hypothetical protein
Protein accession: YP_001017913
Protein GI: 124023606
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID:
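
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon. Neither convention is stated in the record itself, and the variable names are hypothetical.

```python
# Values copied from the Gene Information fields.
start_bp = 1667500
end_bp = 1669713
gene_length_bp = 2214
protein_length_aa = 737

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted in the protein length.
codon_count = gene_length_bp // 3
coding_codons = codon_count - 1

print(span_bp == gene_length_bp)           # span matches the listed gene length
print(gene_length_bp % 3 == 0)             # length is a whole number of codons
print(coding_codons == protein_length_aa)  # codons minus stop match the protein length
```

Under these assumptions all three checks print True for this record: 1669713 - 1667500 + 1 = 2214, and 2214 / 3 - 1 = 737.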


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.227991
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATTCAT ACGATGTCAT CATAGTTGGA GGTGGATGTG CTGGATTATC CGCTTTAAGC 
ACTTTAGCTA ATTATCGGAC TTTATTAATA GAGAAAAACT CATCTTTAGG AGGTCGTGTT
AACTCAATAA ACCTAGAAGA TGCCTCTGCT GAGTTAGGAG CATTATATTC CTTGGAACCG
ATGCTGCAAT TTAATTCTAT ATCACAAACA GCATGCTTGG ATAACATTAA TGGCAACGCT
CAACACTCAT CTATCTTATT TATAATATCT CACTCTCTTC AGTTCGAGTC ATTTTCACCC
ATCGAAGCCT TTGACAGGCT TGCCATCGCA AATCAATCTC AACAAAGTAC TAGCTTTTAT
AAGCGAGTCA TAAAAACAAA CTCTAGATCA GAACCCACAA TCACTAACCC CAACTTTGGC
AGGCTTGATT TACTTTCACC TGACCAGTTA AGCCTTGTTG CTTCACTTTT TCAAGTTACT
CACCCAGGAT CAATTTATGA CTGTATTCCA TCATTAAGAC CATTATGCTT AAATAATTTC
CCTTCTTTGA CTCGAACGCA AACCAACCAA TCAGTTTTGA TGAATCTTTT TAGAGTACCT
AGCAACGCTC AGGTAGCTTT GTCAAGTAAT ATAATAAGTT TAACCGAACT CAACTCATCA
GTTGAGGTCA AGTTTGAAAA CAATGATCAA TGCATTTCTG CCCAAGCAAA ATCGGTGATT
GTTACTCCAC CACCGGCCTC TATTTTTCCG TTTATAAAAT CTATCAGACC TCAAAGCCTT
AATTTTTACC ATTCCGCGCC ATATATCAGC GGATATTCTT TCGTCTTTCT TTTTATTGGA
CCTATTCCAG CGCAATCAGC ATTTGTGTCT TCTATCCATA TATGGAGTGT TTGTTTTTTT
ACACGAGTTG ACGATTCTAG ATTTATTGTT AATTGCTATG TCCCATCAAA CCGTTTTGGT
CAGTTTTCAG CAAAACCTTC CGAGCATGAA TTGGCCTCCT CTCTTAAACC CTATCTTTCA
GATTCTTGCG TACTGCTGCA ATCTCGCTCT AAGTTTTGGA GATATCTCGC TCCAGTTCTA
TCAGATCTTT TAGTTGATAG ATACTTTCAA GAACATTATG TACTCTCAAA TAGGATTTTT
TATGGAGGCG AACTTTCAAC CTTTACTCCT GATAACTTAA ACGCTTACGG AACCCATAAT
GCCATCAAAG CTGGCGAAAC TGTTGCTAAC TTTACCATTG ATACTATTTC CTCCTCTCAC
CGTTTTATCA ATTTTAGGCC ATTACCTCCC TTGTTAAATG CTCACGTTTT TACCTTCAAT
CAACAAATAC CTAAATATTT AGGCTCACAG CCAGAAGGTA ACGTTGCTTT TTATGGCTTA
CTATTGTCTG CTTTTAAAGA ACAATCTATT AAGTCATATC TTCTTAAAGC TTCTGTTGAA
GGATTATGGG AATATAATCA TGGATTTGGC GTTACTTTAG AGGATTCACT TCTTGTTCTC
GAGGGGCTGA TTGATAACGG TTTGCCGAAG CATCACATTA AACGTATTCT CAACCTCTGC
ATTGACAAAT TTTATGACAC AACAACTGGT TTATTTGTAA CGGTAAAAAA AGGTCGCTCT
AGTTACTGGC AAGGACCCTC TATTCATGGA ACTGCCCAGA TATCGTATCT AATTCTAAAA
CACTTCAACT TGTCCGATTC GTTACTAGAT AGCCAAAAGA TCTTGGATTT TTTGTCAGCC
AGTGTAACCT CAGACTCTCT TTGGAAAAGT CGTTGGTTTA CTAATTCTTA TTTTACCAGT
TTTTATGTCG TTAGGCTTTT AATGTTTCTG CCAGATGATC ACACGACTTA TTTATTATTG
TCACGTTACT ATGAACGGAT TCTTCAATCA CAAGTTTCAA ATGGCTCATG GGAGGACTCC
CCAATATCTA CCTCTTCTGT AATTCTAACT CTTGCTTTAT ATCACAACTC CTTGCCTATT
ACCTTATCCA ATGAAATAAA GAATTCTCTG ATAAAAGCAA TTAATTACCT AAGGCAATTT
CAGGGGCAAG AATGTTTTCC ATCCGAACCT CTACTTTATT ATTGGTATGA TATTTCTGAA
ATTGATGATC AATTTACTAA AAGATTTTAC CATTGTATGG ACAAAGGCAG GATCTCAGCT
GCTTTAGCAT CTTTAGCGCT CCAATCCGCC TCTAGCTATA TAGTAGATAA GTGA
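
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be joined before any analysis. The following is a minimal sketch of how one might do that and then compute a GC percentage and run a basic coding-sequence sanity check. The helper names (`clean_sequence`, `gc_percent`, `looks_like_complete_cds`) are hypothetical, the input is a toy string rather than the full sequence, and the check accepts only ATG as a start codon even though translation table 11 permits alternative starts.

```python
def clean_sequence(block: str) -> str:
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq: str) -> float:
    """Return the percentage of G and C bases in seq."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Stop codons used by translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def looks_like_complete_cds(seq: str) -> bool:
    """Simplified check: whole codons, ATG start, recognized stop codon."""
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# Toy input in the same block layout as the record (not the real gene).
toy = clean_sequence("ATGAAT TCATGA")
print(toy)
print(gc_percent(toy))
print(looks_like_complete_cds(toy))
```

To apply this to the record, paste the full Gene sequence block into `clean_sequence` and compare the result of `gc_percent` with the listed GC content. The record does not say how its GC value was computed or rounded, so a small difference would not necessarily indicate an error.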
 
Protein sequence
MNSYDVIIVG GGCAGLSALS TLANYRTLLI EKNSSLGGRV NSINLEDASA ELGALYSLEP 
MLQFNSISQT ACLDNINGNA QHSSILFIIS HSLQFESFSP IEAFDRLAIA NQSQQSTSFY
KRVIKTNSRS EPTITNPNFG RLDLLSPDQL SLVASLFQVT HPGSIYDCIP SLRPLCLNNF
PSLTRTQTNQ SVLMNLFRVP SNAQVALSSN IISLTELNSS VEVKFENNDQ CISAQAKSVI
VTPPPASIFP FIKSIRPQSL NFYHSAPYIS GYSFVFLFIG PIPAQSAFVS SIHIWSVCFF
TRVDDSRFIV NCYVPSNRFG QFSAKPSEHE LASSLKPYLS DSCVLLQSRS KFWRYLAPVL
SDLLVDRYFQ EHYVLSNRIF YGGELSTFTP DNLNAYGTHN AIKAGETVAN FTIDTISSSH
RFINFRPLPP LLNAHVFTFN QQIPKYLGSQ PEGNVAFYGL LLSAFKEQSI KSYLLKASVE
GLWEYNHGFG VTLEDSLLVL EGLIDNGLPK HHIKRILNLC IDKFYDTTTG LFVTVKKGRS
SYWQGPSIHG TAQISYLILK HFNLSDSLLD SQKILDFLSA SVTSDSLWKS RWFTNSYFTS
FYVVRLLMFL PDDHTTYLLL SRYYERILQS QVSNGSWEDS PISTSSVILT LALYHNSLPI
TLSNEIKNSL IKAINYLRQF QGQECFPSEP LLYYWYDISE IDDQFTKRFY HCMDKGRISA
ALASLALQSA SSYIVDK