Gene P9211_16851 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_16851 
Symbol 
ID5730034 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp1510140 
End bp1511651 
Gene Length1512 bp 
Protein Length503 aa 
Translation table11 
GC content41% 
IMG OID641286066 
Productphytoene dehydrogenase and related protein 
Protein accessionYP_001551570 
Protein GI159904226 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1233] Phytoene dehydrogenase and related proteins 
TIGRFAM ID[TIGR02733] C-3',4' desaturase CrtD 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTAATG AGCAAGTAAT TGTTGTTGGA GGCGGAATCG CAGGGTTAAC TGCATCTGCT 
TTATTGGCTC ATGAAGGATT TAATGTAATT CTTTTAGAAT CCCATTATCA GACAGGAGGG
TGTGCAGGGA CGTTTAAAAG AGGAAATTAC GTTTTTGATG TAGGTGCAAC TCAAATTGCA
GGATTAGAGA AAGGAGGTAT TCATGAACGG CTATTTAGAT ATCTTGATTG GCCTCAACCT
TTGGGGTCAA TTCTCGATCC AGGTTGTGAA GTAGACCTTA GAGACGGCTC TCGGCCAATA
AAAATCTGGC ATGATCCTGT GCGATGGGAG ATAGAACGAA ATCAACAATT TCCAGGTAGT
AAGCGTTTTT GGTCTCTTTG CAGTTTGATT CACCAAAGCA ATTGGTCTTT TGCAGAAAAA
GACACAGTAT TACCAGTTTC AGACTTTTGG GATTTTAAGC AGTTCATCAG TTCAGTTGAG
CCTTTAACCT TTTTGTTTGG CTTATTAAGT AGGTCATCAG TTTCTGATCT TCTTTGGCTC
TGTTCTTGCC CTCAAGATCA ACGGCTCCAG AAATTTCTTG ATTTACAACT TCAACTTTAT
TCTCAAGAAA AAGCAGATCA AACAGCTGCA CTTTATGGCG CCACTGTGCT TCAGATGGCA
CAGGCACCAC GTGGCTTATG GCATTTGCAG GGTTCTATGA AGAGTTTGAG TGATGCATTG
GAATCTTGCT TGCTTCGTGA TGGAGGTAAA TTACTTCTGA GGCATAAAGT TACTGCGATA
GAGACAATTA GTACGACAGA TTCTTGGATT GTAGATGTTG TCGATGGTAA TGGTTCTACA
TCCCAACTGG TTGCTTCTGA CTTAGTATTC ACTCTTCCTC CTCAATGTCT TTTGGGGTTA
ACATCAGGCA ATTCATCTTT ACCTACTAAC TATCGGGACC GCATAGCTAA ATTACCTCAA
CCTAATGGCG CGATAGTTTT TTATGGAGCG ATTTCGCGTC AGTACCTTCA AGAAATTCAT
TCAAACCATT ATCAATTCTT TGTAGAAGAT CTTGGCTCAT TATTCCTTTC GATTAGTTGT
GAAGGAGATG GTCGTGCACC TCTAGGTGAG GCAACTATCA TTGCAAGTGC TTTTACTTCT
GTAGATTTCT GGGCATCTTT GGATGATCAG ATTTACCGAA AAGAAAAACA AGTGTTTTTG
CAGAAAATTT TAAAAGCCAT CAAATCAGTT TTTAATATTC ATCCTGATGA TTGGTTACAT
AAAGAATTGG CAACTCCAAG AAGTTTTGCA AAATGGACAG GCCGGCCACA AGGAATTGTA
GGTGGATTGG GACAGAGGCC AACTACTTTT GGCCCGTTTG GATTATCTAG TAGATCTCCT
GTGAAGGGTT TATGGCTCTG TGGAGATTCA ATTTATCCGG GGGAAGGCAC TGCTGGAGTT
TCGCAATCTG CATTAATGGC CTGCCGACAG CTAATGGCTA CCAAGGGGCA CCCTTTCAGA
TTACCCAAAT AG
 
Protein sequence
MTNEQVIVVG GGIAGLTASA LLAHEGFNVI LLESHYQTGG CAGTFKRGNY VFDVGATQIA 
GLEKGGIHER LFRYLDWPQP LGSILDPGCE VDLRDGSRPI KIWHDPVRWE IERNQQFPGS
KRFWSLCSLI HQSNWSFAEK DTVLPVSDFW DFKQFISSVE PLTFLFGLLS RSSVSDLLWL
CSCPQDQRLQ KFLDLQLQLY SQEKADQTAA LYGATVLQMA QAPRGLWHLQ GSMKSLSDAL
ESCLLRDGGK LLLRHKVTAI ETISTTDSWI VDVVDGNGST SQLVASDLVF TLPPQCLLGL
TSGNSSLPTN YRDRIAKLPQ PNGAIVFYGA ISRQYLQEIH SNHYQFFVED LGSLFLSISC
EGDGRAPLGE ATIIASAFTS VDFWASLDDQ IYRKEKQVFL QKILKAIKSV FNIHPDDWLH
KELATPRSFA KWTGRPQGIV GGLGQRPTTF GPFGLSSRSP VKGLWLCGDS IYPGEGTAGV
SQSALMACRQ LMATKGHPFR LPK