Gene A9601_17731 details


Gene Information

Locus tag: A9601_17731
Symbol:
ID: 4718506
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1505093
End bp: 1506598
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 33%
IMG OID: 640079502
Product: phytoene dehydrogenase
Protein accession: YP_001010163
Protein GI: 123969305
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02733] C-3',4' desaturase CrtD
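
The start, end, gene length, and protein length fields can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the Start bp and End bp fields of this record.
START_BP, END_BP = 1505093, 1506598

length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # prints: 1506 501
```

Both results agree with the Gene Length and Protein Length fields listed in the record.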


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGAAATT CTGAAGTTAT TGTTATTGGC GCCGGTATAG CAGGACTAAC TTCTGCAGCG 
ATTTTATCAA AACAAGGCTT ATCAGTGACC TTAATCGAAT CTCATACTCA AGCCGGAGGA
TGTGCCGGTA CTTTTAAAAG AAAGAATTAT ACTTTCGATG TTGGCGCAAC TCAGGTTGCA
GGTTTAGAGA AGGGAGGAAT ACATTATAGA ATTTTTGATT TTTTAGATAT TCCATCTCCA
GAAGCCACAA TTTTAGACCC TGCTTGCATT GTTGATTTAA ATGATGGTGG TAATCCTATA
CCTATTTGGT ATGAAAAAAG TAAATGGATT GTTGAACGAG AAATGCAGTT TCCTGGGAGT
CAAAGATTTT GGAAACTTTG TTCCCTAATA CATGAAAGTA ATTGGATATT TGCTAATAAC
AATCCTGTAT TACCAATAAG CAATTTTTGG GATTTTTCTC AACTTCTTAA AGCACTAGTT
CCTTCAAACC TTGTCACAGG TATCTTACTT AAATCTACTA TTTTTGATCT ATTGCGGATA
TGTGGATTAT CCAAGAATGA GCGCTTGATT AAATTCTTAA ATCTTCAACT AAAACTTTAT
TCTCAAGAGG ATGTTTATAA TACTGCTGCA TTATATGGAT CTACTGTTCT TCAGATGTGT
CAACAGCCAT ATGGTCTGTG GCATCTTAAA AAATCTATGC AGTCTTTAAG TGAATCATTA
GAAAGTTCAT TGATTAAAAC TGGAGTTAAT TTATTTTTTG GACAAGAAGT AAATTCTATA
ACTTTTGACG ACGTAAATAT GTGTTGGCAA GTATCTGCTA ATTCGAAAAA AAAATCATTT
ATTTACCAAG CAAAAGATGT GATTTATACT GCCCCTCCAC AGTCTTTGCT CAAGCATTTG
AAAGATCCTT TAGAAAGAAA AAAAAATTAT AAAAATCGAC TTAATAATTT GCCTAATCCA
AGTGGAGCTG TAGTTTTTTA TTCAGCCTTA AAAAAGGAAC ATATTAAAAA AACATTCTCC
AATCATTATC AATTTGTTTC GAAAGAATTT TGTTCCTTAT TTGTATCAAT TAGTGATGAT
GGTGATGGAA GAGCGCCAAA AGGTGAAGTT ACTTTAATTG CCAGTATCTT TACCAAAACT
AAAGATTGGG TTGACCTAGA TAAACAAACT TATTTAAAGA AGAAAAATAG TTTCATGAAA
AAAATATCCC TTGAATTGGA AAGTCAATTT GATATTGATC CTGATAAATG GCTACATAGG
GAATTAGCAA CTCCATTGGG CTTTGAAAGA TGGACAAAAA GACCTAATGG AATAGTAGGG
GGGCTTGGTC AAAATCCAGA TATTTTTGGT TTATTTGGAT TATCAAGTAG GACACCTTTT
GAAGGTTTAT GGTTATGTGG AGATTCGATT TATCCAGGAG AGGGGACTGC AGGTGTTAGT
CAGTCTGCAT TAATGGTTTC AAGGCAAATT TTAGCTTCCA AAGGTGTAAA AAATTTTAGT
TTATAA
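
The gene sequence is printed in blocks of 10 bases, so the spacing has to be removed before any computation. The sketch below shows one way to do that and to compute a GC fraction comparable to the GC content field. The helper names are hypothetical, and only the first row of the sequence is used as sample input; the whole sequence can be passed in the same way.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied from this record.
first_row = "ATGAGAAATT CTGAAGTTAT TGTTATTGGC GCCGGTATAG CAGGACTAAC TTCTGCAGCG"

seq = clean_sequence(first_row)
print(len(seq))                    # number of bases in the row
print(f"{gc_fraction(seq):.1%}")   # GC fraction of this row only
```

A single row is not expected to match the gene-wide figure; the comparison with the GC content field is only meaningful over all 1506 bases.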
 
Protein sequence
MRNSEVIVIG AGIAGLTSAA ILSKQGLSVT LIESHTQAGG CAGTFKRKNY TFDVGATQVA 
GLEKGGIHYR IFDFLDIPSP EATILDPACI VDLNDGGNPI PIWYEKSKWI VEREMQFPGS
QRFWKLCSLI HESNWIFANN NPVLPISNFW DFSQLLKALV PSNLVTGILL KSTIFDLLRI
CGLSKNERLI KFLNLQLKLY SQEDVYNTAA LYGSTVLQMC QQPYGLWHLK KSMQSLSESL
ESSLIKTGVN LFFGQEVNSI TFDDVNMCWQ VSANSKKKSF IYQAKDVIYT APPQSLLKHL
KDPLERKKNY KNRLNNLPNP SGAVVFYSAL KKEHIKKTFS NHYQFVSKEF CSLFVSISDD
GDGRAPKGEV TLIASIFTKT KDWVDLDKQT YLKKKNSFMK KISLELESQF DIDPDKWLHR
ELATPLGFER WTKRPNGIVG GLGQNPDIFG LFGLSSRTPF EGLWLCGDSI YPGEGTAGVS
QSALMVSRQI LASKGVKNFS L
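
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the method used by the database. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which codons may act as start codons), and it assumes the sequence is given in the coding orientation. The `translate` helper is hypothetical.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G);
# table 11 uses the same codon-to-amino-acid mapping.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First row of the gene sequence, copied from this record.
first_row = "ATGAGAAATT CTGAAGTTAT TGTTATTGGC GCCGGTATAG CAGGACTAAC TTCTGCAGCG"
print(translate(first_row))  # prints: MRNSEVIVIGAGIAGLTSAA
```

The output matches the first 20 residues of the protein sequence above. Translating the last six bases, `TTATAA`, gives the final residue `L` followed by the stop codon.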