Gene P9303_22931 details


Gene Information

Locus tag: P9303_22931
Symbol:
ID: 4776272
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 2024016
End bp: 2025527
Gene Length: 1512 bp
Protein Length: 503 aa
Translation table: 11
GC content: 56%
IMG OID: 640087812
Product: phytoene dehydrogenase
Protein accession: YP_001018293
Protein GI: 124023986
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02733] C-3',4' desaturase CrtD
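
The coordinate and length fields in this record are mutually consistent, which a short sketch can confirm. It assumes the start and end positions are 1-based and inclusive, and that the CDS includes a terminal stop codon that is not translated; neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 2024016
end_bp = 2025527

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that contributes no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length and Protein Length.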


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGAGG ACTCAATCAT CGTGATCGGG GGCGGCATTG CCGGCCTAAC GGCAACAGCC 
CTCTTAGCCA AGCAAGGATT ACCCGTCACC CTGGTGGAAG CGCACCATCA ACCTGGAGGT
TGTGCCGGCA CCTTCAGGCG AGGTCCCTAT ATCTTCGATG TAGGGGCCAC TCAGGTTGCC
GGCCTAGAAC CCAGTGGCAG CCATGAACGC ATCTATAGGC ATCTCAACCT GACCATGCCA
CAAGCGGAGC TACTCAATCC CGCTTGTGCA GTTGATCTCG CCGACGGATC CCAACCCATT
CAGCTCTGGC ATGATCCAAA GCGCTGGGAA CAAGAATGGA GATATCACTT CCCTGGCAGT
GAGGCCTTCT GGGCTCTGAC TGCCAAGCTC CACCAAAGCA ACTGGGGCTT TGCCAGCCAT
GATCCGGTGG TCACACCCCG CAACGCTTGG GATTTAGGCC AACTACTTCA GGCGCTGCGT
CCAGCAACAA TCACCTCAGG GCTATTGAGC AGCCTGACAG TGGCAGATCT CTTAAAGCTA
TGCGGCTGTG CGCAGGATCA GCGCCTTCGC AAATTCCTAG ACCTCCAGAT CAAGCTCTAT
TCGCAAGAAC CGCTCGATCG CACAGCCGCC CTCTATGGAG CCACCGTTTT GCAAATGGCT
CAAGCACCTC TTGGGCTCTG GCATCTGCAG GGATCGATGC AAAAACTCAG CGATGACCTA
ATGCATGCCC TGAAGCGGGA TGGCGGCCAG GTATTGCTGC GCCATCGTGT GGTGGGCCTA
GAAGTAGCTG CGCGGGGCAT GGCTTGGACA GTCAGCATCA AAACCCCCAA TGGCAAGGCT
TTAGAACTGA CCACACCAGA TGTGGTATGC AGTCTGCCTC CTCAATGCCT GAGATCGCTG
ATGCCACAAG GAGCAGGCAT GCCCAAGGCC TATCGCCAAA GGCTAATGAA ATTACCTCAA
CCCAGTGGAG CTTTGGTGTT CTACGGGGCC ATCGATCGCA GCGCCCTACC TAAGAACTGT
CCTGTCCATC TGCAAAGAGC CTCCAAGACA CCCGGCTCCC TGTTCGTATC AATCAGCCGT
GAGGGAGATG GCCGTGCTCC CGCTGGACAA GCCACAGTGA TTGCAAGTGT GTTCACCGCC
ACAGCGGAGT GGTGTGGCCT AACTGAATCC ACCTATCAAG AACGCAAGCA AACGGCCCTG
TTAGCCATCC GCCAAACACT TGAAAGTTGG TTAGATCTCG CACCTGAAGA TTGGCAGCAT
CAAGAACTAG CCACACCCAG GAGCTTTGCA CATTGGACTG GTCGGCCCAA TGGAATTGTG
GGTGGCCTTG GGCAGCACCC TTCACAATTT GGGCCCTTCG GACTAGCAAG CCGAACTCCC
ATGCGAGGGC TCTGGCTCTG TGGCGACTCA ATTCACCCTG GTGAAGGTAC TGCAGGGGTG
AGCATCTCGG CACTCATGAC CTGTCGCCAG CTCATGGCTC AGCGAGGTCA TCAGTTACGG
GTGGCGAATT GA
 
Protein sequence
MAEDSIIVIG GGIAGLTATA LLAKQGLPVT LVEAHHQPGG CAGTFRRGPY IFDVGATQVA 
GLEPSGSHER IYRHLNLTMP QAELLNPACA VDLADGSQPI QLWHDPKRWE QEWRYHFPGS
EAFWALTAKL HQSNWGFASH DPVVTPRNAW DLGQLLQALR PATITSGLLS SLTVADLLKL
CGCAQDQRLR KFLDLQIKLY SQEPLDRTAA LYGATVLQMA QAPLGLWHLQ GSMQKLSDDL
MHALKRDGGQ VLLRHRVVGL EVAARGMAWT VSIKTPNGKA LELTTPDVVC SLPPQCLRSL
MPQGAGMPKA YRQRLMKLPQ PSGALVFYGA IDRSALPKNC PVHLQRASKT PGSLFVSISR
EGDGRAPAGQ ATVIASVFTA TAEWCGLTES TYQERKQTAL LAIRQTLESW LDLAPEDWQH
QELATPRSFA HWTGRPNGIV GGLGQHPSQF GPFGLASRTP MRGLWLCGDS IHPGEGTAGV
SISALMTCRQ LMAQRGHQLR VAN
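
As a spot check that the protein sequence follows from the gene sequence, the sketch below translates the first 30 and the last 12 bases of the gene. It relies on translation table 11 sharing its sense-codon and stop-codon assignments with the standard genetic code (the two differ in permitted start codons). The function name `translate` is hypothetical and not part of any tool associated with this record.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group bases in the record.
    seq = "".join(dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Fragments copied from the start and end of the gene sequence.
first_30 = "ATGGCTGAGG ACTCAATCAT CGTGATCGGG"
last_12 = "GTGGCGAATT GA"

print(translate(first_30))  # MAEDSIIVIG
print(translate(last_12))   # VAN
```

Both fragments reproduce the corresponding ends of the listed protein sequence, and the final TGA codon accounts for the one-codon difference between the gene length and the protein length.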