Gene P9303_03731 details


Gene Information

Locus tag: P9303_03731
Symbol:
ID: 4776376
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9303
Kingdom: Bacteria
Replicon accession: NC_008820
Strand:
Start bp: 375245
End bp: 378061
Gene Length: 2817 bp
Protein Length: 938 aa
Translation table: 11
GC content: 42%
IMG OID: 640085876
Product: hypothetical protein
Protein accession: YP_001016390
Protein GI: 124022083
COG category: [E] Amino acid transport and metabolism
    [Q] Secondary metabolites biosynthesis, transport and catabolism
    [R] General function prediction only
COG ID: [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases
    [COG1233] Phytoene dehydrogenase and related proteins
TIGRFAM ID: [TIGR02733] C-3',4' desaturase CrtD
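
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumption)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(375245, 378061)
print(length)                     # 2817
print(protein_length_aa(length))  # 938
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.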


Plasmid Coverage information

Num covering plasmid clones: n/a
Plasmid unclonability p-value: n/a
Plasmid hitchhiking: n/a
Plasmid clonability: n/a
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.968761
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACGTGC TTGACCGCGA TATTTTGCTG CCCTCTTATG ATGTCTTAGT AGTTGGCGCT 
GGGATTGGCG GACTAACAGC AGCAGCACTT CTTGCTAAGC GTGGTTACAG AGTTTTGGTA
GTAGAGCAGC ATTATCTCCC TGGTGGATGC GCGAGTATCT TTCGCCGTCA AGGGTTTACA
TTTGATGTGG GAGCTTCTCT CTTTTTTGGG TTTGGAGAAA AAGGCTACAA CCCACATCAA
TTTGTCATGA ATGAGCTGGA GGAAGACATT ACTCTTGTGC CGATGGATGA AACCTTCACT
ATCCATCTAG ATCAGAAAAC CAAAGTCTCT ATGTATACAC AACGGGAAAG ATTTTGGGAA
GAAATGTGTT CTTGTTTCCC ACACCAATCA GAACAAATCA AAGCATTACT AAGGGAATTC
GAATCTTTTT ATCATGACTC ATTGGATAGC TATGGTGGTC AGTTTTTTGC GCCAGCGGAA
ACACCACCGC AACATGGCAT CAATTTGATG CTTACGCGCC CGTTCTATTT GGCAAGGTTG
CTCAATTATC TGCTTTCAAC TCAAGAGCAG CTATTTCGCC GCTTTACGCG AGATCCGCAG
ATTCTCAAAC TATTTACGCT TTTAAATCAA AATATGACCA CTTGTGGATT GGATCAGACT
CCTGCCATCG CCGGCCCAAT GATCCACGTG GAATCTTACA CTGGTGGTTG CTATTATGCT
CAAGGCTCAC CACAGATATT GGCGAATAAA CTAGAAAAAG CCATTCATAA ATATGGTGGG
CAAATATTAT ATCGCAATCG CATTGACAAG ATCTTAATTG AGAAAGGCAA GGCGATTGGC
TGTCAACTTG AAAATGGTCT TAAGATTAAA GCTCATACAG TAATCTCAAA CACAACGATC
TGGAATCTCT ATGGTAAGTT GATCGATCCC CAGCATATCT CCCGCAGAAA AAGAAAGTGG
GCAAAGAAAT TTAGGCCAAT TTATAGTGTA TTTGGTGTTT ATCTCGGTGT TAAAGCAGAA
GCTGTCCCTC CTGCGATTAA GCCTACACAA ATACTGCCCT ATGACGACAA AGGTTCTTCA
AGTTATCTGA CAGTCTATGT CACGTCGATG TTGGATCCTG ATGCTTCTCC GCCAGGGACC
CATACACTCT GCATATTTCT ACCGGAATCT ACTCCAGAGA TTACAGTGCC AACAGATGGC
AAAGATAAAT ATCATAGCCG TACATATCGA GATCAAAAGC AGAAAAAAGC ATCAGCAATT
ATTGATTACT TAGAAAAGAA TTATTTCCCA GAACTAAAAA AACATATCTT GGTTCAGGAA
ATAGCTACCC CACAGACTAT TCAGCGCTAT ACCCTTAAAA GTCATGGATC TATTGGTGGA
CCACAGGTGA ATATGAGCCA AAGCTATATG AGTCGGCTGG CTGCTCGCAG TGATTGGCAA
GGTCTTTATT GTGTAGGGGA TTCCACCAGT CAAGGGATCG GCGTAGTTTC TGTGACTGTG
TCCGCAATAA GTGCAGTCAA TGCAATTTTA AAGGATCTTC GTCAACCTCA ATATCTTCCA
TTAAAACATT ATCCTCAAAA TTATGTGCAT TTTGCTAAAG CCTCTCTATC TCAAGAACAA
CACTCTGCGT ATTTGTTCCC AGACAGTCAG ATTATACAAA CTGTGGAATT AGAGCCCCGA
GCTTGTCTCT CTCCGCGCTG CGTTCGTGCT GGTCCAGACA GTACTCATCA AGCCCATATC
GCACGTCTTG TCGAAGCCGG TAATTGGCTT GGTGCTGCCC AGTCATTACG AGCCGTTAAT
CCACTTTCAG AAATAACCTC GTACCTTTCT CAATCAGACG ATTTTTGTGG TCCTTCATGT
TCGCAATTGT TATGTCCTGA CGCTTCTATT CCCATTAAGT CATTAAATCG CTATGTGTGC
GAGAAAGTCC CCAATTACAT ACCAGATGTA GCTCCGGATA ATGGTAAGCG AATTTCTATT
GTTGGGGCCG GGCCTGCTGG TCTAACATGC GCTCACTATT TGGCGCGTTT AGGCTACCAA
ATAGATATCT ACGAGAAGCA AAGCGAATCT GGAGGGATTT TGAAGAGGAT TACTCTTGCA
TCAAGGATTC CTCAGTCAGT CTTGCATCGT GAAATATCAA ACCTATTATT GCCTTCTATT
AGGATTTATT TTGATCAATC TTTAGGTGAA GATATCACAA TTGCTGAGCT TAGACTTCAA
TATGATGCAA TCTTTTTAGC CTGTGGTCTA GGAGAGAAGC AAATTCAGCC TAAAGATATC
GATCAAGATT TCAACTTGAT TCATGGTCTG AAGTTTTTAG ACCAATTTAC TCAAGAGCCA
GCTATTGTTA GAGAAAAGAT ATTGACAATA GTTGGTGCTA CCTACCTTGC TACGGATATC
TCAAAGTTGG CAATTCAAAA TGGTGCAAAG AAGGTTTGTC TTATTGACGA GCTGTCTGAA
CTGCAATCGA AATCTCAGGC TAAGCGATTA ACAGAAATGC AAGAACTTGG AATTGAAATC
CACTCCAGAA TAGATCCAAT AGCTTTCTCT GAACTCTGTA GTTCTTCTCA TCAGGTGATC
ATGGCAGGTT TGGAGCAACA GCGTATTCAG TCTGAATTAA GGGACCATCT AAGTACAAGT
CTTGTGGTAG ACATTGATGA CTGGGTTGAC TTAGAGACGC TTCAGGTTCA TGGTCAGGTC
AATGTATTCG CCGGTGGAGA TATCATTAGA GGCTCCAGTA GTATGCAAGA ATCAATTAGA
GACGGTCGCA AGGCAGCAGT AGAAATCAAT CATGCCTTAA TGAAGAATCA ATCTTGA
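
The gene sequence above is printed in space-separated blocks of ten bases, so it needs to be joined before any analysis. The following is a minimal sketch of that clean-up together with a GC-fraction calculation of the kind that could be compared with the GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the page does not state how its own GC content value was computed or rounded.

```python
def clean_sequence(grouped: str) -> str:
    """Join space- and newline-separated sequence blocks into one uppercase string."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped_dna: str) -> float:
    """Fraction of G and C bases in a (possibly grouped) DNA sequence."""
    dna = clean_sequence(grouped_dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, as printed on the page.
first_line = "ATGCACGTGC TTGACCGCGA TATTTTGCTG CCCTCTTATG ATGTCTTAGT AGTTGGCGCT"
print(len(clean_sequence(first_line)))  # 60
print(round(gc_fraction(first_line), 3))
```

Passing the complete gene sequence to `gc_fraction` gives a value for the whole CDS rather than for this single line.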
 
Protein sequence
MHVLDRDILL PSYDVLVVGA GIGGLTAAAL LAKRGYRVLV VEQHYLPGGC ASIFRRQGFT 
FDVGASLFFG FGEKGYNPHQ FVMNELEEDI TLVPMDETFT IHLDQKTKVS MYTQRERFWE
EMCSCFPHQS EQIKALLREF ESFYHDSLDS YGGQFFAPAE TPPQHGINLM LTRPFYLARL
LNYLLSTQEQ LFRRFTRDPQ ILKLFTLLNQ NMTTCGLDQT PAIAGPMIHV ESYTGGCYYA
QGSPQILANK LEKAIHKYGG QILYRNRIDK ILIEKGKAIG CQLENGLKIK AHTVISNTTI
WNLYGKLIDP QHISRRKRKW AKKFRPIYSV FGVYLGVKAE AVPPAIKPTQ ILPYDDKGSS
SYLTVYVTSM LDPDASPPGT HTLCIFLPES TPEITVPTDG KDKYHSRTYR DQKQKKASAI
IDYLEKNYFP ELKKHILVQE IATPQTIQRY TLKSHGSIGG PQVNMSQSYM SRLAARSDWQ
GLYCVGDSTS QGIGVVSVTV SAISAVNAIL KDLRQPQYLP LKHYPQNYVH FAKASLSQEQ
HSAYLFPDSQ IIQTVELEPR ACLSPRCVRA GPDSTHQAHI ARLVEAGNWL GAAQSLRAVN
PLSEITSYLS QSDDFCGPSC SQLLCPDASI PIKSLNRYVC EKVPNYIPDV APDNGKRISI
VGAGPAGLTC AHYLARLGYQ IDIYEKQSES GGILKRITLA SRIPQSVLHR EISNLLLPSI
RIYFDQSLGE DITIAELRLQ YDAIFLACGL GEKQIQPKDI DQDFNLIHGL KFLDQFTQEP
AIVREKILTI VGATYLATDI SKLAIQNGAK KVCLIDELSE LQSKSQAKRL TEMQELGIEI
HSRIDPIAFS ELCSSSHQVI MAGLEQQRIQ SELRDHLSTS LVVDIDDWVD LETLQVHGQV
NVFAGGDIIR GSSSMQESIR DGRKAAVEIN HALMKNQS
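
The protein sequence can be related to the gene sequence by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments (which translation table 11 shares with the standard code) and a hypothetical helper name `translate_cds`. It does not handle the alternative start codons that table 11 permits, which is not an issue here because this gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(grouped_dna: str) -> str:
    """Translate a (possibly space-grouped) CDS, stopping at the first stop codon."""
    dna = "".join(grouped_dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence, as printed on the page.
print(translate_cds("ATGCACGTGC TTGACCGCGA TATTTTGCTG"))  # MHVLDRDILL
```

The ten residues produced match the first block of the protein sequence listed above.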