Gene Information |
Locus tag | P9303_03731 |
Symbol | |
ID | 4776376 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 375245 |
End bp | 378061 |
Gene Length | 2817 bp |
Protein Length | 938 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640085876 |
Product | hypothetical protein |
Protein accession | YP_001016390 |
Protein GI | 124022083 |
COG category | [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism; [R] General function prediction only |
COG ID | [COG0493] NADPH-dependent glutamate synthase beta chain and related oxidoreductases; [COG1233] Phytoene dehydrogenase and related proteins |
TIGRFAM ID | [TIGR02733] C-3',4' desaturase CrtD |
|
|
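The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; both are assumptions that happen to match the values in this record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(375245, 378061)
print(length)                  # 2817
print(protein_length(length))  # 938
```

Both results agree with the Gene Length (2817 bp) and Protein Length (938 aa) rows.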
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.968761 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCACGTGC TTGACCGCGA TATTTTGCTG CCCTCTTATG ATGTCTTAGT AGTTGGCGCT GGGATTGGCG GACTAACAGC AGCAGCACTT CTTGCTAAGC GTGGTTACAG AGTTTTGGTA GTAGAGCAGC ATTATCTCCC TGGTGGATGC GCGAGTATCT TTCGCCGTCA AGGGTTTACA TTTGATGTGG GAGCTTCTCT CTTTTTTGGG TTTGGAGAAA AAGGCTACAA CCCACATCAA TTTGTCATGA ATGAGCTGGA GGAAGACATT ACTCTTGTGC CGATGGATGA AACCTTCACT ATCCATCTAG ATCAGAAAAC CAAAGTCTCT ATGTATACAC AACGGGAAAG ATTTTGGGAA GAAATGTGTT CTTGTTTCCC ACACCAATCA GAACAAATCA AAGCATTACT AAGGGAATTC GAATCTTTTT ATCATGACTC ATTGGATAGC TATGGTGGTC AGTTTTTTGC GCCAGCGGAA ACACCACCGC AACATGGCAT CAATTTGATG CTTACGCGCC CGTTCTATTT GGCAAGGTTG CTCAATTATC TGCTTTCAAC TCAAGAGCAG CTATTTCGCC GCTTTACGCG AGATCCGCAG ATTCTCAAAC TATTTACGCT TTTAAATCAA AATATGACCA CTTGTGGATT GGATCAGACT CCTGCCATCG CCGGCCCAAT GATCCACGTG GAATCTTACA CTGGTGGTTG CTATTATGCT CAAGGCTCAC CACAGATATT GGCGAATAAA CTAGAAAAAG CCATTCATAA ATATGGTGGG CAAATATTAT ATCGCAATCG CATTGACAAG ATCTTAATTG AGAAAGGCAA GGCGATTGGC TGTCAACTTG AAAATGGTCT TAAGATTAAA GCTCATACAG TAATCTCAAA CACAACGATC TGGAATCTCT ATGGTAAGTT GATCGATCCC CAGCATATCT CCCGCAGAAA AAGAAAGTGG GCAAAGAAAT TTAGGCCAAT TTATAGTGTA TTTGGTGTTT ATCTCGGTGT TAAAGCAGAA GCTGTCCCTC CTGCGATTAA GCCTACACAA ATACTGCCCT ATGACGACAA AGGTTCTTCA AGTTATCTGA CAGTCTATGT CACGTCGATG TTGGATCCTG ATGCTTCTCC GCCAGGGACC CATACACTCT GCATATTTCT ACCGGAATCT ACTCCAGAGA TTACAGTGCC AACAGATGGC AAAGATAAAT ATCATAGCCG TACATATCGA GATCAAAAGC AGAAAAAAGC ATCAGCAATT ATTGATTACT TAGAAAAGAA TTATTTCCCA GAACTAAAAA AACATATCTT GGTTCAGGAA ATAGCTACCC CACAGACTAT TCAGCGCTAT ACCCTTAAAA GTCATGGATC TATTGGTGGA CCACAGGTGA ATATGAGCCA AAGCTATATG AGTCGGCTGG CTGCTCGCAG TGATTGGCAA GGTCTTTATT GTGTAGGGGA TTCCACCAGT CAAGGGATCG GCGTAGTTTC TGTGACTGTG TCCGCAATAA GTGCAGTCAA TGCAATTTTA AAGGATCTTC GTCAACCTCA ATATCTTCCA TTAAAACATT ATCCTCAAAA TTATGTGCAT TTTGCTAAAG CCTCTCTATC TCAAGAACAA CACTCTGCGT ATTTGTTCCC AGACAGTCAG ATTATACAAA CTGTGGAATT AGAGCCCCGA GCTTGTCTCT CTCCGCGCTG CGTTCGTGCT GGTCCAGACA GTACTCATCA AGCCCATATC GCACGTCTTG TCGAAGCCGG TAATTGGCTT GGTGCTGCCC AGTCATTACG AGCCGTTAAT 
CCACTTTCAG AAATAACCTC GTACCTTTCT CAATCAGACG ATTTTTGTGG TCCTTCATGT TCGCAATTGT TATGTCCTGA CGCTTCTATT CCCATTAAGT CATTAAATCG CTATGTGTGC GAGAAAGTCC CCAATTACAT ACCAGATGTA GCTCCGGATA ATGGTAAGCG AATTTCTATT GTTGGGGCCG GGCCTGCTGG TCTAACATGC GCTCACTATT TGGCGCGTTT AGGCTACCAA ATAGATATCT ACGAGAAGCA AAGCGAATCT GGAGGGATTT TGAAGAGGAT TACTCTTGCA TCAAGGATTC CTCAGTCAGT CTTGCATCGT GAAATATCAA ACCTATTATT GCCTTCTATT AGGATTTATT TTGATCAATC TTTAGGTGAA GATATCACAA TTGCTGAGCT TAGACTTCAA TATGATGCAA TCTTTTTAGC CTGTGGTCTA GGAGAGAAGC AAATTCAGCC TAAAGATATC GATCAAGATT TCAACTTGAT TCATGGTCTG AAGTTTTTAG ACCAATTTAC TCAAGAGCCA GCTATTGTTA GAGAAAAGAT ATTGACAATA GTTGGTGCTA CCTACCTTGC TACGGATATC TCAAAGTTGG CAATTCAAAA TGGTGCAAAG AAGGTTTGTC TTATTGACGA GCTGTCTGAA CTGCAATCGA AATCTCAGGC TAAGCGATTA ACAGAAATGC AAGAACTTGG AATTGAAATC CACTCCAGAA TAGATCCAAT AGCTTTCTCT GAACTCTGTA GTTCTTCTCA TCAGGTGATC ATGGCAGGTT TGGAGCAACA GCGTATTCAG TCTGAATTAA GGGACCATCT AAGTACAAGT CTTGTGGTAG ACATTGATGA CTGGGTTGAC TTAGAGACGC TTCAGGTTCA TGGTCAGGTC AATGTATTCG CCGGTGGAGA TATCATTAGA GGCTCCAGTA GTATGCAAGA ATCAATTAGA GACGGTCGCA AGGCAGCAGT AGAAATCAAT CATGCCTTAA TGAAGAATCA ATCTTGA
|
Protein sequence | MHVLDRDILL PSYDVLVVGA GIGGLTAAAL LAKRGYRVLV VEQHYLPGGC ASIFRRQGFT FDVGASLFFG FGEKGYNPHQ FVMNELEEDI TLVPMDETFT IHLDQKTKVS MYTQRERFWE EMCSCFPHQS EQIKALLREF ESFYHDSLDS YGGQFFAPAE TPPQHGINLM LTRPFYLARL LNYLLSTQEQ LFRRFTRDPQ ILKLFTLLNQ NMTTCGLDQT PAIAGPMIHV ESYTGGCYYA QGSPQILANK LEKAIHKYGG QILYRNRIDK ILIEKGKAIG CQLENGLKIK AHTVISNTTI WNLYGKLIDP QHISRRKRKW AKKFRPIYSV FGVYLGVKAE AVPPAIKPTQ ILPYDDKGSS SYLTVYVTSM LDPDASPPGT HTLCIFLPES TPEITVPTDG KDKYHSRTYR DQKQKKASAI IDYLEKNYFP ELKKHILVQE IATPQTIQRY TLKSHGSIGG PQVNMSQSYM SRLAARSDWQ GLYCVGDSTS QGIGVVSVTV SAISAVNAIL KDLRQPQYLP LKHYPQNYVH FAKASLSQEQ HSAYLFPDSQ IIQTVELEPR ACLSPRCVRA GPDSTHQAHI ARLVEAGNWL GAAQSLRAVN PLSEITSYLS QSDDFCGPSC SQLLCPDASI PIKSLNRYVC EKVPNYIPDV APDNGKRISI VGAGPAGLTC AHYLARLGYQ IDIYEKQSES GGILKRITLA SRIPQSVLHR EISNLLLPSI RIYFDQSLGE DITIAELRLQ YDAIFLACGL GEKQIQPKDI DQDFNLIHGL KFLDQFTQEP AIVREKILTI VGATYLATDI SKLAIQNGAK KVCLIDELSE LQSKSQAKRL TEMQELGIEI HSRIDPIAFS ELCSSSHQVI MAGLEQQRIQ SELRDHLSTS LVVDIDDWVD LETLQVHGQV NVFAGGDIIR GSSSMQESIR DGRKAAVEIN HALMKNQS
|
| |
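The gene and protein sequences above can be related with a minimal sketch that strips the 10-character grouping, computes GC content, and translates codons. It uses the codon assignments of the standard genetic code, which translation table 11 shares for elongation codons; table 11 additionally allows alternative start codons, which this sketch does not handle (the gene here starts with ATG, so that does not matter for this record). The helper names are hypothetical, and only the first 30 bases of the record are used as a demonstration.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); amino-acid string as listed
# for the standard code, whose codon assignments table 11 also uses.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used to group the sequence and uppercase it."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGCACGTGC TTGACCGCGA TATTTTGCTG"
print(translate(prefix))   # MHVLDRDILL
print(gc_content(prefix))  # 0.5
```

The translated prefix matches the first ten residues of the protein sequence row. Passing the full gene sequence to the same functions would allow the Protein sequence and GC content rows to be checked in the same way.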