Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_24081 |
Symbol | |
ID | 4775935 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2118119 |
End bp | 2120413 |
Gene Length | 2295 bp |
Protein Length | 764 aa |
Translation table | 11 |
GC content | 43% |
IMG OID | 640087929 |
Product | hypothetical protein |
Protein accession | YP_001018406 |
Protein GI | 124024099 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAATCAAG AGGAGATTAT GCAGCAGCTG CAGGCAGCGG TTGCATTGCA TAACCAGGGT GAGCTTGATC AGGCAGAAGC GATTTATAGG CAAGTGCTTG CTGTTGATGA AAATAATTTT TATGCACTTA ATTTCTGCGG ATGTATTCAG CGCGAAAAGA AGAGATTCGA TGACGCGATT ACCTTGCTGA GCAGTGCAGT CTCCGCTCAG CCAGGTAATC CAGATGCTAA CTACAATCTT GGAAATGTCT TTAAGGACGC TGAGCGATGG GATGAAGCTA TCTCTTGCTA CGAGAAGACG CTTGACTTAA AAGCAGAGTA TCCAGAAGCA CTGAATAACC TGGGAATTTG TTTAAAGGAG ACTGAGCAAT ATGAGCATTC AGAGATTGTC CTGAAGCGTG CTATTTCGAG GCAGCCTAGG TTTGCAGCTG CCTGGCTCAA CCTAGGTAAT ACGCTCAAGG AGCAGAAAAA GTATTCAGAA GCGATTGTGA GTTATCGGAA CGCGATCGAG GTGAAGCCTG ATTTTGCGGA GGCTTATCTA AATTTAGGGA ATGTGTTGAA GGAGGAGGGA GCTGTAGAGG AAGCGATCGC AAGTTATCGG AAGGCAATTG AAGTTAAGCC TGATTGTGCT GGTGCGTATT TTTCTCTTGG TTTCGTGTTG AAGGGAGAGG GAGAAGTTGA GGAAGCGATT GTGAGTTATC GGAACGCGAT CGAGGTGAAG CCTGATTTAG CGGAGGCTTA TCTAAATTTA GGGTATGTGT TGAAGGAGGA GGGAGATGTA GAGGAAGCGA TCGCAAGTTA TCGGCAGGCA ATTGAGGTGA AGCCCGAATT TGCGGATGCG TATTTGAATT TAGGGAATGT CTTGGAAGAG GAGGGAGAGA TTGAGGAAGC GATTGCAAGT TATCGGCAGG CGATTGAGGT GAATCCAGAT TTTGTAGAAG CCTACTCCGA TCTGGGAAAG TTATTCTATG AGGGAGGGGA CTACATGTCT AGTATTGAAT TTTTTCAAAA AGCACTGTCG CTAGATAAAA ACCATCTAAA GAGTGCAGCT ACTTTGGGAT TTAGTTTTTT CAGATGTGGT CAGATTGATG CTGCTATATC CTATTATTCA GCCAAGCTAC ATTCGGGATT TTGCAGTATC TCAGCATATT ACTTGTTTCG AGCCGTAGAT CAAATAATTC CTTCCAAAAT TAATTCAGAT TTTGACATTG CATCTGAGTT TTGTGCTTCT GCTATGCAGT ATATGGGCGT TGCTTCGATT TTGGCTTTTG GAGACTCGCA CGTCAATGTC TTTTCTGGTA TTGAGGGCAT TGATGTCCAT CATGTCGGTG CATCTACTGC TTACAACTTA ATGTCAGACA AATCTAGTAG TGGAGGCTCG AAAGAAGTTT GGTCTAGACT AAAGAAAGTA GGTCGTAATA AATCTTCGAC GGCTGTTTTG CTTTGTTTTG GCGAGATTGA TTGTCGTAAT CATGTCGTGA TGCAGTGCTA TTCAAGAGGT ATTTCTATAG AAGAATCGGC TTCTAATGTG GTTTCTAGAT ATTTACAGTT TATTAATAGT CTGCTTAGTG CTGGTTTTAA AGTAGTCGTC TATGGATGCT ATGGATCGGG CTCTCATTTC AATGCTGTTG GCTCTGAAGT CGAAAGGAAT CTTGCGGCTC TAGAAGTAAA TCGTCTGTTG GGGAAGGGCT GTGATGACCT CAACTCTCCG TTCTTTTCCT TGAATGATTC GTTTGTTGAC GAATTTGGGC TTACAAGAAG GTACCTCATG CAAGATGATT CGCATTTACC AGAAAAAGGG AGAGCTGGTA TTGAGATTAG GTCACTTCTA ATGGCTAATT TGATGAAATC CTGTAAAGCG ATGTTTTCCT TATCGTCTAT ACATTCCTTA CATTCTTCTT CGGAATGGGA GGTGTCTTAT TGTAATTCTG TAATTGTCAA TATGGGGGAG GCCTCTCAGT CTTTTTGTCA TTGGGATGAG AATGGATTTC TTGCTTTGGG GGTGCATTCC GTCTCTAGTC TTTGGTTTGA TATGGGGGCT CAGCTTGTAG TTAAAGGTTT TGGGTTGGAA TTTGACTCCA AGCCGAGTTT CCCGCCTGAC TCATCGTGCC CATTCTTTTT CAGGCTTGAT GGGCAAGAGT TGTTAGTAAA GAAGGCTGAA TATAGCTCTG GAGAGTTAAA GGTTGAAATC GGCAATTCTT ATGGTAGGTA TATCGAGATT TTGCCGTCCG TTGCTGGTGG AGGAATTCTG GACTGTGTTT CCCGAATGCT TCCTGCTGTT GGCAGGGCTA TTTGA
|
Protein sequence | MNQEEIMQQL QAAVALHNQG ELDQAEAIYR QVLAVDENNF YALNFCGCIQ REKKRFDDAI TLLSSAVSAQ PGNPDANYNL GNVFKDAERW DEAISCYEKT LDLKAEYPEA LNNLGICLKE TEQYEHSEIV LKRAISRQPR FAAAWLNLGN TLKEQKKYSE AIVSYRNAIE VKPDFAEAYL NLGNVLKEEG AVEEAIASYR KAIEVKPDCA GAYFSLGFVL KGEGEVEEAI VSYRNAIEVK PDLAEAYLNL GYVLKEEGDV EEAIASYRQA IEVKPEFADA YLNLGNVLEE EGEIEEAIAS YRQAIEVNPD FVEAYSDLGK LFYEGGDYMS SIEFFQKALS LDKNHLKSAA TLGFSFFRCG QIDAAISYYS AKLHSGFCSI SAYYLFRAVD QIIPSKINSD FDIASEFCAS AMQYMGVASI LAFGDSHVNV FSGIEGIDVH HVGASTAYNL MSDKSSSGGS KEVWSRLKKV GRNKSSTAVL LCFGEIDCRN HVVMQCYSRG ISIEESASNV VSRYLQFINS LLSAGFKVVV YGCYGSGSHF NAVGSEVERN LAALEVNRLL GKGCDDLNSP FFSLNDSFVD EFGLTRRYLM QDDSHLPEKG RAGIEIRSLL MANLMKSCKA MFSLSSIHSL HSSSEWEVSY CNSVIVNMGE ASQSFCHWDE NGFLALGVHS VSSLWFDMGA QLVVKGFGLE FDSKPSFPPD SSCPFFFRLD GQELLVKKAE YSSGELKVEI GNSYGRYIEI LPSVAGGGIL DCVSRMLPAV GRAI
|
| |