Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_02941 |
Symbol | |
ID | 4776328 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 304258 |
End bp | 309288 |
Gene Length | 5031 bp |
Protein Length | 1676 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640085796 |
Product | hypothetical protein |
Protein accession | YP_001016312 |
Protein GI | 124022005 |
COG category | [H] Coenzyme transport and metabolism [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG2226] Methylase involved in ubiquinone/menaquinone biosynthesis [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | [TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.673691 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAATCAA AAACCATTGA GGAGCTCTCA TCCACTGGAC GCCATCAAGA TTGCCTACAG GCTTGCCAAC AGCTTCTCCA AAGCGAACCC GAAAACCCAT TACCCTGGAA GTACGCAGGT AAATCTCTTC TAGCCCTAGG ACAACCTGAA AAGGCTCAAC AGTGCCTAGC CAAAGCTCAT CAGCTCGATA CAACCGATCC AGAGACAATC AAAGATATTG GCAATATTTT CAATGCTCTC CAAAATGACG CGGAAGCAAT AAGACTCTAT AAAGCAGCTC TTTTAATCAA CCAGAATTAC GCCCCAGCAA TCAACAATCT TGGGTTGATT GCTAAACGGC AAGGGGACCT ATTCGCCGCA GAGCAGTTAG TCAAAAGGGC TTGTGATCTA GATCAATCAT TCGCGCCGTA CCACATGAAC CTGGGCGGTA TCTACAAAGA CCTCGGCAAC CTTGATCAGG CTCTTGCCTC CACTCTCAAA TCCCTAGAGC TCCAACCTGA TAACCCCACT GCCCACATGA ACCTGGGCGG CATCTACAAA GACCTCGGCA ACCTTGATCA GGCTCTTGCC TCCACTCTCA AATCCCTAGA GCTCCAACCT GATAACCCCA CTGCCCACAT GAACCTGGGC GGCATCTACA AAGACCTCGG CAACCTTGAT CAGGCTCTTG CCTCCACTCT CAAATCCCTA GAGCTCCAAC CTGATAACCC CACTGCCCTC ATCAACCTAG GCGGCATCTA CAAAGACCTC GGCAACCTTG ATCAGGCTCT TGCCTCCACT CTCAAATCCC TAGAGCTCCA ACCTGATAAC CCCACTGCCC ACATGAACCT GGGCGGCATC TACCAAGACC TCGGCAACCT TGATCAGGCT CTTGCCTCCA CTCTCAAATC CCTGGAGCTC AAACCTGATA ACCCCACTGC CCACATGAAC CTGGGCGGCA TCTACCAAGA CCTCGGCAAC CTTGATCAGG CTCTTGCCTC CACTCTCAAA TCCCTAGAGC TCAAACCTGA TAACCCCACT GCCCACATGA ACCTGGGCGG CATCTACCAA GACCTCGGCA ACCTTGATCA GGCTCTTGCC TCCACTCTCA AATCCCTAGA ACTCAAACCT GATAACCCTG ATACCCTCAT CAACCTAGGC GGCATCTACA AAGACCTCGG CAACCTTGAT CAGGCTCTTG CCTCCACTCT CAAATCCCTG GAGCTCAAAC CTGATAACCC CACTGCCCAC ATGAACCTGG GCGGCATCTA CCAAGACCTC GACAACCTTG ATCAGGCTCT TGCCTCCACT CTCAAATCCC TAGAGCTCAA ACCTGATAAC CCTGATACCC TCATCAACCT AGGCGGCATC TACAAAGACC TCGGCAACCT TGATCAGGCT CTTGCCTCCA CTCTCAAATC CCTGGAGCTC AAACCTGATA ACCCCACTGC CCACATGAAC CTGGGCGGCA TCTACAAAGA CCTCGGCAAC CTTGATCAGG CTCTTGCCTC CACTCTCAAA TCCCTAGAGC TCAAACCTGA TAACCCCACT GCCCACATGA ACCTGGGCGG CATCTACCAA GACCTCGGCA ACCTTGATCA GGCTCTTGCC TCCACTCTCA AATCCCTAGA ACTCCAACCT GATAACCCTG ATACCCTCAT CAACCTAGGC GGCATCTACA AAGACCTCGG CAACCTTGAT CAGGCTCTTG CCTCCACTCT CAAATCCCTA GAGCTCAAAC CTGATAACCC TGATACCCTC ATCAACCTAG GCGGCATCTA CAAAGACCTC GGCAACCTTG ATCAGGCTCT TGCCTCCACT CTCAAATCCC TAGAGCTCAA ACCTGATAAC CCCACTGCCC ACATGAACCT GGGCGGCATC TACCAAGACC TCGGCAACCT TGATCAGGCT CTTGCCTCCA CTCTCAAATC CCTAGAGCTC AAACCTGATA ACCCTGATAC CCTCATCAAC CTAGGCGGCA TCTACAAAGA CCTCGGCAAC CTTGATCAGG CTCTTGCCTC CACTCTCAAA TCCCTGGAGC TCAAACCTGA TAACCCTGAT ACCCTCATCA ACCTAGGCGG CATCTACAAA GACCTCGGCA ACCTTGATCA GGCTCTTGCC TCCACTCTCA AATCCCTAGA GCTCAAACCT GATAACCCCA CTGCCCTCAT CAACCTGGGC GGCATCTACC AAGACCTCGG CAACCTTGAT CAGGCTCTTG CCTCCACTCT CAAATCCCTA GAGCTCAAAC CTGATAACCC CACTGCCCAG ATGAACCTGG GCGGTATCTA CAAAGACCTC GGCAACCTTG ATCAGGCTCT TGCCTCCACT CTCAAATCCC TGGAGCTCAA ACCTGATAAC CCCACTGCCC ACATGAACCT GGGCGGCATC TACAAAGACC TCGGCAACCT TGATCAGGCT CTTGCCTCCA CTCTCAAATC CCTAGAGCTC AAACCTGATA ACCCCACTGC CCACATGAAC CTGGGCGGCA TCTACAAAGA CCTCGGCAAC CTTGATCAGG CTCTTGCCTC CACTCTCAAA TCCCTGGAAC TCAAACCTGA TAACCCTGAT ACCCTCATCA ACCTAGGCGG CATCTACAAA GACCTCGGCA ACCTTGATCA GGCTCTTGCC TCCACTCTCA AATCCCTAGA GCTCAAACCT GATAACCCTG ATACCCTCAT CAACCTAGGC GGCATCTACA AAGACCTCGG CAACCTTGAT CAGGCTCTTG CCTCCACTCT CAAATCCCTA GAGCTCAAAC CTGATAACCC TGATACCCTC ATCAACCTAG GCGGCATCTA CAAAGACCTC GGCAACCTTG ATCAGGCTCT TGCCTCCACT CTCAAATCCC TAGAACTCAA ACCTGATAAC CCTGATACCC TCATCAACCT AGGCGGCATC TACAAAGACC TCGACAACCT TGATCAGGCT CTTGCCTCCA CTCTCAAATC CCTAGAACTC AAACCTGATA ACCCCACTGC CCACATGAAC CTGGGCGGCA TCTACAAAGA CCTCGGCAAC CTTGATCAGG CTCTTGCCTC CACTCTCAAA TCCCTGGAGC TCAAACCTGA TAACCCTGAT ACCCTCATCA ACCTAGGCGG CATCTACAAA GACCTCGGCA ACCTTGATCA GGCTCTTGCC TCCACTCTCA AATCCCTGGA GCTCAAACCT GATAACCCTG ATACCCTCAT CAACCTAGGC GGCATCTACA AAGACCTCGG CAACCTTGAT CAGGCTCTTG CCTCCACTCT CAAATCCCTA GAACTCAAAC CTGATAACCC TGATACCCTC ATCAACCTAG GCGGCATCTA CCAAGACCTC GGCAACCTTG ATCAGGCTCT TGCCTCCACT CTCAAATCCC TAGAGCTCCA ACCTGATAAC CCCACTGCCC ACATGAACCT GGGCGGCATT TACCAAGACC TCGGCAACCT TGATCAGGCT CTTGCCTCCA CTCTCAAATC CCTAGAACTC AAACCTGATA GCCCTGGCGC GGTCAACAAT CTCAAAGCCT TCATTGAGCA ACTAAACTTG AGTCAATCCA ACGCCAAGAA TCTCGAACGA GCATATGAAT TACTACTAAA CCAGACGGAC TTATCACACC AAAAGCTATC AAAAATATTC CTACAGGCAT TCCTCCCGAC AATTCAGAAC GCATCGGCAT CAGATCCAAT TATCTCTGAA GGCAATGAGG CATTAAAAGC ACTAGCCGCC GACTGGAGAT TTCGCAAATC CCTGACCTTA ATGATTCCAC CAAGCGTAGA AGCTGAAGGA TTTTTCACCA GATTAAGGAA GGAGCTTCTA ACACTGACCA TTAAGGAAGG GACAATTCTA CCGCAACTCA AGCCTTTAAC GGAAGCTTTA GCCACGCAAT GCTTTCTCAA TGAATATGTT TATGCGTCAT CACCAGAAGA GGATGACTCT ATAGCTAAAC TCATTGAGGC AGCAGTTCAC AACCACGAAG ACACCAATCG ATATCTTGCG ATTATAGGCT GCTACAAAGC GATTTATACA ACAGACATAA GTCCGGAATT CATCAATAAA TACCCCACCT CTGATGGCAG CAGCAAAGAA CTAATTACAG CTCAATTCAA AGAACCGCTC CTGGAGCAAG AGATCAAAAC TTCCTTGCGA GAAAAGCACA ATATTACCGA CGCAACCTCC CAAAAAGTTC AAGAAATGTA CGAAGAGAAT CCATATCCTA GATTCAGATT TTCCAGCTAC ACAGACAGCA AACTAGCAAA TTCAATTTGT AAATCTATTG AGATTGAGGC AACCAGAAAA GACCTATCTT TCATAGAAGA ACTAAAATCT CCTGCCTCCA CACCAAAAGT CCTCATTGCT GGCTGCGGTA CTGGCAATCA AGTGATCGGG GCAAGCCGAT ATAAAAATGC TCAAATCACA GCGATTGATT TCAGCGGCAG CAGCCTGGCA TACGCAATCA GAAAGACTAA GGAATATGGA ATGAACAATG TAACGTTCGA AAAAATGGAT CTTCTCAATG TCGCCGAGCT CGGCGACTTA TTCGACATTG TTGAATGTAG CGGTGTTCTT CACCATATGG AGAAGCCAGG CGAAGGATTA TCCGCACTTG TACGACAACT TAAACCTGGT GGGTATATCA AGATCGGACT CTACAGCGAA ATTGCTCGCA AGATCATCGT GGAAGCACGC AAGACTATTC AAACGTTAGA AATTGACAGC AGCCCAGAAG GAATTAGAAG ATTCAGGAAG CAAGTTCTTG ATGAAGAAAT TAAAGAACTC CTGGCTCTTC CTAAATTCGG AAGAGATTTT TATTCACTCT CGGAATGCCT GGACCTTTGC TTCCATGCCC AAGAGCATCG CTTCACAACA GAGTCACTCC AAAAACTCTT GGATTCCCAT GGTCTGACTT TTTGCGGATT CATGGTGCCA GAGCAGATCA AGAAGCTATA CCAAGAAAAA TATCCAGAAG ACAGCAATAT GACTTCATTA TCTAACTGGG GAGAATTCGA GGGAGAGCAT CCCTCAACCT TCACAAGCAT GTATCAGTTC TGGGCACATA AGCAGTCCTA G
|
Protein sequence | MKSKTIEELS STGRHQDCLQ ACQQLLQSEP ENPLPWKYAG KSLLALGQPE KAQQCLAKAH QLDTTDPETI KDIGNIFNAL QNDAEAIRLY KAALLINQNY APAINNLGLI AKRQGDLFAA EQLVKRACDL DQSFAPYHMN LGGIYKDLGN LDQALASTLK SLELQPDNPT AHMNLGGIYK DLGNLDQALA STLKSLELQP DNPTAHMNLG GIYKDLGNLD QALASTLKSL ELQPDNPTAL INLGGIYKDL GNLDQALAST LKSLELQPDN PTAHMNLGGI YQDLGNLDQA LASTLKSLEL KPDNPTAHMN LGGIYQDLGN LDQALASTLK SLELKPDNPT AHMNLGGIYQ DLGNLDQALA STLKSLELKP DNPDTLINLG GIYKDLGNLD QALASTLKSL ELKPDNPTAH MNLGGIYQDL DNLDQALAST LKSLELKPDN PDTLINLGGI YKDLGNLDQA LASTLKSLEL KPDNPTAHMN LGGIYKDLGN LDQALASTLK SLELKPDNPT AHMNLGGIYQ DLGNLDQALA STLKSLELQP DNPDTLINLG GIYKDLGNLD QALASTLKSL ELKPDNPDTL INLGGIYKDL GNLDQALAST LKSLELKPDN PTAHMNLGGI YQDLGNLDQA LASTLKSLEL KPDNPDTLIN LGGIYKDLGN LDQALASTLK SLELKPDNPD TLINLGGIYK DLGNLDQALA STLKSLELKP DNPTALINLG GIYQDLGNLD QALASTLKSL ELKPDNPTAQ MNLGGIYKDL GNLDQALAST LKSLELKPDN PTAHMNLGGI YKDLGNLDQA LASTLKSLEL KPDNPTAHMN LGGIYKDLGN LDQALASTLK SLELKPDNPD TLINLGGIYK DLGNLDQALA STLKSLELKP DNPDTLINLG GIYKDLGNLD QALASTLKSL ELKPDNPDTL INLGGIYKDL GNLDQALAST LKSLELKPDN PDTLINLGGI YKDLDNLDQA LASTLKSLEL KPDNPTAHMN LGGIYKDLGN LDQALASTLK SLELKPDNPD TLINLGGIYK DLGNLDQALA STLKSLELKP DNPDTLINLG GIYKDLGNLD QALASTLKSL ELKPDNPDTL INLGGIYQDL GNLDQALAST LKSLELQPDN PTAHMNLGGI YQDLGNLDQA LASTLKSLEL KPDSPGAVNN LKAFIEQLNL SQSNAKNLER AYELLLNQTD LSHQKLSKIF LQAFLPTIQN ASASDPIISE GNEALKALAA DWRFRKSLTL MIPPSVEAEG FFTRLRKELL TLTIKEGTIL PQLKPLTEAL ATQCFLNEYV YASSPEEDDS IAKLIEAAVH NHEDTNRYLA IIGCYKAIYT TDISPEFINK YPTSDGSSKE LITAQFKEPL LEQEIKTSLR EKHNITDATS QKVQEMYEEN PYPRFRFSSY TDSKLANSIC KSIEIEATRK DLSFIEELKS PASTPKVLIA GCGTGNQVIG ASRYKNAQIT AIDFSGSSLA YAIRKTKEYG MNNVTFEKMD LLNVAELGDL FDIVECSGVL HHMEKPGEGL SALVRQLKPG GYIKIGLYSE IARKIIVEAR KTIQTLEIDS SPEGIRRFRK QVLDEEIKEL LALPKFGRDF YSLSECLDLC FHAQEHRFTT ESLQKLLDSH GLTFCGFMVP EQIKKLYQEK YPEDSNMTSL SNWGEFEGEH PSTFTSMYQF WAHKQS
|
| |