Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_24661 |
Symbol | |
ID | 4776086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2165046 |
End bp | 2168237 |
Gene Length | 3192 bp |
Protein Length | 1063 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 640087986 |
Product | hypothetical protein |
Protein accession | YP_001018462 |
Protein GI | 124024155 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGGCGGCGA ACAGTCCCCC AACCCTTCTC GTTGGACTCT TACTTGCGAC AGGGTTAATC CCTGCTGAAG CAAATTCTCA GTCAAGCAAC TCATACAAAG AAGGCACCTC CTCGCCCACC CAAGCGGCTC CCATCGCAAG ACCAGCGCGG CAGCTGGGGC AATCAACAGG CCTTCAAAGA CAGCCCAGCC TTCCAGAACA ATCTCCCGAA CCGCCCCAAG AGCTCAAACT TCGGGCTGAT CAACAGCGCT ATGACGCCCG GCAAGAGCGT TTTATCGCTG AAGGTCGAGT GAGGGCCGTT CTCAACGGCG GTGTCCTCAA GGCAGATCGC ATTGAGTTCG ACAGCAACTT CAACACTCTT TATGCAAGGG GGAGTGTTCG TTTCCGCAAG GGATCTCAGT ACTTCCAAGC CAGTTCCCTG CGTTACAGCC TGATTCAAAA AGAAGGGGAA CTAGAGGACG TTTATGGCGT TCTAGATCTC GATACCGCAG CGAGCGATCT GAAACCTAAT TCTCAAGTAA CTAATCAAAC TCCTAAGGGC ACCTCAGCGC AATGGACGCG CCTACCTACA CAAGGAAGTG AAAGCCTCGG ATTTCCCACA GCATTAGATA TTGAGCTGAA CAGCACCACA GCTGGATTTA TCAGCAACCA GAACGATTCG GGCACGATCT GGGATCAACC ACTTGCCCCC AATAACAACT GGATTCAACA AGACAACAAA AGTAAATCCG GCATCGCCTG TCCACCTGTG CTGCCGCCAA TACCAGATTG GCATCCCCAT CCCTGGGCCG TAACGGCTTG GGGTGGCCAG ATGATTGATT CAAATTTTGG AGACACCTTT CTGTTCAATG GCCGAATCAG GGATGAATAT CTCCTGGGCC TGGGTCTGCA GAAAAGAATC GCTCGTGCAG GCCCCTTCGC TCTTGAACTA GAAGCAGACC TCTTTGCTCA CCATGCCAAC CAACAAGCTG GAGGAGCATT CAATCAATCA GTACCATTTG CAGACACACC TGCCCAGAGC TTTGGTGAAG GTGTGCTGGG TATCGGAGCA CGGCTATGGG TGCAACCGTG GTTGAGTTTT GGCGTGATCG AGGGAATCAG CTTCAACACG GCCTATAGCA ACTATGAGAA GACCTACCGC GAGAACTATG CAAAATTGCT CAACTATCTG AGCTTTGAGC TTGAAGCCGC AGTCTCACAA CAACTCTCTC TTGTGGGACG CATTCACCAT CGCTCAGGAG CCTTCGGCAC CTATAGCGGT GTCAAGGAAG GAAGTAATGC TTACCTTGTT GGCTTGAGAT ACCGCTGGGG CAAAGACAAC ACAGCACCGC AGCAAGCTGA CGTTCCGCCA CCACTGGGTT GTCCTGATCC CGATCGTGCT AATCGAACAC CTCGCCAAGG ACTCCAAGAG ACCCTTGAGT CGATCACCCT TGGGGAGGGT GACCCTAAAG CCCAAGCAGA GGCTCTCCCT CTTGGCACTA CGCCTCCCCA AGCTGTTGCC ACACGACAAA GACTCGCCCA GACGAATTCA TCGACGCTCT CTCCAGCAGA GCAAGAAGCA TTGCGCGCCA AGGCAATTGC CAAGATCGAT CAACGTATTA GCCGCATCCG GTTCCAACGA GCTCTCACCA TTGAGAGGCG ACAGGGCGTT GGTAACACAA CAGGCAATAT CGCAGAGAAA AACAAATACG GCGGAATTGG CGCTTCTCAG CTAAAGCAAC AGGGAGCGAC CAAACTCATC ACTGGTTCTA TCAGCCGCTG GAGGATTCAA GCCGCCAAGG TCACGATCAG CCCAGAGGGA TGGAAAGCAG ACCGCATGGG CTTCTCCAAT GATCCCTACA CACCTTCTCA AACGCGTATT GATGCCGAAG ATGTGATTGC CACTGAAGAG CCTAGTGGAG ACATTGTGAT CCAAAGCCGC CGCAATCGAC TAATTGTAGA GGAGCGTTTT CCTATCCCAG TCTCCAGAAC CCAAACCATC AAAAAACAGG AAGAGGTTGA GAACCGTTGG GTGTTTGGCA TCGACAATGA AGACCGAGAT GGTTTCTTTG TCGGCCGCGA TCTAAACCCT ATCGAGCTAA CAAAGAACTA CACGCTCTCC CTCCAGCCTC AATTCCTTCT AGAGCGAGCC ATCGACGGAG AGACAAAAAG CTATGTCGCT CCTGGCACCT CGATTGATAG CACAAAAACA ACCCAGCCAA TTACGGCTGC TGACTTATTC GGACTAGAAG CAGAACTAAC TGGGAACACC TGGGGGTGGG ATGTTGATAT TAATGCTAAT ATAAGCACCT TTAACCCTCA AAATATTGCT GATGGAAGCC GTTACTGGGG TGATTTAAAA AGGAAATTTG ATATCCCCTG GATTGGCTCT TTGGAGGCTA ACCTTTTTGC GGCTTATCGC TACGAAGCCT GGAATGGATC CCTAGGAGAA ACTGATATTT ATTCAGCCTA TGGTGCCTTC CTCCAAAAGA CAGGTAATTG GTCATGGGGC AAACTGACTA ACAACTATCT CCTTAGAGCA GGCGCAGGTA ACTATCAAGC AGAAAAGTTC AAAAGTGAGA ATCTTGCCGA TCTCTGGCGC GCTAATTTCT ACGGGTCACT AAATAGCAGC TATCCACTCT GGAAAGGCAA ATCTGCTGCT CTAACACCTG AAGCAGCTTA CCGCTACTCA CCTATAGCAA TCGTACCTGG GCTTAGATTC AACACTAATC TCAAAACCAC CTTTGATGCC TATGGCGATG GAGAGAGAAA AGCCACCATT GGTCTAACCG GAGGGCCAGC ACTCACACTT GGGACATTCA GCAAGCCCTT CTTAGATTTT ACTCGTTTAT CTATTAGTGG TGGGGGCACT TTAATCCAAG GCTCTAGCCC ATTTAAATTT GACCAAAATA TTGATCTTGC CACCCTAGGG ATCGGTCTAA CGCAGCAAAT CGCGGGGCCT TTAATTCTTA ATACCGGTGT CGCTTACAAC GTAGATCCTG ATTCACCTTA TTACGGAGAC ATCATTAACT CAAACATCGA ACTGCGCTGG CAACGTCGTT CCTATGACTT CGGTTTCTAC TTCAACCCCT ACAAAGGCAT CGGCGGCTTT CGTTTCCGTC TTAATGATTT CAACTTCACC GGCACCGGCG TACCTTTTGT TCCCTATACA CCCATCAATC AATTCGATCA ATTTGAAGAG CACCTCTTCT AA
|
Protein sequence | MAANSPPTLL VGLLLATGLI PAEANSQSSN SYKEGTSSPT QAAPIARPAR QLGQSTGLQR QPSLPEQSPE PPQELKLRAD QQRYDARQER FIAEGRVRAV LNGGVLKADR IEFDSNFNTL YARGSVRFRK GSQYFQASSL RYSLIQKEGE LEDVYGVLDL DTAASDLKPN SQVTNQTPKG TSAQWTRLPT QGSESLGFPT ALDIELNSTT AGFISNQNDS GTIWDQPLAP NNNWIQQDNK SKSGIACPPV LPPIPDWHPH PWAVTAWGGQ MIDSNFGDTF LFNGRIRDEY LLGLGLQKRI ARAGPFALEL EADLFAHHAN QQAGGAFNQS VPFADTPAQS FGEGVLGIGA RLWVQPWLSF GVIEGISFNT AYSNYEKTYR ENYAKLLNYL SFELEAAVSQ QLSLVGRIHH RSGAFGTYSG VKEGSNAYLV GLRYRWGKDN TAPQQADVPP PLGCPDPDRA NRTPRQGLQE TLESITLGEG DPKAQAEALP LGTTPPQAVA TRQRLAQTNS STLSPAEQEA LRAKAIAKID QRISRIRFQR ALTIERRQGV GNTTGNIAEK NKYGGIGASQ LKQQGATKLI TGSISRWRIQ AAKVTISPEG WKADRMGFSN DPYTPSQTRI DAEDVIATEE PSGDIVIQSR RNRLIVEERF PIPVSRTQTI KKQEEVENRW VFGIDNEDRD GFFVGRDLNP IELTKNYTLS LQPQFLLERA IDGETKSYVA PGTSIDSTKT TQPITAADLF GLEAELTGNT WGWDVDINAN ISTFNPQNIA DGSRYWGDLK RKFDIPWIGS LEANLFAAYR YEAWNGSLGE TDIYSAYGAF LQKTGNWSWG KLTNNYLLRA GAGNYQAEKF KSENLADLWR ANFYGSLNSS YPLWKGKSAA LTPEAAYRYS PIAIVPGLRF NTNLKTTFDA YGDGERKATI GLTGGPALTL GTFSKPFLDF TRLSISGGGT LIQGSSPFKF DQNIDLATLG IGLTQQIAGP LILNTGVAYN VDPDSPYYGD IINSNIELRW QRRSYDFGFY FNPYKGIGGF RFRLNDFNFT GTGVPFVPYT PINQFDQFEE HLF
|
| |