Gene Information |
Locus tag | P9303_14611 |
Symbol | |
ID | 4776780 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1255510 |
End bp | 1258362 |
Gene Length | 2853 bp |
Protein Length | 950 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640086970 |
Product | hypothetical protein |
Protein accession | YP_001017472 |
Protein GI | 124023165 |
COG category | [S] Function unknown |
COG ID | [COG1615] Uncharacterized conserved protein |
TIGRFAM ID | |
|
|
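The coordinate and length fields above are consistent with each other, and the arithmetic can be checked with a minimal sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not fields of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 1255510
end_bp = 1258362

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon (3 bp) is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 2853 bp
print(protein_length, "aa")  # 950 aa
```

Both results match the Gene Length and Protein Length fields in the record.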
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.62237 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCTTGG CTGGTTTCGC CCGACCCTTG CAATGGTTCA AGAGCCAGTG GCTTTGGCTC TTGCTCTCTA TCGCAGCTTT CTGGTTGTTG ATGCGGGTTC AAGTCGAATG GTTGTGGTTC GGTCAATTTG ATTGGCAAGG GATGCTTCTC CGCCGTTGGC TCTGGCAACT GGGAGGACTT CTACTCGCCC TTCTGGTTGT AGCGACTTGT CAGCTTTGGC AGCGCAACTG GATCAAACTC GAAGGTGCAA GCAACTTGGC AGAGCCAGCG CTTCCCCTGC ATGGATGGCG TTATGGATTG GGGCTACTTG GCTGCTTCGT GGTGGTGGTC GGTGATCTTG TATTACTCAC TCGATTGGCG TGGTTAGCTT GTTTTAAGCC ATTTGCCCTT GGGCATTGGT GGAGTGAACC TTTTGAAGAC ATTTGGGCGT TGGTGATTCC GCTGTCCTGT GTATTCATCT CAATTTGCGT GATGCTTGGC AATGCCCGAG GGGGAAGGAT TGCTCATTTG ATGGGTTGTT TTTGCTTCAG TATCTCCATC GCTAGAGGCT GGGGATTATG GTCACTCGCC CTTGCTATCC CTCCTACAGG TATTAAAGAA CCACTACTGG GCGCTGATGT GAGTTTTGGG CTCGGTCAAT TTCCAGCACT TGCCTTTGCC TTGGTTGTTC TATTGGCCCA ACTCGTCTTA ACAACGAGTA CGACAATATG GATGAAGTTA GCCCAGCCTG AATCTCTTTC AGATTGGGTT TTCAAAGGTT TATCGCCTAG ACAGTGTGAT GTTATGAGGC CATTAATTGG CATCATACTT TTAACACTTT CAGCACTTTT ATGGTTGTCA CGTCATGAAC TTCTGTGGAC GCAGAACGGT ACGGTAGCCG GAGCTGGTTG GTTGGATGCT CACCTGATAC TTCCTCTACG AAGTTTGGCG AGCTTGGCCA TTTTGGTTCT CGCATTTCTG GTGATTCCAT TTCCATGGAT ACAACAAAGG CGTTTATGGA GATTAATCGC TTCAATTATT GGCGTAGGCG CAATTCTTCT CGAAGTGCTT TTGGCTCCTT TTGTCCAGTG GATGGTCGTA AAACCTAGAG AACTGAAACT AGAAACCCCT TACATCATTC GAGCAATTAA AGCCACAAGG AAAGCATTTC AGCTTGACTC GATTACGACT ACACTTATTA ATCCTCAACC TCAGTTAACT CAACTTGATC TTGAACAAGG TGCAAGCACC CTAAGGAATA TTCGTCTATG GGACAGCCAG CCTCTCTTGG CAACCAATCG TCAATTACAG CAATTGAGGG TTTACTATCG CTTTTCAAAT GCGGCGGTAG ATCGTTATCG CTTTGTGCCT GACAAGGCGA ACCGGCAACA GGTGATGATC ACAGCACGCG AACTTGATCA GGCTGCACTT CCAAAACGTT CACGTACTTG GTTGAATCGT CATTTTGTTT TCACGCATGG ATATGGCTTT ACTCTCAGTC CAGTCAATAC AAGAGCACCC GATGGCCTCC CTGACTATTT TATCAGTGAT CTAGGAACCT CAACACGTCT TGAAGGTAGT TCTGAGCTCG GCATTACTCG TGAAGATGTC AAAGAGGCAG TGCCTATTGG GCGAGCAGCT CTCTATTTTG GAATGCTTCC CTCTCCTTAT GCTCTTGCTC CGAGCAAACT AAAAGAACTA GATTATCCAG TAGGCGATAA GAATATCTAC AACCACTATT TAGGATCAGG AGGTGTGCCG GTGGGTCATC CCTGGCAACA ATTAGCAGCA GCTATGTACC TTTTTGAACC CCGCCTTCTT 
AATACAGGGT CGCTAACGAT TAATTCCAAA CTTCTTATTA GGCGAGAAGT GAGACAAAGA GTGAGTGCAA TAGCGCCATT CCTTGAAGTT ATTGGTGATC CATATTTGGT CTCTACATCT GTTAATTCTA GAGATCATGA TTATCAAGCT AAGCAAAACC AATATTGGAT TGTTGAAGCT TATACAAGCT CACGCACATA TCCCTACGCA GCAAACCTGC CAGATGGTCG CCCTGTGCGA TATTTACGTA ACTCAGTCAA AGCTATTGTT GATGCATACA GTGGTCGTGT TCACTTGTAT GTGAGCGAAC CGCGGGATCC GATAATATTG GGTTGGCAGC GATTGTTTCC AGATCTTTTC AAACCTCTTG AGGAGATGCC TTCAAGCTTA CGAGAGCATC TTAAGGTTCC TACAGATTTG TTTAATGTAC AGGTACAGCA ATTGCTTAGA TACCACGTTA CTGATCCTCG TATATTTTAT AGCGGTGATG ATGTTTGGCA GGTTCCAAAG GAACTCTATG GTAAGCGGCA GGTTCCTGTT GATCCCTATC ACATCACTGC ACAATTAGGT AGTCAAGAAA GTTCAGAATT TCTCTTATTA CAACCACTAA CACCACTAGC TCGCCCCAAT CTTTCTGCGT GGCTCGCTGC TCGAAGTGAT GGTGATCATT ATGGAAAATT AGTGTTGCTA CGTTTCCCAA GTCAAACGCC AATCTTTGGC CCTGAACAGA TTCAGGCCCT TATCAATCAG GACCCGCAGA TCAGCCAACA GTTTGGTCTT TGGGATCGTG CTGGGTCTGA AGTTGTGCAA GGGAACCTTC TTGTTGTACC CCTAGGCAAG GCCCTTCTTT ATGTAGAACC TGTTTACCTA AGGGCACGTC AAGGTGGTCT ACCAACCCTT ACCAGGGTAG TTGTTAGTGA TGGCAAAAGG ATTGCCATGG CTGAGGATCT CGGAGAAGGC CTAAGGGCCT TGGTTGACGG ATCAAGCAAA AAAGCAGTGT ATTTAAATAG AAATGATCTG CCACCGATTA AGGCAGCAGA TCAATCAAAT TAA
|
Protein sequence | MGLAGFARPL QWFKSQWLWL LLSIAAFWLL MRVQVEWLWF GQFDWQGMLL RRWLWQLGGL LLALLVVATC QLWQRNWIKL EGASNLAEPA LPLHGWRYGL GLLGCFVVVV GDLVLLTRLA WLACFKPFAL GHWWSEPFED IWALVIPLSC VFISICVMLG NARGGRIAHL MGCFCFSISI ARGWGLWSLA LAIPPTGIKE PLLGADVSFG LGQFPALAFA LVVLLAQLVL TTSTTIWMKL AQPESLSDWV FKGLSPRQCD VMRPLIGIIL LTLSALLWLS RHELLWTQNG TVAGAGWLDA HLILPLRSLA SLAILVLAFL VIPFPWIQQR RLWRLIASII GVGAILLEVL LAPFVQWMVV KPRELKLETP YIIRAIKATR KAFQLDSITT TLINPQPQLT QLDLEQGAST LRNIRLWDSQ PLLATNRQLQ QLRVYYRFSN AAVDRYRFVP DKANRQQVMI TARELDQAAL PKRSRTWLNR HFVFTHGYGF TLSPVNTRAP DGLPDYFISD LGTSTRLEGS SELGITREDV KEAVPIGRAA LYFGMLPSPY ALAPSKLKEL DYPVGDKNIY NHYLGSGGVP VGHPWQQLAA AMYLFEPRLL NTGSLTINSK LLIRREVRQR VSAIAPFLEV IGDPYLVSTS VNSRDHDYQA KQNQYWIVEA YTSSRTYPYA ANLPDGRPVR YLRNSVKAIV DAYSGRVHLY VSEPRDPIIL GWQRLFPDLF KPLEEMPSSL REHLKVPTDL FNVQVQQLLR YHVTDPRIFY SGDDVWQVPK ELYGKRQVPV DPYHITAQLG SQESSEFLLL QPLTPLARPN LSAWLAARSD GDHYGKLVLL RFPSQTPIFG PEQIQALINQ DPQISQQFGL WDRAGSEVVQ GNLLVVPLGK ALLYVEPVYL RARQGGLPTL TRVVVSDGKR IAMAEDLGEG LRALVDGSSK KAVYLNRNDL PPIKAADQSN
|
| |
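The protein sequence above follows from the gene sequence under translation table 11. The sketch below shows one way to reproduce that step with the standard library alone. It assumes the codon assignments of table 11 match the standard genetic code (the two differ in permitted start codons, which does not matter here because this gene starts with ATG). The `translate` function and the constant names are hypothetical helpers, not part of any database tool.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons, listed in TCAG order (TTT, TTC, TTA, ...).
# Assumption: table 11 uses the same codon assignments as the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, with the record's block spacing kept.
print(translate("ATGGGCTTGG CTGGTTTCGC CCGACCCTTG"))  # MGLAGFARPL
# Last 21 bases of the gene sequence, ending in the TAA stop codon.
print(translate("GCAGCAGATCAATCAAATTAA"))  # AADQSN
```

The two printed fragments match the first ten and last six residues of the protein sequence in the record. Passing the full gene sequence to the same function should give the complete 950-residue protein.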