Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_00661 |
Symbol | |
ID | 4776987 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 62605 |
End bp | 65856 |
Gene Length | 3252 bp |
Protein Length | 1083 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 640085566 |
Product | hypothetical protein |
Protein accession | YP_001016088 |
Protein GI | 124021781 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1404] Subtilisin-like serine proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.464249 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAGAC AAGAAGCTGG CCACAAACAG CTCGTCAACT CGCGATTTGA TCGCTGGCAA GATGAGCTCA GCAAACCAAT GCAACAATCA ATCGCATTTT TCTCCAATCC TGAAAACTTA ACCAACAAGG ATTCAAGCGA TGTGTTCCGC TGGGTGATTG CAGGACCAAC TCAAAAAAAC CTTTCTGCAT TTAAACAACA AGCAGCCAGG GTCGGAATCA AAATTCAAGA AGAGAATCTG AGCAATGCAG GATTCGGAAT TGATGAGCTA CAACTAATCG CCTTCGAACC TGGGTCCTTA TCAGATTTTC AAAGGCTGCA AACAGCAGCG AAAGAACTTA ACCTGGAACT CTGGCAAGAA GTGGTTCAAA CACACTCCGG TGATGGGCCT ATCGACACAG CTGCCATTAC AGAAGGAACC TTCGATCTCA ATCTTAACGA CAACAATACA GCCACTGATG TCAATGGTGA CGGCACTCCA GACCCTCTCT TTCATCTATT AGCAAGCAAT GATGCTAACG GTAGTTATGG CGTCAATGCT GTTGGAGCCT GGAGCCAAGT CAGTGGAGAG GGTATCACTG TTGGAGTTCT AGATACATTC TTTGATCTAA ATCATACAGA TCTAAATGCG GCTATGCCAA CTAATTTTGA CTGGGACAAT GATGGTCAAA ATGATGGTGT AGACAACAAC AACAACAACA TTCCAGATCT CTTCGAAAGT GAGCAATTCA CCCATGCCCT AGGTTCACCT AATTGGCCCG TAAATAACCC ACCACAACCA ACACCCCCTA ATCAATCTCA TGGTACGGCT GTTAGTGGAA TCGCGGTAGG CAGAAGCAAT GGAAACTCAG GCATTGGTGT TGCTCCTGAA TCCAACTGGA TCCCAGATGG CTTTCTTGAC CATCAAAATC TATGGCCATC TCAGAATTAC TACAATTACG CTGATGTCGT TAACAATAGT TGGGGGATGC CGAATACAGC CGGCGTTTTC CAAACATGGA CTCCTCAACG ACTTGCCAAC TGGCAACTTG CCACAGATGG AGCTATTCAA GTTGTTACTG CAGGCAATGA CAGAGACCCA GGCAATACAG CAAATCAGGG ATGGAGCAAT ACAAACAATT CTGAAAAGAC TAGAAGAGAA AATATTGTTG TTGCAGCAAC AATGCGCAAT GGTGAAGTAG AACAATACAG CACACCTGGC GCCTCAGTTT TTGTCAGCGC CCCTGTCAAT GGATCAAACT TCAGGTTTGC AAACTCATTT TTCGCCAACG CAGGAGTCCA ACGCACAACA ACAGCCGATG TCACAGACAA TGTGGCGTCC AATGCAGACA ATTCGGGTTA CATGAATGGG CCCACTGACA CCAGATTCAA TGGCACATCA GCCGCAGCAC CGATGGTGAC AGGAGCTATC GCCTTGATGT TAGAAGCGAA TCCAACACTC ACTGTAAGGG ATATTCAGCA CATCCTTACT GAAACATCAG TCAAAAATGG CCTAATAGAT AGTGATGGAG ACGGTTTACT CGATGCCATT AACCCCAATG CAGGCGGTAA TGCAGCGTTC CCAGGCGCAG CAGGAACAAT TGAACTAAGA AATTCACTAA CAGCGGGTGT TAACTCGACT TTTAACATTG CGGATGGTCA CAACACCGGA TGGTTTGTCA ATGGTGCCGG TCATTGGGTT AGTGATTCCT TCGGTTTCGG CATCGTTGAT GCAGGAGCAG CTGTTGCATT AGCAAACAAC TGGACAAATG TAGGAGATGA GCTCAAAGTC ACCACTGACA CGATTCTAAA CAACCCATAC ACCATTCAAG AAGGCATTCT AGGTGGACTC AATTCACTCA CAAATGCAGG CTCTTGGAAT GTCAACAACC ACATCGAACT GGAATGGGTT GAACTCACTC TGAACTTGAA CCTGCCAGAA CAAGATGAGG TGATGCTGGC GATTCAATCA CCATCTGGAA CCAGATCAGT GTTAATGGCT CCTGGTGGAA GCGATGCAAC CGCATTTAAT GGTCAGCGCA CCCTGATTAC AAATCAGTTC TGGGGTGAAA GCGCAAATGG ACAGTGGAAT ATCGAAGTTC TTGATGTGAA CAATGATGGT GACAATGCCA CCATCTCAAA CGCAACTCTG GACCTCTACG GAACCTGTAA TCAAACTTGC CCCCTTGAGG TTAAATCCTT CAAAGAACTA TCAAACAGTG GATTTGGTCT CGGCCAGTTA GCCAATCAAT TCCTGCAAGA TGGTGGTGCA AACCAAGGTA GCTATCAACT TCACTCAGTG ATGTCGATTG GCGATTGGGA ATCCTATGGA AATTTCACAG AAGGCTACAA CACAGGGCTG AAGATTGATG AAGGATTGAT CTTAACCAGT GGTCGCGCCA AAGATGCTAT TGGACCAAAC TCATCACCTA GCACGTCAAC AAATTGGCAA AATGTTGGCC ATCCACTACT AGGTGCAAAT AGCAAGGATG CTTCAGGAAT GGAGATTCGT TTCTCCCCCA ATCAAGACAT GGTCTTGGAT TGGAATGCAC AATTTGGTTC AGAAGAATTT GACGAGTATT CGCCCAGTAT TTTTGATGAC AACGCCGGCA TCTTCTTTAC TGAAATTACT GATCAAAAGG ACCCACTAGT TGGATACAAC CCAACAAACC TCCTCTCCGG TCCTAATCAA GGTCCCTTCT CAGTGAATGG CTTCAGCGAA AACCCTGGCA TCTTTGAGAA ATGGATGAAT ATGACCGAGC CTTGTGGACC AGTGAGCTGG GAATATGATG GAGGAACAAA CTTTGCAATC ACCTCCAAAA AAGCGGTGCT GGAAAAAGGC AAGACTTATG TACTCGCACC AATAATTGGT GATGCAACCG ATCATATCTA TGACAGCGGC ATCATCATCG GTCCAAACAA GCCAATTTTC AACTTACCCA AGCTCCCGAG GTTATGGGAG CCACGGAAAA AATCTCTTCC ATTCCGCAAA GAAGATCTCA TTCACATTGA CGTGAAACCC GAAAACACTG GAGCGAATGA CAACCTGGAT GCTCTGAGCA AACTTGGCCA AGTCTCCTTC GCTGAGGCCT CTACACGTAG TCTCGAAGAC ATCGAAATCT TCACCGGTCG TTTGCTTGAG GCCTTCTTCA CAGGTAACAA CCTCTCCAGA GAGCAAGTCA AAACCATGCT CACTGGCCTC GATTCCGAAG ACGCCATGAA CAACCAGCTC TTGAGCAACC ACTTCGCCCC TGAAGTTGCA AGAGTGATTT GA
|
Protein sequence | MKRQEAGHKQ LVNSRFDRWQ DELSKPMQQS IAFFSNPENL TNKDSSDVFR WVIAGPTQKN LSAFKQQAAR VGIKIQEENL SNAGFGIDEL QLIAFEPGSL SDFQRLQTAA KELNLELWQE VVQTHSGDGP IDTAAITEGT FDLNLNDNNT ATDVNGDGTP DPLFHLLASN DANGSYGVNA VGAWSQVSGE GITVGVLDTF FDLNHTDLNA AMPTNFDWDN DGQNDGVDNN NNNIPDLFES EQFTHALGSP NWPVNNPPQP TPPNQSHGTA VSGIAVGRSN GNSGIGVAPE SNWIPDGFLD HQNLWPSQNY YNYADVVNNS WGMPNTAGVF QTWTPQRLAN WQLATDGAIQ VVTAGNDRDP GNTANQGWSN TNNSEKTRRE NIVVAATMRN GEVEQYSTPG ASVFVSAPVN GSNFRFANSF FANAGVQRTT TADVTDNVAS NADNSGYMNG PTDTRFNGTS AAAPMVTGAI ALMLEANPTL TVRDIQHILT ETSVKNGLID SDGDGLLDAI NPNAGGNAAF PGAAGTIELR NSLTAGVNST FNIADGHNTG WFVNGAGHWV SDSFGFGIVD AGAAVALANN WTNVGDELKV TTDTILNNPY TIQEGILGGL NSLTNAGSWN VNNHIELEWV ELTLNLNLPE QDEVMLAIQS PSGTRSVLMA PGGSDATAFN GQRTLITNQF WGESANGQWN IEVLDVNNDG DNATISNATL DLYGTCNQTC PLEVKSFKEL SNSGFGLGQL ANQFLQDGGA NQGSYQLHSV MSIGDWESYG NFTEGYNTGL KIDEGLILTS GRAKDAIGPN SSPSTSTNWQ NVGHPLLGAN SKDASGMEIR FSPNQDMVLD WNAQFGSEEF DEYSPSIFDD NAGIFFTEIT DQKDPLVGYN PTNLLSGPNQ GPFSVNGFSE NPGIFEKWMN MTEPCGPVSW EYDGGTNFAI TSKKAVLEKG KTYVLAPIIG DATDHIYDSG IIIGPNKPIF NLPKLPRLWE PRKKSLPFRK EDLIHIDVKP ENTGANDNLD ALSKLGQVSF AEASTRSLED IEIFTGRLLE AFFTGNNLSR EQVKTMLTGL DSEDAMNNQL LSNHFAPEVA RVI
|
| |