Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_20801 |
Symbol | |
ID | 4776659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1832358 |
End bp | 1836815 |
Gene Length | 4458 bp |
Protein Length | 1485 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640087589 |
Product | hypothetical protein |
Protein accession | YP_001018081 |
Protein GI | 124023774 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2132] Putative multicopper oxidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCAAAG TCAATTACAT CATCGGAGGT CTTTTCCCTA AAAAGGCAAA TGGCGAGCTC AATCTAATTT TGTCTGATGG GCTCTATATC GACGCAAATA ACAGGGTTTC TGCGAGTAGT GTAGATCTTG ATAAAGGAGA TCTTAATACA AAGCTAGACA TAACGACTAA TAATATTCGC ATACCAGGCA TGGATAATTT CTTAAAAAAT TGGCTAGTAC CACAAATCAC AAATACTAAG GCTACGACTT TTAAGAAGGG TGTAATAAGT GGTGGAACTC TTTTCGGAAG CATTGCTGCC GGCGCAATAG GTAATTGGTC TAAGCAGGCA AGAAATAGAA CAATGTCTAC TCTTGTCAAG CTAGGATATA TAGGGCCAGG CTCAGATAAT GATGCGACAA ACAATGCATT TTGGAATGCC AACGGAACTA AATCTGATGA GCTAGACGGG CAAAATTTGT ATGACGCGAT CAACAATGTA GATAATTACA ATCCATCAGC CCTTTGGGGA ACTTACAACG GCAGCATCGA GCTTGAATCA AATTTCGATA CCGATTCACT CCTAGCCAAT CTTCACGGTG GAATGCATGA AGAAGGGATG GATATGCCAG ACCTTTGGTA TCCAAGCCTT TTATACAGCT ATAGCGAGAA GGGGCAAGGA ACCAGTTTCC CAGGCCCTGT GCTAATGCTA CGGCCTGGCA ATGACCTTAA TATTGATTTC AACAACAAGT TGACAATTCC TGGCTTAACA GTAGAGCAAG CTCAAAAAGC AACATTAATT CAAAACTCAA CGTATGGGAA CACTGCCAGT GATGGACTTG GTGGTACGAC AAGTGTCAAT TACCATTTCC ATGGCTCACA TACAAACTCA ACTGGATTTG GAGACAATGT CGTATCTAGA TACACAACTG GACAAGAGTG GACTACCCAT ATCCAATTGC CAGAAGATCA CGGACAAGGT TCATATTGGT ATCATCCCCA TTACCATCCA TCTGTTAACC AACAAGTCTA TGGAGGTCTC TCTGGTTTTA TCCAGATAGG AGATCCTTTA AGCAAAATCC CAGATTTTAA AGATATTCCT CGAAATCTGG CGGTCATGAA GCAGGTTGGC ATTGGTATTG ACTCATCAAC TGGCAATCCG CTCCTTTCAG GATTTGACCA TGGGTCTGGG AATACTCCAT ACATGGTGAC AGTCAATGGA GAATTTCAAC CAACTGCAGA TGCTGGCAAA GGCGGTTGGC AGTCAATCAC ATTAAGCAAT CAGTCCAACA AAGATTTTTA CAATATTGGA CTAAAAAACC AAGCTTCGGA TGGGAGCTGG GTAGATTTGC CTTTGTATAT CTATGGTGAA GATGGGCACC AATATCCTCA AATAAAGGCA GCGAAAGGAA CATTAGGCCG TTATGACACA AAGGCGGTAG ACGGAAGCAC AACATCAATA TACAGCCAAG CCGATAATAT TCTCTCTCTA GCACCGGCTA AAAGATTAGA CGTGTTAGTA TATCTACCTG AAGGAACATC TCAATTAGTC TCACATCCAA CCAAAAATAC ATTTACAAAA GATGGTAAGC AATACTCAAT ACAAAACACA GCTGGTTTCC CGGACTTATC GGAGACGGCC CAAGGCCTTA CCAGTGCTGG GCCTTTGGCT TACTTTGAAG TTAATGATGG AACTCCTGCT CTTTCAACTA ATGAATTAAA CAGCCAGATC AATACAGCGA ACAAAGGAAT AGATATACAA AACATACAAC CAACCACTAA AGAGTCAGAC TATGACTCAT CAAAAATTCC AAGTGTTAAT CTATTTGCCG ATCAGTGGGA TCCAATCAGA AAAAGGGAAT ACAACTGGTC AAAAGCTGTT CTTGTTGGAC CAGAGGATGA ACAGGATGCT GCAACTCAAG CAGCCATTAA AGACTATGAG GCCGCTAATC CAGGTAAAAC AATTGAACGA TATCATCAAT TACCGGTTAT AACAGCATTC AACCAAGCCA CAAAACCAAC AGCTGGAATT GAAAACTGGC TGGGCTATGA CAACCCATTT CTAATTAATG ACCATGTCTT TCCTAATGGC AGCCTGACGA TCGCACAATT AGGTACCATT GAAGAATGGC GACTGAGAAA CTGGAGTATC GGACCAAATG GTCCAACTAA ATATATTGGC CATCCTTTCC ATATTCACAT CAATGATTAT CAAACGAAAG ACAGTGATAC TGAGCTAATC AACAAAAATG TTCTAGAAGA TGTAACAATG GTCAATTCTT CTGGGTATAA AGCGTATAAC AATAAAACTG GTTTAATCGA CCAGGCTGAT CCATTTCGCG GAGAGTTTCA CAGTATTGAG GAAGCAACCA GCCCAGAACT AATAGAGAGT AAGGGTATTG GATACAACAA AAATGGCTAT GCAGATCTTG GCACCTGGGG TGCAAATGAT CAAACCGTAA GGATGTTATT TCAGGATTAC ATCGGTACTT ATGTATTTCA CTGCCATATC CTTCCTCATG AAGATGCAGG AATGATGCAG GTGGTGACAG TGGTTGAGAA TACCGATTCT AGCTGGCTAG TACCTGCTGA AGGTTTCAAT ATAACGTCTG ATTTGGATAG CTTCGATCAG AATCGAAATA TTGAAATAAG GCTTGCACAG GACTTAAGCA CCAGATATCT AAACCCTGAG CTGGGTATTT CCGAGAACGC AGTAAGAATG CAAGTTGGAG ACCTCAGTGA TGATTTTGTT CAAGATGTAG TGCTAACAAC AAGCTCTAAA GCTACAGGTG GCGAAGTAAG AGTCTATGAT GGTAGTTCAT TATTAGATGG CAAAACAAAG GAGCTATCCT CAATCAAGCC TTATGAAGCA AGTATTGCTC CATGGGCATT TGTAGAAGAC TTCACAGGTG ATGGTCGTCG AGAGTTATTT ACAGCAGGCT TTGATAAAGA GAAAAAAGAT GGGACCATCA ACATCAATGA TCTAAAGGTC AATGGATGGG CTTCAGCAAA TAACGACGGT AAAGGCTGGA AGGATGTTTT CGACTTCTCT CCTTTTGAGA GCATAGAGAA AGACGTTCAG CATGGAGGAC ACCACTACGA GGTCTCAGAA GATATCTCGA GCGATCAGGT TAGCGTTGCA ATCGCTGATA TGAATTTAGA TAATTTTCAA GATGTTATTG TTGCCTATAA AGTTAAAGTC AGTGAGGGGG AGCATAAAAG TCACACTAAT AGCAAAGATG GATTAAGGGT GGTTGTACTT GATGGCGCTG CTCTCAGCCT CAATTACCAA ACAGGCAAGA TGGAAGGTGG ATACTTGCCC GACTCGAATG TCTTGGCTGA TGCACTGTTC CTTGACAGCA GCCTTTCAGA TCTAAGCAAT CTTGTTTTGA CTGCAGGCTT TAATAGCTAT GCCCAATCAG CTCTTGAAAA TATAGTCATT ACTGCTCAAT CAGGTTCTAA ATCACAACAA TTCACGCTGC AGCTACAAGC TGGTCATTTT CTGGCCACAT CAGAACCTGA TGCGCATGGT GGGCATGGTG GGCATGGTGG GTCACAAACA CAGGACGATC GTATAATCAA TCTTCGAAAT GACTCGATGC CTCTCTATTT AGTTGAGGAG CTTGATCTTC CAGATGAGAC AGTAACAGCG AATCCTGTCC TTACTGGTGG TTTGGGTAAT GGAGCTCTCT TATCAGGTGA CTACCTGATG ATTGCTCAGG GTAATTCGGC AAATGGGAAT CATTCCAGTA GTGATAGAGG TATAAATACA ACACAACAGC TTGTAATCAA CCTACCCGGT TTGCTAGAAG TTAATGACTT AGACCTTATA GGCGCAACTG ATTCTAATCT TTCAAGTACT TTCAAAGGTA AGCAAGTCGA ACAACGAGGT AATTTAGCCA ATCTCACCTA CCTCGCTTAT GCAGGCACAT CATTATGGCC ATCACACCAA GCTAGTCTAT CTGCAGGAAT TCTTGGAAAG GGTGGAACCG CAGAAGAATT AGCAGAATCT ATACTTTTAG GATACAGCAG TGACATTATT GACTATTACG GGGAAGATTT AGATGGTCTT TCTACTAAAG ATGTCGTTAC TGGCGCTACT GAAAGTCTCT ATGGGCGAAA GGCAGAGAAG AGTGAGATCA AGCACTGGAA CAAAGAAGTC AAGGATGGCT TAGACAAGTC ACTAATCCCT TTGAGAATAC TACAAACCAC CTCTGGGATA GATTTATATC GTGTAGCATT CTTATCTGCA GGCTCACAAT GGAGTCAATT CCAATGGGCA ACGAATGACA ATGTAGAAGG ATCATTTGGC CAAGGCCTTC AAGGAGATAG CTCAGGCTTT AACAACTTGT CTAATGCTAT GATGTCTGTA GGCGAGATTG CATCTTGGGA TGATGCTCAA AAAGAATATG ACGCCTATAG AGAAAGTTTC CTGACTGCAT TTGTAGGCTC CACGGTTGAG AAGTCAGGAT TCTTCTGA
|
Protein sequence | MSKVNYIIGG LFPKKANGEL NLILSDGLYI DANNRVSASS VDLDKGDLNT KLDITTNNIR IPGMDNFLKN WLVPQITNTK ATTFKKGVIS GGTLFGSIAA GAIGNWSKQA RNRTMSTLVK LGYIGPGSDN DATNNAFWNA NGTKSDELDG QNLYDAINNV DNYNPSALWG TYNGSIELES NFDTDSLLAN LHGGMHEEGM DMPDLWYPSL LYSYSEKGQG TSFPGPVLML RPGNDLNIDF NNKLTIPGLT VEQAQKATLI QNSTYGNTAS DGLGGTTSVN YHFHGSHTNS TGFGDNVVSR YTTGQEWTTH IQLPEDHGQG SYWYHPHYHP SVNQQVYGGL SGFIQIGDPL SKIPDFKDIP RNLAVMKQVG IGIDSSTGNP LLSGFDHGSG NTPYMVTVNG EFQPTADAGK GGWQSITLSN QSNKDFYNIG LKNQASDGSW VDLPLYIYGE DGHQYPQIKA AKGTLGRYDT KAVDGSTTSI YSQADNILSL APAKRLDVLV YLPEGTSQLV SHPTKNTFTK DGKQYSIQNT AGFPDLSETA QGLTSAGPLA YFEVNDGTPA LSTNELNSQI NTANKGIDIQ NIQPTTKESD YDSSKIPSVN LFADQWDPIR KREYNWSKAV LVGPEDEQDA ATQAAIKDYE AANPGKTIER YHQLPVITAF NQATKPTAGI ENWLGYDNPF LINDHVFPNG SLTIAQLGTI EEWRLRNWSI GPNGPTKYIG HPFHIHINDY QTKDSDTELI NKNVLEDVTM VNSSGYKAYN NKTGLIDQAD PFRGEFHSIE EATSPELIES KGIGYNKNGY ADLGTWGAND QTVRMLFQDY IGTYVFHCHI LPHEDAGMMQ VVTVVENTDS SWLVPAEGFN ITSDLDSFDQ NRNIEIRLAQ DLSTRYLNPE LGISENAVRM QVGDLSDDFV QDVVLTTSSK ATGGEVRVYD GSSLLDGKTK ELSSIKPYEA SIAPWAFVED FTGDGRRELF TAGFDKEKKD GTININDLKV NGWASANNDG KGWKDVFDFS PFESIEKDVQ HGGHHYEVSE DISSDQVSVA IADMNLDNFQ DVIVAYKVKV SEGEHKSHTN SKDGLRVVVL DGAALSLNYQ TGKMEGGYLP DSNVLADALF LDSSLSDLSN LVLTAGFNSY AQSALENIVI TAQSGSKSQQ FTLQLQAGHF LATSEPDAHG GHGGHGGSQT QDDRIINLRN DSMPLYLVEE LDLPDETVTA NPVLTGGLGN GALLSGDYLM IAQGNSANGN HSSSDRGINT TQQLVINLPG LLEVNDLDLI GATDSNLSST FKGKQVEQRG NLANLTYLAY AGTSLWPSHQ ASLSAGILGK GGTAEELAES ILLGYSSDII DYYGEDLDGL STKDVVTGAT ESLYGRKAEK SEIKHWNKEV KDGLDKSLIP LRILQTTSGI DLYRVAFLSA GSQWSQFQWA TNDNVEGSFG QGLQGDSSGF NNLSNAMMSV GEIASWDDAQ KEYDAYRESF LTAFVGSTVE KSGFF
|
| |