Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_25821 |
Symbol | |
ID | 4778988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2275451 |
End bp | 2278786 |
Gene Length | 3336 bp |
Protein Length | 1111 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 640088103 |
Product | hypothetical protein |
Protein accession | YP_001018578 |
Protein GI | 124024271 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2931] RTX toxins and related Ca2+-binding proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATC CGCTGTTGAT GTATGAAATG ATCACCAATC CATTTAACTT TTTTGGTGAT TTCGAAGATC CAACATACCG CGACAACAAC ATAGAATTCA AAAAAGAATG GATACAATCA GACTTTGATC TAAACGATGG AGACTCGACA TGGAATAAAT ATACCAACAA TGACCTGATC AAATTACTAG CAGACATTCA AAGTCTGGGT TCCAATGAAT TCACGCCAAT AGCCGGCAAT TTTGGAATTC AAAAACACCT GATAACTTCA ACTTACCAGA TATCAGACTC GGACTATGTG ATAGACAAAC CTGATACCAG CAGATTTAAT CCTAAGGGTC CGTATAATCC ATCTGGTTGG GGATACAGAG GGAACCCTGC CGGAATAGCT CAAGAGGGAG TATGGGGAGT GCTATGTGAC GATATCTGGA GGCAAATCAT TACAACTGCA AATTCACTTG AAACCGAGGA ACGCAAACAA AAGCTGATCA ACACCTATGC ATTCAATAAA GAAGATGGCG AAGAATACAA TATCATCATT CCTGACTTGC GGAAGGGGGT AACATTCGGA AACAAACATA TTATCACGGA TCAATATGTA ATTTCAAACC CATTAAGCTT TTCAAGCCTA CGATCCAACG TAAATGCGTG GGCTAGATCT CTTGATACTG TTACTCATTC CGATCATTTC ATAACAAAAA AACCCGAGGA AGTTCCGAAT TGGAGAACAG GATACCTGGC AGGACATGGC CAACTTGGTT TGCCCGGGGT AGACCAAGAC GGCAATTTAA GCCAAGGAAT AGAATGGAGA AAAGTTATGT TTCGTGGAGC TGGCGACAAT ATTATTCGAA CACGATATTT TTATCCGATA GGCAACGGAT TTGATAGCGC GATACATCAA AAAAACAATA TAAATACTGG GGGTGGTTCA GATTATGTCA TTTACGATAA TTCGCAGCAC GAGGTAATAC TTGGTGATGG TGACGACTTA GCCTTTCCCT CTATTAAAGC TTTCGCTCCT TCCATTAGCT TCGGGCAGCA TGCCCAAAAT AAGCTGCTGG AAGAACCCGA CGGAGATCCT GGCTCTATAC TAATAAATAG TGTACGTTAT AAAGATGATT GCGATTGGTT GGGGGATTGT ACGACTACTC ACGTAGGTAA ACAGCTCCCT TTTGTCAATG CAAACAACGA AGGGCAGCTA AAATATTCAA GCAATCGCTT ATTGCCACAA CCAGATCAAA ATACATTGGT GCCAACGGTT ATTGAAAACC CTCTGATTAG CAATCCAATA AATAGCGTTA ATCATGGTCC GGTTTCCAGT GATGAAGCGT GGTACTACAC AAATAAAGCA CAGATCCTTG ATGGTGTTCA GCCAAGACAG GCGATCGAGA TTGGTGGCCA AAAAATTTAT GGCGGTAAAG GCCACGACAC ATTGCACGGC TTTGACCCAC TGATATACGC CTCAGAAGGA GCAAAAGAAT ATAATCACGT GCAACGCAAA GGAGACAATC CCTGGAAATA TGGAAGGCCA GTGCCGCATG TAATTCAAAA TAAACTCAAC TTTTTGGGAG ACAAAGATAT CGATTTCAAG TGGGATCCCA TTCTCTTATC CGGAGGAGAA GGATCAGACA GAATTAATCT GGGCGATCTC AAACGAATTA ATCTTGGCTT GAAGGGTGAC ATCATCAACA AAAACTTCGC CAACACATTG TATTTGGTTT TTGGTGACAA AGAAAAATCT GATGAATGCA CGGTAATCGC GAGAAAGAGT AAGAAGTGGG CTGACAATAT GAGTCCTGAT GTCTTTTCTC TCGATGCTTC TTATGATTTC AGAGAAGAAA TCATTGTCGA AGGATTGAAT ATTGACAACC GCATAGCTGG AGATGATCCA AAATCAGATT GGACGACACA GGCTGCGACA GTTCAAAAAT CAGTGACGGC AGCCGCTTTG ACTGCTGCAG CCTATCTTGG AACGGCATTT CCGGTTATTG GAGCAGCTTC GGCAATAGCT GCCGTAGGAA TAGATATTGC CAAACAATTA CAACAACACG ATTCGTCTGC CTCACAAAGT GAAGCCACGG ACTTTTACGA ACGAGATGAG GTGAAGGAAA AAATTGTTCC CTTGGGTTCT TGGACTAAAG CCGTCACCAT TCCTGATTTT GATCCAAGCG ACAATATAAC AATCAATTTG ATTCCAATTG AGGATCCTAG CGTTGATCAG AGTGAAGACA AATGGAGCAA TATTAATTTC AGCATGTCAT ATGGACAAAA CCAAATGCAT AGAACAACTA ACTATGGACA TACAGTTTAT TTGGAAACAC CTACTGACCC TCAACCGAAT CCCATTGCTT ATCTTTCTGG GCTGTCGAAT GCTGATCAAA GTGCGGAGTA TGGCTGGAAG ACCTGGGATT TCATGAGTGG GAACCAGAGT ATTCTCGACC CTGTAAAACA CATGGCTTGG TTCGGTGTTT TATCCAATAC CGAAAACACC CAAAACATGA AACTTGACTC ATACAAGGAA GCGCATTGGA ACAACCTGGA AATCGAAAAA GACTCCCCGT ATTCAGACAT ATTCCGATGG AATAGTGATT CGCTTGGAAC TGCTGAAAAA TTGGACAACT ACAGATCTGG ATCGTCCTCC ATGAGATTGA TGTATGATAA TTTCGAAAAA GGCTGGTACT GGGATACACG ATTTTACGGG GAAGGGGAGA GCAAAGGTGA TGTAAAAGTC ATAGATCCAC ATTTTTCATT TCTGCACTAT TACAACAAAG CCAATAAAGC TTGGGATAAA ATTTCTTATC AAGACCTTCT AGACAAGCCC ACTACTGTTG ATGAAAATGG AATCGAATAC CAAGAGATCG CGAAAAGAGC CCAATTTGAA TACTGGACAA AAGATGAAGA TCACATTGTT GGAGGACCAG ATGATGACTG GTTAACAGGA GGTGATGGAG GCGACTATAT GCATGCAGGT CATGGTCGGG ATACGCTTCT GGGTGGTGAT GGTGATGATG TTTTGATTGG TGGCGAAGGC CGTGATCTCC TTAAAGGAGG GCAAGGCTCA GATGTTTTTA TGTATAAAGA TGCATCTCAC TCCGGCTATG GGATAAAACG TGATGTGATT GGAGACTTTA GATCACATCA AAAGGACAAA ATAGATTTAT CTGGAATCCA AGCTGGCCTG ATCTTTATTG GTTCAGACGG CTTTAGTGGC CAGGCAGGCC AAGTTCGATT TGAGAATGGT CTGCTTCAGG TCAACATAGA TCGAGGTTGG CGAGCAGAAT TTGAGATCCA GTTGCTTGGT GTTGATAGCC TTGATCTTGA TGATCTAATC TTGTAG
|
Protein sequence | MNNPLLMYEM ITNPFNFFGD FEDPTYRDNN IEFKKEWIQS DFDLNDGDST WNKYTNNDLI KLLADIQSLG SNEFTPIAGN FGIQKHLITS TYQISDSDYV IDKPDTSRFN PKGPYNPSGW GYRGNPAGIA QEGVWGVLCD DIWRQIITTA NSLETEERKQ KLINTYAFNK EDGEEYNIII PDLRKGVTFG NKHIITDQYV ISNPLSFSSL RSNVNAWARS LDTVTHSDHF ITKKPEEVPN WRTGYLAGHG QLGLPGVDQD GNLSQGIEWR KVMFRGAGDN IIRTRYFYPI GNGFDSAIHQ KNNINTGGGS DYVIYDNSQH EVILGDGDDL AFPSIKAFAP SISFGQHAQN KLLEEPDGDP GSILINSVRY KDDCDWLGDC TTTHVGKQLP FVNANNEGQL KYSSNRLLPQ PDQNTLVPTV IENPLISNPI NSVNHGPVSS DEAWYYTNKA QILDGVQPRQ AIEIGGQKIY GGKGHDTLHG FDPLIYASEG AKEYNHVQRK GDNPWKYGRP VPHVIQNKLN FLGDKDIDFK WDPILLSGGE GSDRINLGDL KRINLGLKGD IINKNFANTL YLVFGDKEKS DECTVIARKS KKWADNMSPD VFSLDASYDF REEIIVEGLN IDNRIAGDDP KSDWTTQAAT VQKSVTAAAL TAAAYLGTAF PVIGAASAIA AVGIDIAKQL QQHDSSASQS EATDFYERDE VKEKIVPLGS WTKAVTIPDF DPSDNITINL IPIEDPSVDQ SEDKWSNINF SMSYGQNQMH RTTNYGHTVY LETPTDPQPN PIAYLSGLSN ADQSAEYGWK TWDFMSGNQS ILDPVKHMAW FGVLSNTENT QNMKLDSYKE AHWNNLEIEK DSPYSDIFRW NSDSLGTAEK LDNYRSGSSS MRLMYDNFEK GWYWDTRFYG EGESKGDVKV IDPHFSFLHY YNKANKAWDK ISYQDLLDKP TTVDENGIEY QEIAKRAQFE YWTKDEDHIV GGPDDDWLTG GDGGDYMHAG HGRDTLLGGD GDDVLIGGEG RDLLKGGQGS DVFMYKDASH SGYGIKRDVI GDFRSHQKDK IDLSGIQAGL IFIGSDGFSG QAGQVRFENG LLQVNIDRGW RAEFEIQLLG VDSLDLDDLI L
|
| |