Gene Information |
Locus tag | P9303_21071 |
Symbol | |
ID | 4776903 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1865532 |
End bp | 1868738 |
Gene Length | 3207 bp |
Protein Length | 1068 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640087615 |
Product | MrsD-like protein |
Protein accession | YP_001018107 |
Protein GI | 124023800 |
COG category | [V] Defense mechanisms |
COG ID | [COG4403] Lantibiotic modifying enzyme |
TIGRFAM ID | |
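The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon (the listed sequence ends in TAA); the function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1865532, 1868738)
print(length)                     # 3207
print(protein_length_aa(length))  # 1068
```

Both values match the Gene Length (3207 bp) and Protein Length (1068 aa) rows of this record.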
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.365739 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGACAAGTC CAACATCTTG GAAAACCAGT TGGTTAGCAG CTATTGCTCC CGATGAGCCC CACAAATTCG ATCGGCGCCT TGAATGGGAT GAGCTATCGG AGGAAAACTT CTTTGCTGCG CTGAACAGTG CGCCAACTTC CCTCGAGGAG GATGACCCTT GTTTTGATGA GGCTCTACAA GATGCTCTTG AAGCTTTAAA GGCCGCCTGG GACTTACCCC TGTTACCAGT CGACAACACT GTTAATAGAC CTTTTGTTGA TCTTTGGTGG CCGATTCGTT GCCACTCAGC AGAGAGTCTT AGACAAATCT TCGTTTCTGA TAGCGCTGGC CTTGCTGATG AGATTTTTGA TCAATTGGCA GACAGTCTGC TTGATCGACT CTGCGCGCTT GGCGATCAAG TGTTATGGGA AGCCTTCAAT AAGGAACGCA CACCAGGCAC GATGCTGCTT GCACATCTGG GTGCTGCTGG AGATGGATCA GGGCCACCAG TGCGTGAACA CTACGAGCGA TTCATCCAAT CCCATCGCCG CAATGGCTTA GCACCATTGC TTAAAGAATT TCCTGTACTT GGGCGCCTAA TCGACACCGT TCTTTCGCTT TGGTTTCAAG GCAGTGTGGA GATGTTGCAA CGCATCTGTG CCGATCGCAC AGTCCTTCAG CAAGGCTTTG CGATTCCCTG CGGCCATCAT CTCAAAACCG TCAAGCAGGG CTTGAGTGAT CCCCACCGCG GTGGTCGTGC AGTCGCTGTT TTGGAATTTG CTGATCCCAA TTCGACTGCC AACTCCTCCA TGCATGTGGT CTACAAGCCC AAAGACATGG CTGTAGACGC GGCTTATCAG GCCACCTTGG CTGATCTCAA TGCCCACAGT GATCTATCAC CGTTGCGCAC TCTTGCCATC CACAACGGCG ACGGCTATGG CTATATGGAG CATGTCGTCC ATCACTTTTG CGCTAATGAC AAAGAACTAA CCAATTTCTA TTTCAATGCC GGTCGACTAA CAGCCTTACT GCATCTACTC GGTTGCACTG ACTGCCATCA CGAAAATCTA ATTGCCTGCG GTGATCAGTT GCTTCTGATC GATACCGAAA CATTGTTGGA AGCTGATCTT CCTGATCACA TCAGCGACGC TTCCTCTACG ACAGCTCAAC CCGAACCTTC CACTCTGCAA AGACAGTTTC AGCGTTCTGT TTTGCGTTCC GGTTTACTCC CGCAATGGAT GTTTCTTGGA GAATCAAAAC AGGCAATTGA CATCAGCGCT CTTGGCATTT CGCCACCGAA TAAACCCGAA AGAATTGCTG CTGGCTGGCT TGGGTTTAAC AGTGACGGGA TGATGCCAGG TCGTGTTAGC CAGCCTGTTG AAATTCCAAC AAGTTTGCCA GTGGGAATCG CTGATGTTAA TCCTTTTGAT CGTTTTCTTG AAGACTTCTG CGATGGCTTC TCAATGCAAA GTAAAGCGCT CATCAAACTC CGCAATCGCT GGTTGGATTT AAATGGAATA TTGGCTCACT TTGCAGGATT GCCACGCCGG ATTGTGTTGC GAGCCACGCG TGTGTATTTC AACATTCAAC GTCAGCAGTT AGAGCCAACT GCTTTGCGTT CTTCGCTTGC ACAGGCGCTC AAACTCGAGC AGCTAGCACG TAGTTTTCTG CTCGCCGAAT CCAAGCCTCT TCACTGGCCA ATTTTTGCAG CAGAAGTAAA GCAGATGCAG CAATTGGATA TTCCTTTCTT CACTCATCTC ATTGATGCTG ATGCCCTCCA GCTCGGCGGA TTGGAACAAG AATTACCGGG CTTTATCCAA 
ACCAGCGGTT TAGCGGCGGC TTATGAGCGA TTGCGAAATC TAGATAGCAA TGAGATTGCT TTTCAACTGC GCCTGATTCA TGGAGCCGTA GAAGCGCGTG AGCTGCACAC AACCCCCGAG AGTTCACCAA CCCTTCACCC CCCCGCGACA CCAGAAGCGC TGATGTCAGC ATCGGCTGAG ACGAGTCTGA AGGCAGCAAA GCGGATCGCC CATCGTCTTT TGGAATTGGC GATTCGCGAC TCGCAGGGGC AGGTGGAGTG GCTGGGTATG GACCTGGGCG CTGATGGCGA GAGCTTTGCC TTTGGGCCCG TTGGTCTATC GCTTTATGGT GGATCCATAG GAATTTCCCA CTTGCTGCAG CGCTTGCAAG CACAGCAGGT TCCATTGATG GATGCCGATG CCATTCAGAC TGCAATCCTG CAACCCCTGG TTGGTCTTGT TGCTCAACCC AGCGACGATG GCCGTCGGCG CTGGTGGCGT GATCAGCCAC TTGGACTGAG TGGCTGTGGT GGAACCCTTT TGGCTCTTGC TCTGCAAGGC GAGCAGGCGA TGGCCAATAG TCTGCTCGCG GCGGCTCTAC CGCGTTTCAT CGAGGCTGAT CAGCAACTGG ATTTGATTGG TGGTTGCGCA GGCTTGATTG GATCGCTGGT GCAGCTGGGG ACGGAATCAG CTCTGCAGTG GGCTTTGCGT GCTGGTGACC ATCTCATTGC TCAACAAAAT GAAGAAGGTG CCTGGAGCTC ATCGTCAAGC CAGCCAGCAC TTTTGGGCTT CTCCCATGGT GTAGCTGGTT ATGCGGCAGC TCTTGCTCAC CTACATGCTT TCTCCGGTGA CGAGCGCTAC CGCATAGCTG CCGCAGCTGC ACTGGCTTAT GAGCGGGCTC GATTCAATAA AGACGCGGGC AATTGGCCTG ATTACCGCAG TATTAGTAGA GACTCAGATT CTGATGAACC AAGCTTCATG GCTAGCTGGT GTCATGGTGC CCCGGGTATT GCCCTTGGCC GAGCCTGTCT GTGGGGTACG GCTCTTTGGG ATGAAGAATG CACCAAGGAG ATAGGGATTG GCTTGCAAAC CACAGCAGCT GTCAGTGCTG TTTCGACGGA TCATCTTTGC TGCGGATCAC TGGGTTTGAT GGTCTTGCTA GAGATGCTTT CTAAAGGCCC TTGGCCGATT GACAACCAGC TCAGATCTCA TTGCCAAGAC GTGGCTTCTC AGTACCGCCT GCAAGCTTTA CAACGCTGCT CAGCTGAACC AATCAAGCTG CGATGCTTCG GCACAAAAGA AGGATTACTT GTACTGCCTG GCTTCTTCAC AGGCTTAAGC GGGATGGGAT TAGCGCTGCT CGAGGATGAT CCATCACGAG CTGTCGTATC TCAACTGATC AGCGCTGGGC TGTGGCCTAC TGAATAA
|
Protein sequence | MTSPTSWKTS WLAAIAPDEP HKFDRRLEWD ELSEENFFAA LNSAPTSLEE DDPCFDEALQ DALEALKAAW DLPLLPVDNT VNRPFVDLWW PIRCHSAESL RQIFVSDSAG LADEIFDQLA DSLLDRLCAL GDQVLWEAFN KERTPGTMLL AHLGAAGDGS GPPVREHYER FIQSHRRNGL APLLKEFPVL GRLIDTVLSL WFQGSVEMLQ RICADRTVLQ QGFAIPCGHH LKTVKQGLSD PHRGGRAVAV LEFADPNSTA NSSMHVVYKP KDMAVDAAYQ ATLADLNAHS DLSPLRTLAI HNGDGYGYME HVVHHFCAND KELTNFYFNA GRLTALLHLL GCTDCHHENL IACGDQLLLI DTETLLEADL PDHISDASST TAQPEPSTLQ RQFQRSVLRS GLLPQWMFLG ESKQAIDISA LGISPPNKPE RIAAGWLGFN SDGMMPGRVS QPVEIPTSLP VGIADVNPFD RFLEDFCDGF SMQSKALIKL RNRWLDLNGI LAHFAGLPRR IVLRATRVYF NIQRQQLEPT ALRSSLAQAL KLEQLARSFL LAESKPLHWP IFAAEVKQMQ QLDIPFFTHL IDADALQLGG LEQELPGFIQ TSGLAAAYER LRNLDSNEIA FQLRLIHGAV EARELHTTPE SSPTLHPPAT PEALMSASAE TSLKAAKRIA HRLLELAIRD SQGQVEWLGM DLGADGESFA FGPVGLSLYG GSIGISHLLQ RLQAQQVPLM DADAIQTAIL QPLVGLVAQP SDDGRRRWWR DQPLGLSGCG GTLLALALQG EQAMANSLLA AALPRFIEAD QQLDLIGGCA GLIGSLVQLG TESALQWALR AGDHLIAQQN EEGAWSSSSS QPALLGFSHG VAGYAAALAH LHAFSGDERY RIAAAAALAY ERARFNKDAG NWPDYRSISR DSDSDEPSFM ASWCHGAPGI ALGRACLWGT ALWDEECTKE IGIGLQTTAA VSAVSTDHLC CGSLGLMVLL EMLSKGPWPI DNQLRSHCQD VASQYRLQAL QRCSAEPIKL RCFGTKEGLL VLPGFFTGLS GMGLALLEDD PSRAVVSQLI SAGLWPTE
|
| |
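The sequences above are displayed in space-separated blocks of 10 characters, so they need to be joined before any downstream use. A minimal sketch, assuming plain A/C/G/T input with no ambiguity codes (the helper names are hypothetical), that removes the grouping and computes GC content as a percentage:

```python
def normalize_sequence(grouped: str) -> str:
    """Remove the display spacing and line breaks and upper-case the sequence."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# The first two blocks of the gene sequence, used here only as a short demo.
demo = normalize_sequence("TTGACAAGTC CAACATCTTG")
print(len(demo))         # 20
print(gc_percent(demo))  # 40.0
```

Applied to the full gene sequence, the same functions should give a length equal to the Gene Length field and a GC percentage comparable to the GC content field, although the rounding rule the database uses is not stated.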