Gene Information |
Locus tag | P9303_07871 |
Symbol | |
ID | 4778584 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 721611 |
End bp | 722879 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640086296 |
Product | Zn-dependent protease |
Protein accession | YP_001016803 |
Protein GI | 124022496 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
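The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes that Start bp and End bp are 1-based, inclusive positions and that the gene length counts the stop codon; both assumptions match the values in this record, but the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


length_bp = gene_length(721611, 722879)
print(length_bp)                  # 1269
print(protein_length(length_bp))  # 422
```

The results agree with the Gene Length (1269 bp) and Protein Length (422 aa) fields.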
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGGCGAAG GCTGGGAGCT GATGAAGATT CGGGGAATCC CTTTAAGGGT TCATCCCAGC TGGTTTGTGA TCCTGTTGCT TTTTACCTGG ATTTCACAAA ATCAGGTGTC CGCTGCAGCT GAATCTTCGC TTCCAGCTTG GATCAGCTGG GGTCTGGGGT TGATTACGGC CCTGCTGTTG TTTTTGTCTG TTCTCCTGCA TGAGCTAGGC CACTCTTTGG TTGCACTGCG TGAGGGGGTC AAGGTTCGCA GCATCACGCT TTTTTTCCTG GGAGGTGTTG CCAGCGTTGA GAGGGAGTGC TCTACACCGA TGGCCTCTTT AAGGGTTGCT GCAGCCGGCC CACTGGTGAG CTTGGTGTTG GCTGTTGCCT TGCTGACGGG AGGGGTGTAT GCAGCTGATC ACGTCAATCC GCTGCTTGCC AATCTCGTTG GGCAGTTGGG TGGGCTCAAT TTGTTGCTTG CCCTTTTTAA CTTGCTACCT GGGTTGCCTC TTGATGGAGG CTTGATCCTT AAGGCTTTGG TCTGGCAGTG GACTGGCAGT CAGAGGAAGG GTGTCCAGGT CGCCACAGCA ACTGGCCGTG CCCTGTCTCT CTCGGCAATG GTGTTGGGGG GTTGGTTGCT CTTCTTTAAA GGTGGTGGGA TCGGTGGGCT TTGGCTGTTG ATGCTTGGTT GGTTTGGTCT CGGTGCATCT CGCTCTCAAA CCCAGCTACT TGCCTTGCAG AAGGTCTTGC GTGAGCTCAA CGTGGGCCTG GCTGCTGGGC GCAACTTCCG TGTGCTTGAA GATGACCAGT CGTTGCGCAG GCTTAGTCAG TTGCGTTTGT CTGGAAGCGA GGAGCAGTCT CCTCCGGCGT GGGTTTTGGT TTGTCGCTCT GGTCGATGGG TTGGTTACAT GACGGACCAA CCCTTAAAAG AATTGCCTGT GCAGCAATGG GATAGGCAAT GCCTGGCGGA TCACATGAAA CCGATATCTG AGTTGCCTTC CATTGGCGAG AAAGCCCCTT TATGGCAGGC GGTGTTGGCA CTAGAACAGG CTGAGGAGGG CAGGCTTCTT GTCTTTAATG TTGCTGGTCT TCCTTGCGGA ACATTAGATC GAATTGATCT CTCCGAAGCT GTTCTTAAGC GTCTTGGGGT AAGGCTTCCT GCTCAGTTTC TCGAAGCTGC TCGCCGTCAG AACACCTATC CCCTGGGTAT GGCACTGCCT AAAGTTGTGG AGTCGATGAT CTCTGGCGGA TTGGTTGAGC AGCCTGAGGC ATCCAGCAGT ACTTCATAG
Protein sequence | MGEGWELMKI RGIPLRVHPS WFVILLLFTW ISQNQVSAAA ESSLPAWISW GLGLITALLL FLSVLLHELG HSLVALREGV KVRSITLFFL GGVASVEREC STPMASLRVA AAGPLVSLVL AVALLTGGVY AADHVNPLLA NLVGQLGGLN LLLALFNLLP GLPLDGGLIL KALVWQWTGS QRKGVQVATA TGRALSLSAM VLGGWLLFFK GGGIGGLWLL MLGWFGLGAS RSQTQLLALQ KVLRELNVGL AAGRNFRVLE DDQSLRRLSQ LRLSGSEEQS PPAWVLVCRS GRWVGYMTDQ PLKELPVQQW DRQCLADHMK PISELPSIGE KAPLWQAVLA LEQAEEGRLL VFNVAGLPCG TLDRIDLSEA VLKRLGVRLP AQFLEAARRQ NTYPLGMALP KVVESMISGG LVEQPEASSS TS
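The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in plain Python. This is a minimal illustration, not the database's own pipeline: the helper names are hypothetical, the codon table is the amino-acid assignment of NCBI translation table 11, and the first codon is translated as M whenever it is one of that table's listed start codons (here GTG). Whitespace is stripped first because the record prints sequences in blocks of ten.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); table 11 uses these assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Start codons listed for translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces used for 10-base grouping and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the start codon as M and stopping at the first stop."""
    s = clean(seq)
    if not s or len(s) % 3:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [s[i:i + 3] for i in range(0, len(s), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a table 11 start codon")
    residues = ["M"]  # the initiator codon is read as methionine
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGGCGAAG GCTGGGAGCT GATGAAGATT"))  # MGEGWELMKI
```

Passing the full Gene sequence field to `translate_cds` should reproduce the Protein sequence field, and `gc_fraction` on the same input can be compared with the GC content field.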