Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_14661 |
Symbol | |
ID | 4777512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 1261676 |
End bp | 1262677 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640086976 |
Product | M23/M37 familypeptidase |
Protein accession | YP_001017477 |
Protein GI | 124023170 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0739] Membrane proteins related to metalloendopeptidases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.122507 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGCCCC TGTTGTTGCT TATCTCTTCC TTTGCAGCAC CAGTGCTTGC TTTAGGGAAT CTTGGGTCCT TGCCTGGCTA TGCAGATAGT GCTGTACAGG AGAAGGTCAA GATCTCAAGC ACAACCTCAC AGCAACTGAT ATGGATAAAA GTTGCCCTAC CAGTAACCAT AGAACAATTA GCAGGCAAAC TTGGGCTGAA AGCGACTGAA CTGTCAAAAC TCAATAAAAA GTCATCCAAT ACAGAACTAA CCAAAGGTAG TTGGATTGTT TTGCCAAGAT CTGCTCACAA CCAATTGAGG CGTATCAGTT ACCTAGATTC TGAGCAGGTG TTGCTGCATA ACCCTCGTAA CAGTAAAAAC CAACTAAATA ACAATCGATT GAGTCGACTG CTTGATGAAA CTAAGAAAAA AAACACGTTG TATTCGTTGA ATGATCACAA TAAGAATACA AATCGGGTAC AGAACCAAAC AAAAGTAAGT AGCAACAACA TCCTTAAGAA GGAGTGCTCC CTTGAGTCTC CATGCAATTG CCCCACCTGC TTAGATGTTG AATCACCAGA GAGCACAGTA GATCTGTTCA CCAGAAGTAA TGACATGCTT CAGCTAGGAA GCATTGATTC TGATTCCTAC ATATGGCCTA CTAAAGGTGT TTTCACATCT GGATTCGGAT GGAGGTGGGG GAGAATGCAT CAAGGTATCG ATATCGCCAA TAAAGTGGGC ACTCCCGTTT TTGCAGCAAA AGACGGAATA GTCACCTATG CCGGATGGAG GGGGGCCTAC GGCTACCTCG TAGAAATTGC ACATGGTGGC GGCTCCACAA CTCGCTATGC CCATAACAAT CAGATTTTGG TGCGCAGTGG TCAGTTCATA CCGCAAGGAG CAACGATCTC GAAGATGGGC AGCACAGGTC GGAGCACTGG TCCACATCTC CATTTTGAGA TCAGAAAGAA GGGTGGCTTA GCAATGAATC CAGTCACGTT GCTTCCATCG AATAAGGTCT GA
|
Protein sequence | MKPLLLLISS FAAPVLALGN LGSLPGYADS AVQEKVKISS TTSQQLIWIK VALPVTIEQL AGKLGLKATE LSKLNKKSSN TELTKGSWIV LPRSAHNQLR RISYLDSEQV LLHNPRNSKN QLNNNRLSRL LDETKKKNTL YSLNDHNKNT NRVQNQTKVS SNNILKKECS LESPCNCPTC LDVESPESTV DLFTRSNDML QLGSIDSDSY IWPTKGVFTS GFGWRWGRMH QGIDIANKVG TPVFAAKDGI VTYAGWRGAY GYLVEIAHGG GSTTRYAHNN QILVRSGQFI PQGATISKMG STGRSTGPHL HFEIRKKGGL AMNPVTLLPS NKV
|
| |