Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_05401 |
Symbol | |
ID | 5730785 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | + |
Start bp | 506599 |
End bp | 507840 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641284899 |
Product | Zn-dependent protease |
Protein accession | YP_001550425 |
Protein GI | 159903081 |
COG category | [R] General function prediction only |
COG ID | [COG1994] Zn-dependent proteases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGATAC GTGGAATACC TATTAGGGTC CATCCCAGTT GGTTTCTAAT CCTTTTTGTT TTTACATGTG CCTCTCAAGG TCAGATATCG AACCTGTTTG ATTCTGAATT GCCTGTCCTA TTGAGTTGGG GGATCGGTTT TCTAACTTCT TTGCTTGTTT TTGCATCAGT TGTTTTGCAT GAACTTGGTC ACTCTTTTAT GGCAATGCAC GAGGGCATAA AAGTGAGAAG CATTACTTTG TTTTTGTTGG GTGGGGTGGC GCGAATTGAT AAAGAATGTG TGACAGCAAT GTCTTGTTTA AGAGTTGCTA TAGCGGGCCC TTTAGTAAGT CTTTCTTTAG CAGGTTTGCT TTTGGCCTTT GTTCAAGTTG CATCTAACAC AAGTTTAATT GCCTCGAATT TATTCTCTCA GCTTGGAACT ATTAATTTGT TATTGGCTTT CTTTAACTTA TTACCAGGTT TACCTTTAGA TGGAGGTGTA ATACTAAAGT CAATTGTTTG GCACTTTTCT GGGAGTCAGC GTAAAGGATT AAAGGTTGCA AACTATTCAG GACGACTATT ATCGGTTTTT GCGGTTTTTC TTGGGACTTT CATTTGGTTA AGAGGTGGAG GTTTCGGAGG TATATGGTTG ATAATTCTTG GCTGGTTTGG CCTTGCTTCT TCGCGATCTC AGAATCAGAT TTTCTCATTA CAAGAAATTT TATGCACATT GAATGTTAGC CAAGCCTCGA GAAGAAATTT TAGAGTCCTT GAAGTGGATC AGTCTCTAAA AAGTATTAGT GAGTTGAACT TGGGCTCGGC TGAAAATCAA CGGATATCTG AATGGGTGCT TTTGTGCAAT GCAGGAAGAT GGGTTGGCTA TTTAACAGAT AAAGTTTTGA AGGATGTTCC AGTTCAGGAT TGGGATAAAT ATTTGGTATC GGAATATAGT CAACCACTAA GTGAGTTGCC TTCGATAAGC GACAAAGAAC CTCTTTGGCA CGCGGTACTA ACCTTAGAAA AGCTTAAGGC ATCAAGACTG TTGGTCTTTA ATTCGGCAGG CTTGCCTTCT GGAACTTTAG ATAAAGTAGA TATTGGGAAT GCAGTCCTAA GTAGACTTGG ACTTAAACTA CCAAAGTCAT TTCTCGAAAC GGCAAGGCAG AATAACATTT ATCCATTGGG CATCTCATTG GTTCAAGTAG TCGAAGAAAT GATTATCACT GGATTAGTTC AAGAACCTAA TAGCAATGAG TCGATGAAGT AA
|
Protein sequence | MRIRGIPIRV HPSWFLILFV FTCASQGQIS NLFDSELPVL LSWGIGFLTS LLVFASVVLH ELGHSFMAMH EGIKVRSITL FLLGGVARID KECVTAMSCL RVAIAGPLVS LSLAGLLLAF VQVASNTSLI ASNLFSQLGT INLLLAFFNL LPGLPLDGGV ILKSIVWHFS GSQRKGLKVA NYSGRLLSVF AVFLGTFIWL RGGGFGGIWL IILGWFGLAS SRSQNQIFSL QEILCTLNVS QASRRNFRVL EVDQSLKSIS ELNLGSAENQ RISEWVLLCN AGRWVGYLTD KVLKDVPVQD WDKYLVSEYS QPLSELPSIS DKEPLWHAVL TLEKLKASRL LVFNSAGLPS GTLDKVDIGN AVLSRLGLKL PKSFLETARQ NNIYPLGISL VQVVEEMIIT GLVQEPNSNE SMK
|
| |