Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9211_03011 |
Symbol | |
ID | 5731485 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9211 |
Kingdom | Bacteria |
Replicon accession | NC_009976 |
Strand | - |
Start bp | 284625 |
End bp | 285908 |
Gene Length | 1284 bp |
Protein Length | 427 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641284647 |
Product | hemolysin-like protein |
Protein accession | YP_001550186 |
Protein GI | 159902842 |
COG category | [R] General function prediction only |
COG ID | [COG1253] Hemolysins and related proteins containing CBS domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0287533 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCTTC TTCTTCTCGC AATACTACTT TCACTGCCAG CATTTTTTGC TGCAGGAGAG CTTGCAATAT TACGACTTAG GCCAAGTCGA GTAGAAAGAC TGATTGAGGA AAAGCAACCA GGCGCCCATT CAATTCACAG GCTGCAAAAA AGGTTACGTA GAACATTAAT GGCCGCTCAA CTAGGTATAA CAATTGCGCT AGTTGCCTTG GGATGGTTAG CAAATGAACT AGCAAACTTA TGGTTCTTTT CTACAGAATC AAAATCTCGC TTATTAAATC TCACCATCTT TCTGAGCATT GTCTTACTTG GAACATTGCT TTCTGGCCTC CTACCAAAAG CATTGGTTCT TAGCAATCCT GAAAAATCTG CTCTGCGAAT CTCGCCTCTT CTAGAAGGAG TAATAAGAGC AATGACTCCT ATTCTTTCTT TATTAGAAAC AATCTCTTCT TTTTTACTGA GGTTAATTGG GTTGAACATG CAGTGGGAAT CATTAGTTAC TGCTTTATCA GCAGGAGAAT TAGAAACCCT TATTGAATCA GGCAAAGTTA CTGGTCTGCA CCCTGATGAA AAAAATATTC TGGAAGGTGT CTTCGCGCTT AGAGATACAC AAGTTCGCGA AGTTATGGTC CCTCGTTCAG GGATGGTGAC CCTTCCCAGA AATGTTCTTT TCTCTGAATT AATGAGCAAA GTTCATTTAA CTCGACATGC TCGTTTTTTG GTTATTGGTG ATTCACTTGA TAACGTTCTT GGAGTTCTAG ACCTGCGCCT CTTAGCCGAG CCAATTTCGA AAGGAGAAAT GAATCCTGAT ACACCTTTGG AAAAATATAT CAAGCCTGTT GCACGAGTTG CAGAAACTTG CACATTAGAG ATGTTGCTTC CCTTAATAAG AAAAGGAAAC CCTTTTTTGT TAGTGGTCGA TGAGCATGGA GGGACTGAAG GGCTTATAAC AGCAGCAGAC TTAACAGGAG AAATCGTTGG TGAAGAAATT GATTCAGGTA AAAAAGAGCC TATTCTACGA AGAATAAATA ACAGTTCCAA AACGTGGATT GCTGCAGGCG ATCTAGAAAT TATTGAACTT AATCGACAAT TGAATATCGA TCTTCCTGAA AACATAAATC ACTATACGCT TGCTGGCTTC TTATTAGAAA AGCTTCAAAG CGTTCCTTCC AAAGGAGAAA CCCTTCTAGA TAATGGAATT ATTTTTGAAA TTACTTCTAT GAGAGGGCCA AGAATTGATC TAGTAAAAAT AATGTTGCCC CAAAAAGTTA CTGATAAGGA TTGA
|
Protein sequence | MRLLLLAILL SLPAFFAAGE LAILRLRPSR VERLIEEKQP GAHSIHRLQK RLRRTLMAAQ LGITIALVAL GWLANELANL WFFSTESKSR LLNLTIFLSI VLLGTLLSGL LPKALVLSNP EKSALRISPL LEGVIRAMTP ILSLLETISS FLLRLIGLNM QWESLVTALS AGELETLIES GKVTGLHPDE KNILEGVFAL RDTQVREVMV PRSGMVTLPR NVLFSELMSK VHLTRHARFL VIGDSLDNVL GVLDLRLLAE PISKGEMNPD TPLEKYIKPV ARVAETCTLE MLLPLIRKGN PFLLVVDEHG GTEGLITAAD LTGEIVGEEI DSGKKEPILR RINNSSKTWI AAGDLEIIEL NRQLNIDLPE NINHYTLAGF LLEKLQSVPS KGETLLDNGI IFEITSMRGP RIDLVKIMLP QKVTDKD
|
| |