Gene P9211_03011 details


Gene Information

Locus tag: P9211_03011
Symbol:
ID: 5731485
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 284625
End bp: 285908
Gene Length: 1284 bp
Protein Length: 427 aa
Translation table: 11
GC content: 39%
IMG OID: 641284647
Product: hemolysin-like protein
Protein accession: YP_001550186
Protein GI: 159902842
COG category: [R] General function prediction only
COG ID: [COG1253] Hemolysins and related proteins containing CBS domains
TIGRFAM ID:
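
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the gene length counts one terminal stop codon; the record does not state either convention, so both are assumptions.

```python
# Values copied from the Gene Information fields above.
start_bp = 284625
end_bp = 285908
gene_length_bp = 1284
protein_length_aa = 427

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the coding sequence includes one stop codon that is not translated.
codons = gene_length_bp // 3
expected_protein_length = codons - 1

print(span, codons, expected_protein_length)
```

Under these assumptions the span matches the listed gene length and the codon count minus the stop codon matches the listed protein length.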


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0287533
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGGCTTC TTCTTCTCGC AATACTACTT TCACTGCCAG CATTTTTTGC TGCAGGAGAG 
CTTGCAATAT TACGACTTAG GCCAAGTCGA GTAGAAAGAC TGATTGAGGA AAAGCAACCA
GGCGCCCATT CAATTCACAG GCTGCAAAAA AGGTTACGTA GAACATTAAT GGCCGCTCAA
CTAGGTATAA CAATTGCGCT AGTTGCCTTG GGATGGTTAG CAAATGAACT AGCAAACTTA
TGGTTCTTTT CTACAGAATC AAAATCTCGC TTATTAAATC TCACCATCTT TCTGAGCATT
GTCTTACTTG GAACATTGCT TTCTGGCCTC CTACCAAAAG CATTGGTTCT TAGCAATCCT
GAAAAATCTG CTCTGCGAAT CTCGCCTCTT CTAGAAGGAG TAATAAGAGC AATGACTCCT
ATTCTTTCTT TATTAGAAAC AATCTCTTCT TTTTTACTGA GGTTAATTGG GTTGAACATG
CAGTGGGAAT CATTAGTTAC TGCTTTATCA GCAGGAGAAT TAGAAACCCT TATTGAATCA
GGCAAAGTTA CTGGTCTGCA CCCTGATGAA AAAAATATTC TGGAAGGTGT CTTCGCGCTT
AGAGATACAC AAGTTCGCGA AGTTATGGTC CCTCGTTCAG GGATGGTGAC CCTTCCCAGA
AATGTTCTTT TCTCTGAATT AATGAGCAAA GTTCATTTAA CTCGACATGC TCGTTTTTTG
GTTATTGGTG ATTCACTTGA TAACGTTCTT GGAGTTCTAG ACCTGCGCCT CTTAGCCGAG
CCAATTTCGA AAGGAGAAAT GAATCCTGAT ACACCTTTGG AAAAATATAT CAAGCCTGTT
GCACGAGTTG CAGAAACTTG CACATTAGAG ATGTTGCTTC CCTTAATAAG AAAAGGAAAC
CCTTTTTTGT TAGTGGTCGA TGAGCATGGA GGGACTGAAG GGCTTATAAC AGCAGCAGAC
TTAACAGGAG AAATCGTTGG TGAAGAAATT GATTCAGGTA AAAAAGAGCC TATTCTACGA
AGAATAAATA ACAGTTCCAA AACGTGGATT GCTGCAGGCG ATCTAGAAAT TATTGAACTT
AATCGACAAT TGAATATCGA TCTTCCTGAA AACATAAATC ACTATACGCT TGCTGGCTTC
TTATTAGAAA AGCTTCAAAG CGTTCCTTCC AAAGGAGAAA CCCTTCTAGA TAATGGAATT
ATTTTTGAAA TTACTTCTAT GAGAGGGCCA AGAATTGATC TAGTAAAAAT AATGTTGCCC
CAAAAAGTTA CTGATAAGGA TTGA
 
Protein sequence
MRLLLLAILL SLPAFFAAGE LAILRLRPSR VERLIEEKQP GAHSIHRLQK RLRRTLMAAQ 
LGITIALVAL GWLANELANL WFFSTESKSR LLNLTIFLSI VLLGTLLSGL LPKALVLSNP
EKSALRISPL LEGVIRAMTP ILSLLETISS FLLRLIGLNM QWESLVTALS AGELETLIES
GKVTGLHPDE KNILEGVFAL RDTQVREVMV PRSGMVTLPR NVLFSELMSK VHLTRHARFL
VIGDSLDNVL GVLDLRLLAE PISKGEMNPD TPLEKYIKPV ARVAETCTLE MLLPLIRKGN
PFLLVVDEHG GTEGLITAAD LTGEIVGEEI DSGKKEPILR RINNSSKTWI AAGDLEIIEL
NRQLNIDLPE NINHYTLAGF LLEKLQSVPS KGETLLDNGI IFEITSMRGP RIDLVKIMLP
QKVTDKD
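
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch of that step together with a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical and are not part of this record or of any particular library.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAGGCTTC TTCTTCTCGC AATACTACTT TCACTGCCAG CATTTTTTGC TGCAGGAGAG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence to the same two functions gives a value that can be compared with the GC content field listed in the record.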