Gene P9303_24941 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_24941 
Symbol 
ID4777176 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2190860 
End bp2192143 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content54% 
IMG OID640088015 
Producthemolysin-like protein 
Protein accessionYP_001018490 
Protein GI124024183 
COG category[R] General function prediction only 
COG ID[COG1253] Hemolysins and related proteins containing CBS domains 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGTTGC TTTTCCTGGC TGTATTGCTG GTTCTACCGG CTTTTTTCGC GGCTGGAGAG 
GTTGCCTTGT TGCGGCTGCG GCCTAGTCGA GTACAGGTTT TAGTGGAAGA GCAGCAGCCC
GGAGCTTCCG CCGTTCATCG TCTTCAGCGC CGTTTGAGGA GGGCGTTGAT GGTGTCTCAA
CTGGGTGGAA TGCTGGCGTT GGTAGCCCTG GGCTGGGTTG GCCGTGGTGT TGGACATCGC
TGGTGGCCTC TAGCTGATCC TGCTAGTCGC TGGTTGGACG GGGGGCTTTT TCTGCTGCTT
GTGGTGTTGG CCACCTTGTT GGCTGGTTTT CTTCCTAAGG CCTGGGTGCT GAACCGTCCA
GAGGCTTCAG CTCTAAACCT CGCTCCATTG TTGGAGATGG TGATGCGTGT GCTTGCTCCC
CTTTTGGCTC TTCTGGAAGC TGTCGCTTCG ATGATGTTAC GGCTGGTTGG TTTGAATGCA
CATTGGGATT CTCTTGTTCC TGCTCTCTCT GCTGGTGAGC TGGAGTCTCT GATCGAAATT
GGCGGTGTAA CAGGCCTTCG TCCTGATGAG CGCAACATCC TTGAAGGTGT TTTTGCCTTG
CGCGACACTC AAGTTAGAGA GGTGATGGTG CCACGTTCTG GCATGGTCAC CTTGCCTGTT
GGGGTCTGCT TCGCTGAACT GATGAGAGTG GTGCATAGCA CCCGCCATGC GCGCTTTCCA
GTGATCGGTC AGTCCCTAGA TGATGTCAGG GGTGTGCTTG ATTTACGTCG GTTGGCGGAA
CCCATCTCCC GGGGTGCTTT GCAGGCAGAA TCTCCGCTTG AACCTTTTTT AGAACCAGCT
GTAAGGGTTC TTGAGACCAG CACTTTGGCT GAATTGTTGC CGATGATCCG AAGTGGACAG
CCCCTACTGC TTGTCGTTGA TGAGCATGGC GGTACAGAAG GATTGGTTAC AGCTGCCGAT
CTCACTGGTG AGATCGTGGG CGATGAGCCC CATGCAGACG ACGATGAGCC GGATCTTGAG
CTGATTGAGG GTCAGTCAGA CACATGGATG GTTGCAGGAG ATCTTGAGAT CATTGAGCTC
AATCGACAGC TCAATCTGGA CTTGCCTGAA GCTGATGGAC ATCACACCTT GGCTGGCTTT
CTGCTTGAAA AGTTGCAACA CATCCCTTCT GCTGGAGAGG CCTTGCGCTG CGATGGTTTG
CAGTTCGAGA TCGTAACGAT GAAGGGTCCT CGTATCGAGC GTGTGCGACT GATTCTTCCC
AGTCACGATC ACACTGAGGA ATGA
 
Protein sequence
MRLLFLAVLL VLPAFFAAGE VALLRLRPSR VQVLVEEQQP GASAVHRLQR RLRRALMVSQ 
LGGMLALVAL GWVGRGVGHR WWPLADPASR WLDGGLFLLL VVLATLLAGF LPKAWVLNRP
EASALNLAPL LEMVMRVLAP LLALLEAVAS MMLRLVGLNA HWDSLVPALS AGELESLIEI
GGVTGLRPDE RNILEGVFAL RDTQVREVMV PRSGMVTLPV GVCFAELMRV VHSTRHARFP
VIGQSLDDVR GVLDLRRLAE PISRGALQAE SPLEPFLEPA VRVLETSTLA ELLPMIRSGQ
PLLLVVDEHG GTEGLVTAAD LTGEIVGDEP HADDDEPDLE LIEGQSDTWM VAGDLEIIEL
NRQLNLDLPE ADGHHTLAGF LLEKLQHIPS AGEALRCDGL QFEIVTMKGP RIERVRLILP
SHDHTEE