Gene P9211_12761 details


Gene Information

Locus tag: P9211_12761
Symbol:
ID: 5731826
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1150157
End bp: 1151254
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 37%
IMG OID: 641285645
Product: membrane-associated Zn-dependent protease 1
Protein accession: YP_001551161
Protein GI: 159903817
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0750] Predicted membrane-associated Zn-dependent proteases 1
TIGRFAM ID: [TIGR00054] RIP metalloprotease RseP
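
The coordinates and lengths in the table are related by simple arithmetic, and a short check can confirm that they agree. The sketch below is only an illustration: it assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 1150157
END_BP = 1151254
GENE_LENGTH_BP = 1098
PROTEIN_LENGTH_AA = 365


def span_length(start: int, end: int) -> int:
    """Length of a span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected protein length (aa):", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions both computed values match the Gene Length and Protein Length entries listed in the table.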


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0131503
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACTTTTT TTAATGTGAT TGCTTCCATT GCCGTCTTAG CGCTTCTAAT TTTTTTCCAT 
GAAGCAGGGC ATTTTCTTGC AGCCACATTG CAAGGCATTC GAGTCAGTGG ATTTTCAATT
GGCTTTGGTC CTGCACTTTT AGAAAAAGAA TTTAAAGGTG TCACTTACTC AATAAGAGCA
TTCCCTTTAG GAGGGTTTGT ATCCTTTCCT GATGATGACA ATGAAAAAGA AAAGATTTCT
CTAGACGATC CTGATCTATT AAGTAATAGG CCAATTTATC AAAGATTATT GGTTATTTCA
GCTGGTGTTA TTGCCAATCT ATTAGTTGCA TGGATTGCAT TATTTAGCCA AGCAACTTTT
ATAGGCCTGC CAAATCAGCC TGATCCTGGT GTTCTTATTA TTGGAGTACA AGATCAAGAA
GCTGCTTACC AAGCAGGCCT AGAAATAGGC GATAAAGTCT TAAGCATTGA CGGCATCAAA
CTAGGTTCAG GCCAAGAAGC TGTTCAATCT TTAGTTGACA AAATTAAAGC TTCTCCAGGT
AAATCAATAG AACTTGATAA AGCAAATAGT AAAGGTAACT TTACTATCAC CATCACTCCT
TCAGATTATT TTGGCAACGG CAGAGTAGGT GCACAATTAC AACAAAATAC TGTTGTTTCT
TCAAGGCCTG CAAAGGGAAT ATTAGAAATT ATTGTTCATT CTAATTCTCA ATTTACAGAT
TTATTAATTC GAACTGTTAA AGGGTATCAA GGACTGTTTA CAGATTTCGC CTCAACATCA
AAGCAAATTA GCGGACCAGT TAAAATTGTT GAATTAGGGG CCCAAATGTC AGGGCAAGGG
GTATCTGGAT TAATATTTTT TGCTTCTCTA GTTTCAATAA ATCTTGCCGT TTTAAATTCC
TTACCCTTGC CGGTCCTTGA TGGAGGTCAA TTTGCATTAA TTCTTATAGA GGCGGTAAGA
GGAAAGCCAG TTCCAGAAAA AATTCAACTC GCTTTTATGC AATCAGGCTT TCTTCTTCTA
ATAGGACTAA GCATTGTGCT TATTATTCGC GATACAAGTC AATTATCAAT ACTTCAACAA
TTAGCAAGTA ATCACTAG
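
The GC content reported in the gene record can be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content means the percentage of G and C among the A, C, G and T bases; the function name `gc_content` is hypothetical. It skips the spaces and line breaks used to group the sequence into blocks of ten.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other characters are ignored, so the sequence
    can be pasted in its grouped form.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


if __name__ == "__main__":
    # First block of ten bases from the gene sequence, as a small example.
    print(gc_content("ATGACTTTTT"))
```

To compare against the figure in the record, pass the full gene sequence as one string and round the result to the nearest percent.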
 
Protein sequence
MTFFNVIASI AVLALLIFFH EAGHFLAATL QGIRVSGFSI GFGPALLEKE FKGVTYSIRA 
FPLGGFVSFP DDDNEKEKIS LDDPDLLSNR PIYQRLLVIS AGVIANLLVA WIALFSQATF
IGLPNQPDPG VLIIGVQDQE AAYQAGLEIG DKVLSIDGIK LGSGQEAVQS LVDKIKASPG
KSIELDKANS KGNFTITITP SDYFGNGRVG AQLQQNTVVS SRPAKGILEI IVHSNSQFTD
LLIRTVKGYQ GLFTDFASTS KQISGPVKIV ELGAQMSGQG VSGLIFFASL VSINLAVLNS
LPLPVLDGGQ FALILIEAVR GKPVPEKIQL AFMQSGFLLL IGLSIVLIIR DTSQLSILQQ
LASNH
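
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below is an illustration rather than the method used to produce this record: it builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G), whose codon-to-amino-acid assignments translation table 11 shares with the standard code; table 11 differs in which codons may serve as start codons, which this sketch does not model. The `translate` function is a hypothetical helper.

```python
from itertools import product

# Standard codon table in NCBI order (first, second, third base each T, C, A, G).
_BASES = "TCAG"
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(_BASES, repeat=3), _AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate an in-frame DNA sequence, stopping at the first stop codon.

    Whitespace is removed first, so grouped sequence text can be pasted in.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 and last 18 bases of the gene sequence.
    print(translate("ATGACTTTTT TTAATGTGAT TGCTTCCATT"))
    print(translate("TTAGCAAGTA ATCACTAG"))
```

The two fragments shown translate to the first ten and last five residues of the protein sequence, MTFFNVIASI and LASNH, with the final TAG codon read as the stop.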