Gene P9211_01011 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9211_01011 
Symbol 
ID5731564 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9211 
KingdomBacteria 
Replicon accessionNC_009976 
Strand
Start bp105523 
End bp106665 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content37% 
IMG OID641284444 
Productserine protease 
Protein accessionYP_001549986 
Protein GI159902642 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.336111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCTAA AGAAACTCAT TGCTGAATCA GGATTTGGTG TATTACTTGT TGGGGCCTAT 
GTTTTTACTA ATGGCACTCA GCAAATTTCA GCTGCAACAA ATTTCAAATT AGCTCAATTA
ACAGTTAGAT CTCAAAACTC TTTTGTTACT GAAGCCATCA ACGAAAGTGG ACCTGCTGTC
GTTACTGTGG AGACTCAAAG ACAAGTAGTT TCCAGAAATA ATCTTTTCCC TCCCAATTTT
TTTATAGATC CTTCTTTAGA GAGGTTTTTC AATCAGCCAA AATTAAAAAT GCCCAAATCT
AGATTGCAGC TAGGTCATGG GAGTGGAGTG ATTTTTTCTT CTAAAGGTCT GGTATTAACT
AACGCACATG TGATTGAAAA TACTGACAAA TTAGTTGTTG GCTTGTCAGA TGGAAGAAGA
TTCCCAGCCA GGGTGATCGG TCAAGACGCC CTCACAGATC TAGCCGTGAT AGGTATAGAA
GGAAAGGGTC CATGGCCAAT TGCAAAATTA GGCGATTCCG ACAAACTTGT TGTAGGTGAA
TGGGCTATTG CCGTTGGAAG TCCTTTTGGT CTAGAAAAAA CAGTGACACT AGGGATTATT
AGTAACCTTA ATAGAAATGT TTCTCAGCTA GGTATTGCAG ACAAAAGGTT AAAGCTTATA
CAAACTGATG CAGCAATCAA TCCAGGCAAT TCTGGTGGTC CATTACTAAA CTCTAATGGA
GAAGTAATAG GAATTAATAC ATTAGTCAGA TCTGGCCCAG GGGCAGGCCT AGGTTTTGCA
ATACCTATCA ATCAAGCAAT TCAGATTGCA AGTCAATTAG TAGCAAGAGG CAAAGCCATC
CATCCAATGA TTGGAGTAAA CCTTACTTAT TTAATAAATC AACCTGAAGA CAACTATATC
TCTACAAAAG GGGCACAAAT TATAAATATT CTTCCTGGAA GTCCAGCTGA GAAAGAAGGT
CTAAAGGTTA ATGATATTAT TCTTGCAATT AATGGTATAA AAGTTGATGG TCCTCAAGAT
GTAGTTGACA AAATTAATAA AAATGGATTG AGTAAGAGGC TAAGATTGAC GCTTGTCAGA
AACAAAAGGA GGATAACTGT CTCTATACTC CCAGTAGATA TAAGCAATTT CAAAAAAGAT
TAA
 
Protein sequence
MSLKKLIAES GFGVLLVGAY VFTNGTQQIS AATNFKLAQL TVRSQNSFVT EAINESGPAV 
VTVETQRQVV SRNNLFPPNF FIDPSLERFF NQPKLKMPKS RLQLGHGSGV IFSSKGLVLT
NAHVIENTDK LVVGLSDGRR FPARVIGQDA LTDLAVIGIE GKGPWPIAKL GDSDKLVVGE
WAIAVGSPFG LEKTVTLGII SNLNRNVSQL GIADKRLKLI QTDAAINPGN SGGPLLNSNG
EVIGINTLVR SGPGAGLGFA IPINQAIQIA SQLVARGKAI HPMIGVNLTY LINQPEDNYI
STKGAQIINI LPGSPAEKEG LKVNDIILAI NGIKVDGPQD VVDKINKNGL SKRLRLTLVR
NKRRITVSIL PVDISNFKKD