Gene P9211_16121 details


Gene Information

Locus tag: P9211_16121
Symbol:
ID: 5731199
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9211
Kingdom: Bacteria
Replicon accession: NC_009976
Strand:
Start bp: 1443388
End bp: 1444500
Gene Length: 1113 bp
Protein Length: 370 aa
Translation table: 11
GC content: 41%
IMG OID: 641285990
Product: trypsin-like serine protease
Protein accession: YP_001551497
Protein GI: 159904153
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
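
The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the reported gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1443388
end_bp = 1444500

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed to be
# the stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)       # 1113
print(gene_length % 3)   # 0, so the CDS is a whole number of codons
print(protein_length)    # 370
```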


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGATTTA CTAGATATTG CCTCGCATTG GTTTGCATAA TTTTGCTTCT GGGACAGGAA 
CCTGCTCTTG CAGAGATGGC TTTATCTAAT CAGGTCAATC ATGGTTTCGT AGCTGATGCC
GTGAAGAATG TTAGTCATTC AGTGGTTCGT ATAGACACAG AAAGATTGGT TGAACGTGAG
CAATTTGATC CAACATTGCT AGATCCCTTG CTGAAAGATT TACTAGGTGA ACCCTCTTAT
GGCCCTGAGC ATGAAAGGGG CCAAGGATCA GGAGTATTGA TTGATACCAA TGGGCTGGTT
CTAACAAATG CACATGTTGT TGAGAAAGTG GATGACGTGC TTGTAACAAT TTCAAATGGA
TATGAGATAG AGGGAAAGGT TATTGGTACT GATGAGATTA CAGATTTGGC TTTAGTTCGT
TTAGACGGAG ATTTGGATCT CAACCCTGCA CCTCTAGGAA ATTCTGAGGC TTTAGAGGTT
GGGGATTGGG CAATCGCCTT AGGCACCCCT TATGGCTTAG AAAGCACAGT GACTCTCGGC
ATAATCAGCA GTTTGCATAG AAATATAAAT AGTCTGGGGT TTTCAGATAA ACGTTTGGAT
TTAATTCAAA CTGATGCAGC AATAAATCCT GGCAATTCGG GAGGGCCTTT AGTTAATGCC
TCTGGAGAAG TTATTGGCAT CAACACTTTG GTGCGCTCTG GACCTGGCGC TGGCCTTGGC
TTTGCCATAC CAATTAATCT TGCAAAGAGA ATTTCCGCGC AATTGCTAGA CTCAGGTGAA
GTTATTCATC CCTATTTAGG TGTTCAGCTG GTTCCGTTAA CCGCTCGTAT TGCTAAAGAA
CATAATCGCG ACCCTAATTC AATAATTCAA TTACCAGAAA GATCAGGTGC ATTAGTTCAA
TCAGTACTTT CTGAAAGCCC GGCAGCAAAA GCTGGAATGA AAAGAGGAGA TTTGGTTATT
TCTGCAGAAG AAAAAGAAAT TTTTGATCCA GAGGCTTTAC TTCAAAAGGT TGAACAATCA
GAAATAGGAG TCCCTTTTGG TTTAAGTGTT TTAAGGAATG AGCATGAGAT AAGACTTTCA
ATAAAGCCTG AACCGTTACC TGGCTTTAAT TAG
 
Protein sequence
MRFTRYCLAL VCIILLLGQE PALAEMALSN QVNHGFVADA VKNVSHSVVR IDTERLVERE 
QFDPTLLDPL LKDLLGEPSY GPEHERGQGS GVLIDTNGLV LTNAHVVEKV DDVLVTISNG
YEIEGKVIGT DEITDLALVR LDGDLDLNPA PLGNSEALEV GDWAIALGTP YGLESTVTLG
IISSLHRNIN SLGFSDKRLD LIQTDAAINP GNSGGPLVNA SGEVIGINTL VRSGPGAGLG
FAIPINLAKR ISAQLLDSGE VIHPYLGVQL VPLTARIAKE HNRDPNSIIQ LPERSGALVQ
SVLSESPAAK AGMKRGDLVI SAEEKEIFDP EALLQKVEQS EIGVPFGLSV LRNEHEIRLS
IKPEPLPGFN
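
As a rough cross-check of the two sequences above, the sketch below translates a nucleotide string codon by codon and computes its GC fraction. It uses the standard amino-acid assignments, which translation table 11 shares with the standard code for sense and stop codons. The function names (`clean`, `translate`, `gc_fraction`) are hypothetical helpers, and only the first 30 bases of the gene sequence are included here, so the GC fraction of this excerpt is not expected to match the whole-gene figure of 41%.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used for display and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above.
excerpt = "ATGAGATTTA CTAGATATTG CCTCGCATTG"
print(translate(excerpt))  # MRFTRYCLAL, the first ten residues of the protein
print(f"{gc_fraction(excerpt):.1%}")
```

Pasting the full gene sequence into `excerpt` should, under the same assumptions, reproduce the 370-residue protein sequence and allow the reported GC content to be compared.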