Gene P9515_16701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9515_16701 
Symbol 
ID4720473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9515 
KingdomBacteria 
Replicon accessionNC_008817 
Strand
Start bp1464257 
End bp1465387 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content35% 
IMG OID640081362 
Producttrypsin-like serine protease 
Protein accessionYP_001011984 
Protein GI123966903 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGATTTT TAAAAAATAA ATTTATTTAT TTATTTAAAT TGGGCATCGT GCTATTTGCT 
TTTTTAATTA ATTTTTTGCC TTTGTCTGAA GTTTTTGCTT TAAATTCTCT CGATGGGCAT
AATTTCGTAT CGGACGCAGT TAAAAATGTA GGTCCTGCAG TGGTTAGAAT TGATACTGAA
AGATTAGTAG AAAGACAACA GTTTGATCCA ACTTTATTAG ATCCATTATT AAGAGATTTA
CTAGGGGAAC CCGGAATGGC TCCTGACAGA GAAAGAGGTC AAGGTTCAGG TGTGATAATT
AACAAAAATG GTTTGGTTTT AACAAATGCT CATGTTGTAG AAAGAGTTGA TAATGTGTCA
GTGACGTTGG CGGATGGAAC TAATTGTGAT GGGAAAGTAT TGGGAACCGA TTCGATTACT
GATTTAGCGT TAGTTAAAAT CGAACAACTT ATTGATTCAA GTTATGCTCC TTTAGGAGAT
TCAGAGAAAC TTGAAGTTGG GGATTGGGCA ATAGCTCTTG GTACGCCGTA TGGCCTTGAG
AAAACAGTTA CTCTTGGCAT AGTTAGCAGT CTGCATAGAG ATATCAATTC ACTAGGTTTT
TCTGATAAAA GGCTTGATCT AATTCAAACA GATGCCGCAA TTAACCCAGG TAATTCTGGA
GGTCCGCTCA TAAATTCTAA TGGCCAGGTT ATTGGCATAA ATACACTCGT TAGAAGTGGA
CCTGGAGCTG GCCTAGGTTT TGCAATACCT ATAAATTTAG CTAAAAATGT TTCTGACCAA
TTATTAGAGA ATGGTGAAGT TATTCATCCT TATTTAGGAG TACAATTAAT ATCCTTAAAT
CCTAAAATGG CTAAACAACA CAACGAAGAT CCTAATGCAA TTGTTCAATT ACCCGAGAGG
TCCGGAGCTT TAATTCAGTC TATAGTTCCA AATAGTCCTG CAGAAAAAGC AGGTTTGAAA
AGAGGTGACT TAGTAATTGC AGCTGAAAAT ATATCAATAG AAGAACCAAA AACTCTTTTA
GATGAAGTAG AAAAAGCTCA AATTGGAAAA GTATTTCTTT TAAATGTTGT GAGGGATAAT
AAAGAAATCA AAGTTAATAT TAAACCTGAA GCACTTCCAG GTTTGACATA A
 
Protein sequence
MRFLKNKFIY LFKLGIVLFA FLINFLPLSE VFALNSLDGH NFVSDAVKNV GPAVVRIDTE 
RLVERQQFDP TLLDPLLRDL LGEPGMAPDR ERGQGSGVII NKNGLVLTNA HVVERVDNVS
VTLADGTNCD GKVLGTDSIT DLALVKIEQL IDSSYAPLGD SEKLEVGDWA IALGTPYGLE
KTVTLGIVSS LHRDINSLGF SDKRLDLIQT DAAINPGNSG GPLINSNGQV IGINTLVRSG
PGAGLGFAIP INLAKNVSDQ LLENGEVIHP YLGVQLISLN PKMAKQHNED PNAIVQLPER
SGALIQSIVP NSPAEKAGLK RGDLVIAAEN ISIEEPKTLL DEVEKAQIGK VFLLNVVRDN
KEIKVNIKPE ALPGLT