Gene P9301_16811 details

Gene Information

Locus tag: P9301_16811
Symbol:
ID: 4912094
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. MIT 9301
Kingdom: Bacteria
Replicon accession: NC_009091
Strand:
Start bp: 1412773
End bp: 1413903
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 36%
IMG OID: 640161278
Product: trypsin-like serine protease
Protein accession: YP_001091905
Protein GI: 126697019
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
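
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1412773
END_BP = 1413903
STOP_CODON_LENGTH = 3  # assumption: one terminal stop codon, not translated


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Number of amino acids encoded by an unspliced CDS of gene_len bp."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len - STOP_CODON_LENGTH) // 3


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))  # prints "1131 376"
```

Under these assumptions the result agrees with the listed Gene Length (1131 bp) and Protein Length (376 aa).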


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAATTTA TCAAGATTAA ATTTATTAAT TTAATCCAAA TTTTCATTAT TTTTTGTTTT 
TGTTTAGTCA ATTTCTCTCA AAAAGCTGAA GTTTTAGCTT TAACATCTTC AGAAAGTCAT
AATTTCGTAT CATCCGCAGT TAAAAATATT GGCCCTGCAG TTGTAAAAAT TGACACTGAG
CGCTTGGTAG AGAGGCAACA ATTTGATCCT ACTTTACTTG ACCCATTATT AAGGGATTTA
CTTGGTGAGC AAGGCATTAC TCCTGAAAGG GAGAGAGGAC AAGGCTCTGG GGTTATCATT
AATGAAAATG GTTTGGTTCT TACAAACGCT CATGTCGTAG AAAGAGTCGA TAATGTTTCA
GTTACTTTGG CAGATGGATC TATTTGTGAT GGTGAAGTTT TGGGGACGGA TACAGTAACT
GATCTTGCTT TAGTAAAAAT TGATGAAGAT GCTTATTCTG GTTTTGCTCC ACTTGGAAAT
TCTGAAGATC TTGAAGTTGG GGATTGGGCA ATAGCTCTTG GTACTCCTTA TGGTCTTGAA
AAAACAGTTA CCTTAGGGAT TGTAAGCAGC CTGCATAGAG ATATTAATAG TTTAGGATTT
TCAGATAAAA GGTTGGATCT TATTCAGACT GATGCGGCAA TAAATCCAGG AAATTCTGGG
GGACCACTCA TAAATTCCAA TGGCGAGGTA ATTGGAATCA ATACATTAGT AAGAAGTGGC
CCTGGAGCAG GTCTAGGTTT TGCGATTCCC ATCAATCTGG CTAAAAGTGT TTCTGATCAG
CTACTCAAAA ATGGGGAAGT GATTCATCCA TATTTAGGGG TACAATTAAT TTCTTTAAAT
CCTAGAATTG CTAAAGAACA TAATCGAGAT CCCAATTCTT TAGTTCAATT ACCCGAAAGA
AACGGAGCTC TAATTCAATC AGTAATACCT AATAGCCCCG CTGAAAAAGC TGGTTTAAGA
AGAGGAGATT TAGTAATAGC AGCCGAAAAT ATCTCTATAA ATGAGCCTAA GACTTTATTA
GATGAAGTAG AAAAAGCTCA GATAGGAAAA GTATTTCTTT TAAATATTTT GAGAGATAAT
AAAGAGATAC AGATAAATAT CAAACCAGAA CCTCTCCCAG GTTTGACATA A
 
Protein sequence
MKFIKIKFIN LIQIFIIFCF CLVNFSQKAE VLALTSSESH NFVSSAVKNI GPAVVKIDTE 
RLVERQQFDP TLLDPLLRDL LGEQGITPER ERGQGSGVII NENGLVLTNA HVVERVDNVS
VTLADGSICD GEVLGTDTVT DLALVKIDED AYSGFAPLGN SEDLEVGDWA IALGTPYGLE
KTVTLGIVSS LHRDINSLGF SDKRLDLIQT DAAINPGNSG GPLINSNGEV IGINTLVRSG
PGAGLGFAIP INLAKSVSDQ LLKNGEVIHP YLGVQLISLN PRIAKEHNRD PNSLVQLPER
NGALIQSVIP NSPAEKAGLR RGDLVIAAEN ISINEPKTLL DEVEKAQIGK VFLLNILRDN
KEIQINIKPE PLPGLT
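
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any length or composition check. The sketch below is one possible way to do that and to compute a GC percentage comparable to the GC content field; the helper names are hypothetical, and only the first row of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by removing all whitespace; uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


if __name__ == "__main__":
    # First row of the gene sequence in this record (six blocks of ten bases).
    first_row = (
        "ATGAAATTTA TCAAGATTAA ATTTATTAAT "
        "TTAATCCAAA TTTTCATTAT TTTTTGTTTT"
    )
    print(len(clean_sequence(first_row)))
    print(round(gc_percent(first_row), 1))
```

Pasting the full gene sequence into `gc_percent` gives a value to compare against the listed GC content, and `len(clean_sequence(...))` gives a value to compare against the listed Gene Length.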