Gene P9303_04311 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_04311 
Symbol 
ID4776209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp432194 
End bp433282 
Gene Length1089 bp 
Protein Length362 aa 
Translation table11 
GC content55% 
IMG OID640085935 
Producttrypsin-like serine protease 
Protein accessionYP_001016448 
Protein GI124022141 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGTGC TGTTCCCTCT TGCAGCCGGG GTTCAGCCTG TTTGGGCTTT GTCTGGTTTA 
GATGGGACGA CTAGCCATAA CTTCGTTGCT GATGCAGTAA GTCAGGTGGC GCCCGCAGTG
GTTCGCATCG ATACGGAACG CACTGTGCAA CGTCAGCCCT TTGATCCCAC GCTGATTGAT
CCCTTGCTCA GAGATCTCTT GGGCGAGCCA GGAATTGGGC CAGAGCGTGA GCGGGGTCAG
GGTTCGGGTG TCGTGATCGA TGACCAGGGG TTGGTGCTGA CCAATGCCCA CGTGGTTGAA
CGGGTGGATG CGGTCAGCGT CACCCTTGCC GATGGAGATC AACACGATGG TTCGGTTGTT
GGGACGGATC CTGTTACTGA TCTGGCTCTT GTGCGACTGG ATGGGGGCAC ACGTCCTGAG
GCCGCCCCTC TTGGAGATTC TGATGCGCTT GAGGTAGGCG ATTGGGCGAT CGCTCTTGGT
ACTCCCTATG GCCTTGAACG CACCGTCACC CTTGGCATTG TTAGCAGCCT GCATCGCAAT
ATCAGCAGCC TTGGCTTCTC TGATAAACGT CTGGATTTGA TTCAGACCGA TGCCGCGATT
AACCCTGGTA ATTCCGGTGG TCCACTGGTG AATGGTCGTG GTGAGGTGAT CGGTATCAAC
ACACTGGTTC GTTCTGGTCC AGGCGCTGGT TTGGGATTTG CTATTCCGAT CAATTTGGCT
CGACATGTTT CTGAGCAGCT TTTGACCAGT GGGGAGGTGG TGCATCCTTA TTTGGGTGTC
CAATTGGTGC CGCTGACAGC TCGTATTGCC AGGGAGCACA ATCGTGATCC GAATTCGCTG
GTGGAATTAC CCGAACGCTT GGGGGCGCTT GTGCAGAGTG TTTTGCCGGA TAGCCCGGCG
GAACGAGCTG GTTTGCGGCG TGGTGATCTT GTGATTGCGG CAGCTGAAAC ATCAGTCTCT
GATCCACAAA TGCTGCTTAA ACAGGTTGAT CAGGCTGAGA TCGGTGTCCC CTTCTCATTA
AGGATCATGC GCAATGGTCA AGAGATGAGC CTTTCGGTTA ATCCAGCCGC ATTACCTGGC
CTTAGTTGA
 
Protein sequence
MSVLFPLAAG VQPVWALSGL DGTTSHNFVA DAVSQVAPAV VRIDTERTVQ RQPFDPTLID 
PLLRDLLGEP GIGPERERGQ GSGVVIDDQG LVLTNAHVVE RVDAVSVTLA DGDQHDGSVV
GTDPVTDLAL VRLDGGTRPE AAPLGDSDAL EVGDWAIALG TPYGLERTVT LGIVSSLHRN
ISSLGFSDKR LDLIQTDAAI NPGNSGGPLV NGRGEVIGIN TLVRSGPGAG LGFAIPINLA
RHVSEQLLTS GEVVHPYLGV QLVPLTARIA REHNRDPNSL VELPERLGAL VQSVLPDSPA
ERAGLRRGDL VIAAAETSVS DPQMLLKQVD QAEIGVPFSL RIMRNGQEMS LSVNPAALPG
LS