Gene A9601_16941 details

Gene Information

Locus tag: A9601_16941
Symbol:
ID: 4718424
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Prochlorococcus marinus str. AS9601
Kingdom: Bacteria
Replicon accession: NC_008816
Strand:
Start bp: 1436996
End bp: 1438126
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 36%
IMG OID: 640079420
Product: trypsin-like serine protease
Protein accession: YP_001010084
Protein GI: 123969226
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
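
The length fields in this record follow from the coordinates by simple arithmetic. The following is a minimal sketch rather than anything taken from the database itself: the variable names are illustrative, and it assumes that the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length.

```python
# Values copied from the record above.
start_bp = 1436996
end_bp = 1438126

# Assumption: coordinates are 1-based and inclusive,
# so both end points count toward the length.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1131 376
```

Both results agree with the Gene Length and Protein Length fields listed in the record.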


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAGTATC TCAAGATTAA ATTTATTAAT TTAATCCAAA TATTCATTGT TATTTGTTTT 
TGCATACTCA ATTTCTTTCA AGATGCTGAA GTTTTAGCTT TAACTTCTTT TGAAAGTCAT
AATTTCGTAT CATCGGCAGT TAAAAATATT GGCCCTGCAG TTGTAAAAAT TGACACTGAG
CGCTTGGTAG AGAGGCAACA ATTTGATCCT ACTTTACTTG ACCCTTTATT AAGGGATTTA
CTTGGCGAGC AAGGCATTAC TCCTGAAAGG GAGAGAGGAC AAGGCTCCGG GGTTATCATT
AATGAAAATG GTTTGGTTCT TACAAACGCT CATGTCGTAG AAAGAGTCGA TAATGTTTCA
GTTACTTTGG CCGATGGATC TATTTGTGAT GGTAAAGTTT TGGGCACGGA TACAGTAACT
GATCTTGCTT TAGTAAAAAT TGATGAAGAT ACTTATTCTG GTTTTGCTCC ACTTGGAAAT
TCTGAAGATC TTGAAGTTGG GGATTGGGCA ATAGCTCTTG GTACTCCCTA TGGTCTTGAA
AAAACAGTTA CTTTAGGGAT TGTAAGCAGC CTGCATAGAG ATATCAGTAG TTTAGGATTT
TCAGATAAAA GGTTGGATCT TATTCAGACT GATGCGGCAA TAAATCCAGG AAATTCTGGG
GGACCACTAA TAAATGCTAA TGGCGAGGTA ATTGGAATAA ATACATTAGT AAGAAGTGGC
CCTGGTGCTG GTTTAGGTTT TGCGATTCCC ATCAATCTAG CTAAAAGTGT TTCTGATCAG
CTACTCAAAA ATGGAGAAGT TATTCATCCA TATTTAGGGG TACAATTAAT TTCTTTAAAT
CCTAGAATTG CTAAAGAACA TAATCTAGAT CCCAATTCTT TAGTGCAATT ACCCGAAAGA
AATGGAGCTC TTATTCAATC AGTAATACCT AATAGCCCCG CTGAAAAAGC TGGTTTAAGA
AGAGGCGATT TAGTCATAGC AGCCCAAAAC ATCTCTATAA ATGAGCCTAA AACTTTACTA
GATGAAGTAG AAAAAGCTCA GATAGGAAAA GTATTTCTTT TAAATATTGT GAGAGATAAT
AAAGAGATAC AGATAAATAT CAGACCAGAA CCTCTACCAG GTTTGACATA A
 
Protein sequence
MKYLKIKFIN LIQIFIVICF CILNFFQDAE VLALTSFESH NFVSSAVKNI GPAVVKIDTE 
RLVERQQFDP TLLDPLLRDL LGEQGITPER ERGQGSGVII NENGLVLTNA HVVERVDNVS
VTLADGSICD GKVLGTDTVT DLALVKIDED TYSGFAPLGN SEDLEVGDWA IALGTPYGLE
KTVTLGIVSS LHRDISSLGF SDKRLDLIQT DAAINPGNSG GPLINANGEV IGINTLVRSG
PGAGLGFAIP INLAKSVSDQ LLKNGEVIHP YLGVQLISLN PRIAKEHNLD PNSLVQLPER
NGALIQSVIP NSPAEKAGLR RGDLVIAAQN ISINEPKTLL DEVEKAQIGK VFLLNIVRDN
KEIQINIRPE PLPGLT
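
The gene and protein sequences above can be cross-checked against each other and against the GC content field. The sketch below is one possible way to do that and is not part of the database record: the helper names `clean`, `gc_content` and `translate` are hypothetical, and the code assumes the sequence is pasted in the 10-base block layout shown above. NCBI translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in which codons may act as starts. This gene starts with ATG, so the sketch does not treat alternative start codons specially.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order ('*' marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the block layout."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Return the percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    # Any trailing bases that do not form a full codon are ignored.
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence above.
print(translate("ATGAAGTATC TCAAGATTAA ATTTATTAAT"))  # MKYLKIKFIN
```

Passing the full gene sequence to `translate` and `gc_content` in the same way allows the result to be compared with the protein sequence and the GC content field of the record.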