Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_16941 |
Symbol | |
ID | 4718424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 1436996 |
End bp | 1438126 |
Gene Length | 1131 bp |
Protein Length | 376 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 640079420 |
Product | trypsin-like serine protease |
Protein accession | YP_001010084 |
Protein GI | 123969226 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGTATC TCAAGATTAA ATTTATTAAT TTAATCCAAA TATTCATTGT TATTTGTTTT TGCATACTCA ATTTCTTTCA AGATGCTGAA GTTTTAGCTT TAACTTCTTT TGAAAGTCAT AATTTCGTAT CATCGGCAGT TAAAAATATT GGCCCTGCAG TTGTAAAAAT TGACACTGAG CGCTTGGTAG AGAGGCAACA ATTTGATCCT ACTTTACTTG ACCCTTTATT AAGGGATTTA CTTGGCGAGC AAGGCATTAC TCCTGAAAGG GAGAGAGGAC AAGGCTCCGG GGTTATCATT AATGAAAATG GTTTGGTTCT TACAAACGCT CATGTCGTAG AAAGAGTCGA TAATGTTTCA GTTACTTTGG CCGATGGATC TATTTGTGAT GGTAAAGTTT TGGGCACGGA TACAGTAACT GATCTTGCTT TAGTAAAAAT TGATGAAGAT ACTTATTCTG GTTTTGCTCC ACTTGGAAAT TCTGAAGATC TTGAAGTTGG GGATTGGGCA ATAGCTCTTG GTACTCCCTA TGGTCTTGAA AAAACAGTTA CTTTAGGGAT TGTAAGCAGC CTGCATAGAG ATATCAGTAG TTTAGGATTT TCAGATAAAA GGTTGGATCT TATTCAGACT GATGCGGCAA TAAATCCAGG AAATTCTGGG GGACCACTAA TAAATGCTAA TGGCGAGGTA ATTGGAATAA ATACATTAGT AAGAAGTGGC CCTGGTGCTG GTTTAGGTTT TGCGATTCCC ATCAATCTAG CTAAAAGTGT TTCTGATCAG CTACTCAAAA ATGGAGAAGT TATTCATCCA TATTTAGGGG TACAATTAAT TTCTTTAAAT CCTAGAATTG CTAAAGAACA TAATCTAGAT CCCAATTCTT TAGTGCAATT ACCCGAAAGA AATGGAGCTC TTATTCAATC AGTAATACCT AATAGCCCCG CTGAAAAAGC TGGTTTAAGA AGAGGCGATT TAGTCATAGC AGCCCAAAAC ATCTCTATAA ATGAGCCTAA AACTTTACTA GATGAAGTAG AAAAAGCTCA GATAGGAAAA GTATTTCTTT TAAATATTGT GAGAGATAAT AAAGAGATAC AGATAAATAT CAGACCAGAA CCTCTACCAG GTTTGACATA A
|
Protein sequence | MKYLKIKFIN LIQIFIVICF CILNFFQDAE VLALTSFESH NFVSSAVKNI GPAVVKIDTE RLVERQQFDP TLLDPLLRDL LGEQGITPER ERGQGSGVII NENGLVLTNA HVVERVDNVS VTLADGSICD GKVLGTDTVT DLALVKIDED TYSGFAPLGN SEDLEVGDWA IALGTPYGLE KTVTLGIVSS LHRDISSLGF SDKRLDLIQT DAAINPGNSG GPLINANGEV IGINTLVRSG PGAGLGFAIP INLAKSVSDQ LLKNGEVIHP YLGVQLISLN PRIAKEHNLD PNSLVQLPER NGALIQSVIP NSPAEKAGLR RGDLVIAAQN ISINEPKTLL DEVEKAQIGK VFLLNIVRDN KEIQINIRPE PLPGLT
|
| |