Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_01061 |
Symbol | |
ID | 4716789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 106295 |
End bp | 107368 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 640077804 |
Product | serine protease |
Protein accession | YP_001008501 |
Protein GI | 123967643 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.057763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATTA GACATCCTGC AAAAGTAATA TCCCAAACAT TAAGCAAGCC AAAAATTAAT GAAGTTAATT TATATTCGAA CAAATCTTTT ATAACTAAAG CTGTAGAAAG AACCGGTGCA GCTGTGGTGA CAATTGATAC TCAAAGATAT GTTAAAAAAA GAAATTTTCC AAGAAATTCT CAACTATTTT TAGACCCATA TTTTGAAAGA TTTTTTGGAT TAGATTTAAA TAACGAAAAT CGACCAAGGA TAGAGCAAAA CCAAGGCAGT GGATTTATAT TTGCAGATGG ACTTGTAATG ACCAATGCTC ATGTAGTGAA TGGATCAGAT AAGGTAATTG TTGGTTTAAC CAATGGCAAA AAATTAAACG CTAAACTGAT AGGTCAAGAC TCTTTTACTG ATTTAGCTGT GCTAAAGATT GAAGGGAAAG GGCCTTGGCC AAAAGCAAAA TTGGGCGATT CTGCAAAGAT TAAAGTTGGT GATTGGGCTA TAGCAGTTGG AAATCCATTT GGATTGGAAA ACACAGTTAC GCTTGGTATT ATTAGTAATC TAAATAGAAA CGTAAATCAA TTAGGAATAT ATGATAAAAA ACTTGAACTG ATACAAACTG ACGCTGCTAT TAATCCTGGC AATTCTGGAG GTCCACTGTT GAATAGCGAT GGAGAAGTAA TTGGTATTAA TACCTTGATA AGATCAGGTC CAGGAGCCGG TTTGAGTTTC GCAATCCCAA TTAATAAAGC TAAAGAAATT GCCTATCAAC TTTTAAACAA TGGGAAAGTA ATACATCCTA TGATTGGAAT TAGCCTAATA GAAGAAAGTG TTTCTGAGAG AAAAAATAAT GTCGTAAAAG TTGGATATGT AGTACCGAAC AGTCCAGCTG AAAAAAGTGG AATCAAGATA GATGATATTT TAATTAAAAT AGACAATAAA GATATTGAAA CCGCATCAGA CGTAATAGAA CAAATTAGTA AAAATGGTAT CAAAAAACAA GTAAATATAT TATTAAAGCG TAAAAATAAA TTTATTAAAT TAAAAGTAAT ACCAACTGAT ATTACTAATC TACAAAATAA ATAA
|
Protein sequence | MGIRHPAKVI SQTLSKPKIN EVNLYSNKSF ITKAVERTGA AVVTIDTQRY VKKRNFPRNS QLFLDPYFER FFGLDLNNEN RPRIEQNQGS GFIFADGLVM TNAHVVNGSD KVIVGLTNGK KLNAKLIGQD SFTDLAVLKI EGKGPWPKAK LGDSAKIKVG DWAIAVGNPF GLENTVTLGI ISNLNRNVNQ LGIYDKKLEL IQTDAAINPG NSGGPLLNSD GEVIGINTLI RSGPGAGLSF AIPINKAKEI AYQLLNNGKV IHPMIGISLI EESVSERKNN VVKVGYVVPN SPAEKSGIKI DDILIKIDNK DIETASDVIE QISKNGIKKQ VNILLKRKNK FIKLKVIPTD ITNLQNK
|
| |