Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | A9601_08061 |
Symbol | |
ID | 4717512 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. AS9601 |
Kingdom | Bacteria |
Replicon accession | NC_008816 |
Strand | - |
Start bp | 693631 |
End bp | 694848 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 26% |
IMG OID | 640078520 |
Product | insulinase family protein |
Protein accession | YP_001009199 |
Protein GI | 123968341 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.209165 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAAGA GATATTTTTT AAATAATAAA AAAAGAAATT TTTCAATTGC TTCAATTTGG ATTAAAGGGG GGAGTGATAT GGATAGTACT GGCAAAAAAG GTATTAACAA GATCCTTTGT TCATTACTTA CCAGAGGATG TGAAGGTTTT AACAATTTAA CTCTCTCTGA ATATATTGAG TCCTATGGAG CAGAATTAAA TCAAGAAATA TTTGAAGATG GAATTTCAAT AAGTATTAAA TCCCTAAATG AACATTTCAG CAAATTATTC CCTTTATTAG AGTTAATAAT TAATAAGCCA ATCCTTTCGG AAACTGAATT TAAAAAAGTA AAAAAATCTT CTATTGATCA CATTAAAAAA GATAAAGAGA ATCCATTCAA TATCTGTTTT GAAAAATGGA GAAAAATTGT TTATTCAAAT CATCCTTATG CCTTTAACAC AATAGGCAAT GCTAGTGATG TCTCAAAGAT TACCTATGAA GATATTTTAC TTGAGTTTAA AAATTTAAAA AAAAGAGAAA AGTATTTAAT TTCAAATAAT CCTGAAATAA ATGGAGAAAA TTATGGAACA CTTGAAAAAA AAATCTTAAA AGAAAAATCA GATCCTTTAA ATCACAATTT AAAAACTACA AATAGATTTG ATTACATTAG TAATGATACA AATCAAACAA TAATAATGAT GGGTGACCAA ACTTGCTCGC GAAGAAGTAG TGAATATTTT CCTCTTAAGG TTTTGGAGTC ATATTTATCT TATGGAATGA GCGCTGCTTT ATTTAAACTT TTTAGAGAAA AACATGGTAT CACTTACGAT TTAGGTGTTT ATTATCCTAT CAGGAGTGGA AATGCCCCAT TTTTAATTTA TTTATCCGTA TCTAATGATC AAGCACTTTT TGCTTTTGAA CTTTTATCAA CACTATGGAA AAATTTACTT TTAAATCCGT TGACTGATGC TGAAATATTT TTAGCAAAAG AAAAACTAAA AGGTTCTTTT TTATTAGGAA ATCAATCACT AGATGAAATT TTACACAGAA AGATACAGTT AGTTAGTTAT GGTATTTCAC CAATTTCAGA GAACGAATTA AATTCAAAAA TAGAGGAAAT TTCTTCGTTA GATATTTTGA CATTAACTAA TAAGTATTTT TCAAAACCTT TTCTGTGTAT TTCTGGAAAT AAAAATATAT GTTTAGAAAT TTCTAATAGG TGGAAGAAAA ACTTTTAG
|
Protein sequence | MLKRYFLNNK KRNFSIASIW IKGGSDMDST GKKGINKILC SLLTRGCEGF NNLTLSEYIE SYGAELNQEI FEDGISISIK SLNEHFSKLF PLLELIINKP ILSETEFKKV KKSSIDHIKK DKENPFNICF EKWRKIVYSN HPYAFNTIGN ASDVSKITYE DILLEFKNLK KREKYLISNN PEINGENYGT LEKKILKEKS DPLNHNLKTT NRFDYISNDT NQTIIMMGDQ TCSRRSSEYF PLKVLESYLS YGMSAALFKL FREKHGITYD LGVYYPIRSG NAPFLIYLSV SNDQALFAFE LLSTLWKNLL LNPLTDAEIF LAKEKLKGSF LLGNQSLDEI LHRKIQLVSY GISPISENEL NSKIEEISSL DILTLTNKYF SKPFLCISGN KNICLEISNR WKKNF
|
| |