Gene Information |
Locus tag | P9301_08051 |
Symbol | |
ID | 4911660 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9301 |
Kingdom | Bacteria |
Replicon accession | NC_009091 |
Strand | - |
Start bp | 693308 |
End bp | 694525 |
Gene Length | 1218 bp |
Protein Length | 405 aa |
Translation table | 11 |
GC content | 28% |
IMG OID | 640160387 |
Product | insulinase family protein |
Protein accession | YP_001091029 |
Protein GI | 126696143 |
COG category | [R] General function prediction only |
COG ID | [COG0612] Predicted Zn-dependent peptidases |
TIGRFAM ID | |
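The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the record above.
length = gene_length_bp(693308, 694525)
print(length)                     # 1218
print(protein_length_aa(length))  # 405
```

The strand field does not affect either calculation, since both depend only on the span of the feature.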
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.126573 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGTTAGAAA GATATTTTTT AAATATTAAA AAAAGGAATT TTTCAACTGC TTCAATTTGG ATTAAAGGAG GGAGTGATGC GGATAGTGTT GGCAAAAAAG GTATAAACAA GATCCTTAGT TCATTACTTA CCAGAGGATG TGAGGGTTTT AATAATTTCA CTCTTTCAGA GTATATTGAG TCCTACGGAG CAGAATTAAA TCAAGAAGTA TTTGAAGATG GTATTTCAAT AAGCATTAAA TCCCTAAATG AACATTTCAG CAAATTATTC CCCTTATTAG ATTTAATAAT TAATAAACCA ACACTCTTAG AAAGAGAATT TGAAAAAGTA AAAAAATCCT CTATTGATTT TTTAAAAAAG GATAAAGAGA ATCCATTTAA TATCTGTTTT GAAAAATGGA GAAGAATTGT TTACTCAAAT CATCCTTATG CCTTTAATAC AAATGGCAAT GAAAATGATG TCTCAAAGAT TACATATGAA GATGTTTTGC TGGAATTTAA AAATTTCAAA AGTCGAGATA AGTATTTGAT TTCAAATAAT TCAGAAATAG ATGGAGTAAG TATAGAAAAA TTAGACAAAA AACCCTTAGT AGAAAAATTT AGACCTCTAA ATCATGATTT AAGTCCAAAC AATCGATTTG ATTTCAATAA TAATAATTCA AATCAAACAA TAATAATGTT TGGCAACCAA ACTTGCTCTC GTAAAAGTAG TGAATATTTG CCTCTTAAGG TTTTGGAGTC GTATCTATCT TATGGAATGA GCGCTGCTTT ATTTAAACTT TTTAGGGAAA AAAATGGGAT CACTTACGAT TTAGGTGTTT ATTATCCAGT TAGGAGAAGG AATGCTCCAT TTTTAGTATA TTTATCAGTA TCAAATAAAA AAGCCCTTTT TGCTTTTGAA CTTTTATCAA CTTTATGGAA AGATTTACTT TTAAATCCTT TGATTGATAA TGAAATACTA TTGGCTAAAG AAAAACTAAA AGGTTCTTTT CTATTGGGAA ATCAATCACT AGATGAAATT TTACAGCGAA AGATACAGTT AATTAGTTAT GGTGTTACCC CAATTTCTGA GAGTGATTTA AATTCTAAAA TAGACGAAAT ATCTTCATTA GATATTCTTA AATTAACAAA CAAGTATTTT TCAAAACCTT TTCTGAGTAT TTCTGGTAGT AAGAATATAT GTTTAGAAAT TATTAAAAGT TGGAAGCAGA ACTTTTGA
Protein sequence | MLERYFLNIK KRNFSTASIW IKGGSDADSV GKKGINKILS SLLTRGCEGF NNFTLSEYIE SYGAELNQEV FEDGISISIK SLNEHFSKLF PLLDLIINKP TLLEREFEKV KKSSIDFLKK DKENPFNICF EKWRRIVYSN HPYAFNTNGN ENDVSKITYE DVLLEFKNFK SRDKYLISNN SEIDGVSIEK LDKKPLVEKF RPLNHDLSPN NRFDFNNNNS NQTIIMFGNQ TCSRKSSEYL PLKVLESYLS YGMSAALFKL FREKNGITYD LGVYYPVRRR NAPFLVYLSV SNKKALFAFE LLSTLWKDLL LNPLIDNEIL LAKEKLKGSF LLGNQSLDEI LQRKIQLISY GVTPISESDL NSKIDEISSL DILKLTNKYF SKPFLSISGS KNICLEIIKS WKQNF
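The sequences above are displayed in space-separated blocks of ten characters. A minimal sketch of how such a record might be cleaned, checked for GC content, and translated is given below. The helper names are hypothetical. The codon table is built from the amino-acid string for NCBI translation table 11, whose codon assignments match the standard code. Alternative start codons, which table 11 permits, are not handled here; this gene starts with ATG, so that simplification does not matter for this record.

```python
from itertools import product

# Amino acids for NCBI translation table 11, in T, C, A, G codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
prefix = clean("ATGTTAGAAA GATATTTTTT AAATATTAAA")
print(translate(prefix))  # MLERYFLNIK
```

Running `gc_fraction` on the full cleaned gene sequence would give the value to compare against the GC content field, and `translate` on the same input would give the string to compare against the protein sequence with its spaces removed.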