Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_28721 |
Symbol | |
ID | 4778589 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2543667 |
End bp | 2545787 |
Gene Length | 2121 bp |
Protein Length | 706 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 640088395 |
Product | hypothetical protein |
Protein accession | YP_001018867 |
Protein GI | 124024560 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [R] General function prediction only |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCTTGTT GTTGCTGCCA GAGGGGAGAG AAAGAACACC TGCCATACCC TGAACCCAGC AGCACCCAGC AGATGTCACG CAGAAATACT GCACTTGCTG CTGCTCTATC GCTGCTGCCA ATAGGACAAC CACTGCTAGT GGGCACCCTT GGCATCACAA CAGCAACAAC AGCAGTCGTT CTGCAAGCGC CACCAGCAGT TGCTCAAGAT GCTTCTGCTG TGGCACGCAT CGCCAAGGCA ATCACTGTTC GCATTGAAGG TGCCACCCAA GGTTCAGGAG TGTTGGTCAA GCAAGAAGGC AATCGCTACA CGGTGCTTAC GGCATGGCAT GTAGTGAGTG GCAATAGACC AGGAGAAGAG GTTGGGATCT ATACCTCTGA TGGGAATGAG CACCAACTAG AGCAAGGCAG CATCCAAAGG TTGGGAGAGG TTGATATGGC AGTGCTCTCC TTCTCTAGTG GCAGTGCTTA TGAGGTTGCA AATGTTGGTG ATATCAAAAA GGTCAAGCAT GATCAACCGA TTTATGTGGC AGGTTTTCCT TTAAATAACT CACAAAACCT TCGCTATGAA ACTGGAGAGG TTGTTGCTAA TGCAGAAGTA GGAATTGATC AGGGTTATCA ACTGCTATAC GACAACGAAA CAGTCGCTGG AATGAGTGGA GGCGTGCTGC TTAATGCTGA TGGAGATTTG GTGGGACTTC ATGGCAGGGG AGAGAAAGAT GAACAAGCAT CAAGTGGTGA GTTAGTAATG AAGACAGGAG TTAATCAAGG CGTGCCAATT ACTTACTACA ACCTCTTTGC AAGTGGTGCT CCTGTTGTTG TTGCCAAGAA CACTGCAACC ACTGCTGATG ACTATCTGGC GCAAGCAAAA GCATCCCAGA CAAAGAAGGG AAGAGAGCAG ACAGTTATTA AGTTAACAAC CCAGGCATTA GCATTACGAT CCAGTATGCG GGGATACTTC CTTCGTGCTT ATGCCAAGGA TGCATTAAAC GATTACCAAG GAGCAATTTC TGATTTAAAC AAGGCACTAG AGATTAATCC GCAGTATGCT CCTGCCTACG AAAACCGTGG TAATGCCAAG AAGAAATTAA AAGATTATCA AGGAGCAATT ACTGATTACA ATAAGGCAAT AGAGATTAAT CCGCAGCATA CCGGACCCTT TAATAACCGT GGTAATACCA AGAAGCAATT AAAAGATTAT CAAGGAGCAA TTGCTGATTA CAACAAGGCA ATAGAACTTG ATCCACAGCA TGCCTATGGC TACTACAACC GTGGTCTTGC AAAAAAGAAT TTAGGTGATT ATCAAGGAGC AATTGCTGAT TACAACAAGG CAATAACAAT TAATCCACAG CATGCCGATG CCTTCAATAA CCGTGGTAAT GCCAAGGATG GATTAGGAGA TACTCAAGGA GCAATATCTG ATTACAACAA GGCAATAGAA CTTGATCCAC AGCATACTCT TGCCTACAAC AACCGTGGTA GTTCCAAGAG TGATTTAAAG GATTATCAAG GAGCAATTCC TGATTACAAC AAGGCAATAG AGATTAATCC ACAGTATGCC GATGCCTTCA ATAACCGTGG TATTGCTAAG GATAATTCAG GAGATCATCA AGGAGCAATC GCTGATTACA ACAAGGCAAT AGAACTTGAT CCACAGCATG CCTTTGCCTT CAATAACCGT GGTATTGCTA AGGATAATTT AGGAGATCAT CAAGGAGCAA TCGCTGATTA CAACAAGGCA ATAGAGATTG ATCCGAAGTA TGCAAGTGCC TACAACAACC GTGGATATGC CAAGAGTGAT TTAAAAGATT ATCAAGGCGC AATTGCTGAT TTCAACAAGG CAATCGCAAT TAATCCGCAG TATGCCCTTG CCTACACCAA CCGTGGATGG TTTAAATATC TACAAGGAGA TTTTCAAGAT GCTCTTAAGG ATGCTAACAA AGCACTTGCA ATTACTCCAA ATGATGGTGC CACATTAGAT ACACGTGGTC TTGCAAAACA TGCGCTTGGC CAAGATAGAA GTGCCTGTAA AGATTTAAAG CGGGCATCGT CTCTAGGTTA TCAGGGAACC TCCCAATATC TACAAAGTGA AGAAGGTGCC TGGTGCGACA ATATGCGTTG A
|
Protein sequence | MPCCCCQRGE KEHLPYPEPS STQQMSRRNT ALAAALSLLP IGQPLLVGTL GITTATTAVV LQAPPAVAQD ASAVARIAKA ITVRIEGATQ GSGVLVKQEG NRYTVLTAWH VVSGNRPGEE VGIYTSDGNE HQLEQGSIQR LGEVDMAVLS FSSGSAYEVA NVGDIKKVKH DQPIYVAGFP LNNSQNLRYE TGEVVANAEV GIDQGYQLLY DNETVAGMSG GVLLNADGDL VGLHGRGEKD EQASSGELVM KTGVNQGVPI TYYNLFASGA PVVVAKNTAT TADDYLAQAK ASQTKKGREQ TVIKLTTQAL ALRSSMRGYF LRAYAKDALN DYQGAISDLN KALEINPQYA PAYENRGNAK KKLKDYQGAI TDYNKAIEIN PQHTGPFNNR GNTKKQLKDY QGAIADYNKA IELDPQHAYG YYNRGLAKKN LGDYQGAIAD YNKAITINPQ HADAFNNRGN AKDGLGDTQG AISDYNKAIE LDPQHTLAYN NRGSSKSDLK DYQGAIPDYN KAIEINPQYA DAFNNRGIAK DNSGDHQGAI ADYNKAIELD PQHAFAFNNR GIAKDNLGDH QGAIADYNKA IEIDPKYASA YNNRGYAKSD LKDYQGAIAD FNKAIAINPQ YALAYTNRGW FKYLQGDFQD ALKDANKALA ITPNDGATLD TRGLAKHALG QDRSACKDLK RASSLGYQGT SQYLQSEEGA WCDNMR
|
| |