Gene P9303_28721 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_28721 
Symbol 
ID4778589 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2543667 
End bp2545787 
Gene Length2121 bp 
Protein Length706 aa 
Translation table11 
GC content44% 
IMG OID640088395 
Producthypothetical protein 
Protein accessionYP_001018867 
Protein GI124024560 
COG category[O] Posttranslational modification, protein turnover, chaperones
[R] General function prediction only 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCTTGTT GTTGCTGCCA GAGGGGAGAG AAAGAACACC TGCCATACCC TGAACCCAGC 
AGCACCCAGC AGATGTCACG CAGAAATACT GCACTTGCTG CTGCTCTATC GCTGCTGCCA
ATAGGACAAC CACTGCTAGT GGGCACCCTT GGCATCACAA CAGCAACAAC AGCAGTCGTT
CTGCAAGCGC CACCAGCAGT TGCTCAAGAT GCTTCTGCTG TGGCACGCAT CGCCAAGGCA
ATCACTGTTC GCATTGAAGG TGCCACCCAA GGTTCAGGAG TGTTGGTCAA GCAAGAAGGC
AATCGCTACA CGGTGCTTAC GGCATGGCAT GTAGTGAGTG GCAATAGACC AGGAGAAGAG
GTTGGGATCT ATACCTCTGA TGGGAATGAG CACCAACTAG AGCAAGGCAG CATCCAAAGG
TTGGGAGAGG TTGATATGGC AGTGCTCTCC TTCTCTAGTG GCAGTGCTTA TGAGGTTGCA
AATGTTGGTG ATATCAAAAA GGTCAAGCAT GATCAACCGA TTTATGTGGC AGGTTTTCCT
TTAAATAACT CACAAAACCT TCGCTATGAA ACTGGAGAGG TTGTTGCTAA TGCAGAAGTA
GGAATTGATC AGGGTTATCA ACTGCTATAC GACAACGAAA CAGTCGCTGG AATGAGTGGA
GGCGTGCTGC TTAATGCTGA TGGAGATTTG GTGGGACTTC ATGGCAGGGG AGAGAAAGAT
GAACAAGCAT CAAGTGGTGA GTTAGTAATG AAGACAGGAG TTAATCAAGG CGTGCCAATT
ACTTACTACA ACCTCTTTGC AAGTGGTGCT CCTGTTGTTG TTGCCAAGAA CACTGCAACC
ACTGCTGATG ACTATCTGGC GCAAGCAAAA GCATCCCAGA CAAAGAAGGG AAGAGAGCAG
ACAGTTATTA AGTTAACAAC CCAGGCATTA GCATTACGAT CCAGTATGCG GGGATACTTC
CTTCGTGCTT ATGCCAAGGA TGCATTAAAC GATTACCAAG GAGCAATTTC TGATTTAAAC
AAGGCACTAG AGATTAATCC GCAGTATGCT CCTGCCTACG AAAACCGTGG TAATGCCAAG
AAGAAATTAA AAGATTATCA AGGAGCAATT ACTGATTACA ATAAGGCAAT AGAGATTAAT
CCGCAGCATA CCGGACCCTT TAATAACCGT GGTAATACCA AGAAGCAATT AAAAGATTAT
CAAGGAGCAA TTGCTGATTA CAACAAGGCA ATAGAACTTG ATCCACAGCA TGCCTATGGC
TACTACAACC GTGGTCTTGC AAAAAAGAAT TTAGGTGATT ATCAAGGAGC AATTGCTGAT
TACAACAAGG CAATAACAAT TAATCCACAG CATGCCGATG CCTTCAATAA CCGTGGTAAT
GCCAAGGATG GATTAGGAGA TACTCAAGGA GCAATATCTG ATTACAACAA GGCAATAGAA
CTTGATCCAC AGCATACTCT TGCCTACAAC AACCGTGGTA GTTCCAAGAG TGATTTAAAG
GATTATCAAG GAGCAATTCC TGATTACAAC AAGGCAATAG AGATTAATCC ACAGTATGCC
GATGCCTTCA ATAACCGTGG TATTGCTAAG GATAATTCAG GAGATCATCA AGGAGCAATC
GCTGATTACA ACAAGGCAAT AGAACTTGAT CCACAGCATG CCTTTGCCTT CAATAACCGT
GGTATTGCTA AGGATAATTT AGGAGATCAT CAAGGAGCAA TCGCTGATTA CAACAAGGCA
ATAGAGATTG ATCCGAAGTA TGCAAGTGCC TACAACAACC GTGGATATGC CAAGAGTGAT
TTAAAAGATT ATCAAGGCGC AATTGCTGAT TTCAACAAGG CAATCGCAAT TAATCCGCAG
TATGCCCTTG CCTACACCAA CCGTGGATGG TTTAAATATC TACAAGGAGA TTTTCAAGAT
GCTCTTAAGG ATGCTAACAA AGCACTTGCA ATTACTCCAA ATGATGGTGC CACATTAGAT
ACACGTGGTC TTGCAAAACA TGCGCTTGGC CAAGATAGAA GTGCCTGTAA AGATTTAAAG
CGGGCATCGT CTCTAGGTTA TCAGGGAACC TCCCAATATC TACAAAGTGA AGAAGGTGCC
TGGTGCGACA ATATGCGTTG A
 
Protein sequence
MPCCCCQRGE KEHLPYPEPS STQQMSRRNT ALAAALSLLP IGQPLLVGTL GITTATTAVV 
LQAPPAVAQD ASAVARIAKA ITVRIEGATQ GSGVLVKQEG NRYTVLTAWH VVSGNRPGEE
VGIYTSDGNE HQLEQGSIQR LGEVDMAVLS FSSGSAYEVA NVGDIKKVKH DQPIYVAGFP
LNNSQNLRYE TGEVVANAEV GIDQGYQLLY DNETVAGMSG GVLLNADGDL VGLHGRGEKD
EQASSGELVM KTGVNQGVPI TYYNLFASGA PVVVAKNTAT TADDYLAQAK ASQTKKGREQ
TVIKLTTQAL ALRSSMRGYF LRAYAKDALN DYQGAISDLN KALEINPQYA PAYENRGNAK
KKLKDYQGAI TDYNKAIEIN PQHTGPFNNR GNTKKQLKDY QGAIADYNKA IELDPQHAYG
YYNRGLAKKN LGDYQGAIAD YNKAITINPQ HADAFNNRGN AKDGLGDTQG AISDYNKAIE
LDPQHTLAYN NRGSSKSDLK DYQGAIPDYN KAIEINPQYA DAFNNRGIAK DNSGDHQGAI
ADYNKAIELD PQHAFAFNNR GIAKDNLGDH QGAIADYNKA IEIDPKYASA YNNRGYAKSD
LKDYQGAIAD FNKAIAINPQ YALAYTNRGW FKYLQGDFQD ALKDANKALA ITPNDGATLD
TRGLAKHALG QDRSACKDLK RASSLGYQGT SQYLQSEEGA WCDNMR