Gene P9303_28701 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagP9303_28701 
Symbol 
ID4778393 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProchlorococcus marinus str. MIT 9303 
KingdomBacteria 
Replicon accessionNC_008820 
Strand
Start bp2540760 
End bp2542508 
Gene Length1749 bp 
Protein Length582 aa 
Translation table11 
GC content45% 
IMG OID640088393 
Producthypothetical protein 
Protein accessionYP_001018865 
Protein GI124024558 
COG category[O] Posttranslational modification, protein turnover, chaperones
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCGCC GCAGAAGAAC GGCACTTGCT GCTGCACTCT CTTTGCTGCC AATAGGACAA 
CCTCTGCTCC TGGGCACTCT TACTGGCACC ACAACTGCAA CCACAGCAGT CATTCTTCAA
GCAGCACCAG TATTTGCTCA GGATGTTTCT GCTGTTGCCC GTATCGCCAA GGCAATCACT
GTTCGCATAG AAGGTGCAAC ACAAGGTTCA GGGGTGCTCG TCAAGCAAGA AGGCAATCGC
TACACGGTGC TTACGGCATG GCATGTAGTT AGTGGCAATA GACCAGGAGA AGAGGTTGGG
ATCTATACCT CTGATGGGAA TGAGCACCAA CTAGAGCAAG GCAGCATCCA AAGGTTGGGA
GAGGTTGATA TGGCAGTGCT CTCCTTCTCT AGTGGCAGTG CTTATGAGGT GGCAAATGTT
GGTGATATCA AAAAGGTCAA GCATGATCAA CCCATTTATG TGGCAGGTTT TCCTTTAAAT
AACTCACAAA ACCTTCGCTA TGAAACTGGA GAGGTTGTTG CTAATGCAGA GGTAGGAATT
GATCAGGGGT ATCAACTCCT TTATGACAAC ACAACAGTCG CTGGAATGAG TGGTGGGGTG
CTGCTGAATT CTGATGGAGA TTTGGTGGGA CTTCATGGCA GGGGAGAGAG AGATGAACAG
GCATCAAGTG GTGAGTTAGT AATGAAGACA GGGGTGAATC AAGGCGTGCC AATTACTTAC
TACAACCTCT TTGCAAGTGG TGCTCCTGTT GTTGTTGCCA AGAACACTGC AACCACTGCT
GATGACTATC TGGCGCAAGC AAAAGCATCC CAGTCAAGGA AGGGAAGAGA GCAGACAGTT
ATTAAGTTAA CAACCCAGGC ATTAGCATTG CGATCAAGTG GAGGAGGATA CTTTCTTCGT
GCTTATGCCA AGAAGAAATT AAAAGACTAT CAAGGAGCAA TTGCTGATTA CAGCAAGGCA
CTAGAGATTA ATCCGGAGGA TGCTAATACC TTCAACAACC GTGGTAATGC CAAGCATGGA
TTAGGAGATT ATCAAGGAGC AATATCTGAT TACACCAAGG CAATAGAACT TGATCCACAG
CATGCTCTTG CCTACGACAA CCGTGGTTAT TCCAAGCATG ACTTAAAAGA TTATCAAGCA
GCAATTGCAG ATTACAACAA AGCAATAGAG ATTGATCCGC AGTATGCCAT TGCCTACAAC
AACCGTGGTA CTGCTAAGGA TGATTTAAAA GATTATCAAG GAGCAATCGC TGATTACAAC
AAGGCAATAG AACTTGATCC ACAGCATGCC TTTGCCTTCT CCAACCGTGG TATTACCAAG
AGAAACTTAG GAGATACTCA AGGAGCAATC GCTGATTACA ACAAGGCAAT AGAGATTAAT
CCGCAGAATG CCATTGCTTA CAACAACCGT GGTCTTGCTA AGAGTAATTT AGGTAGTTAT
CAAGAAGCAA TCGCTGATTG CAACAAGGCA ATTCAGATTG ATCCGCAGTA TGCCGGTGCC
TACAATAGCC GTGGATGGAT AAAATATCTA CAAGGAGATT TTCAAGGTGC TCTTAAGGAT
GCTAACAAAG CACTAGCAAT TGCTCCAAAT GATGGTGCGA CATTAGACAC CCGTGGTCTT
GCAAAACATG CGCTTGGTCA AGATAGAAGT GCCTGTAAAG ATTTAAAGAG GGCATCGTCT
CTAGGTTATC AGGGAACCTC CCAATATCTA CAAAGTGAAG AAGGTGCCTG GTGCAGCAAT
ATGCGATGA
 
Protein sequence
MTRRRRTALA AALSLLPIGQ PLLLGTLTGT TTATTAVILQ AAPVFAQDVS AVARIAKAIT 
VRIEGATQGS GVLVKQEGNR YTVLTAWHVV SGNRPGEEVG IYTSDGNEHQ LEQGSIQRLG
EVDMAVLSFS SGSAYEVANV GDIKKVKHDQ PIYVAGFPLN NSQNLRYETG EVVANAEVGI
DQGYQLLYDN TTVAGMSGGV LLNSDGDLVG LHGRGERDEQ ASSGELVMKT GVNQGVPITY
YNLFASGAPV VVAKNTATTA DDYLAQAKAS QSRKGREQTV IKLTTQALAL RSSGGGYFLR
AYAKKKLKDY QGAIADYSKA LEINPEDANT FNNRGNAKHG LGDYQGAISD YTKAIELDPQ
HALAYDNRGY SKHDLKDYQA AIADYNKAIE IDPQYAIAYN NRGTAKDDLK DYQGAIADYN
KAIELDPQHA FAFSNRGITK RNLGDTQGAI ADYNKAIEIN PQNAIAYNNR GLAKSNLGSY
QEAIADCNKA IQIDPQYAGA YNSRGWIKYL QGDFQGALKD ANKALAIAPN DGATLDTRGL
AKHALGQDRS ACKDLKRASS LGYQGTSQYL QSEEGAWCSN MR