Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_28701 |
Symbol | |
ID | 4778393 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2540760 |
End bp | 2542508 |
Gene Length | 1749 bp |
Protein Length | 582 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640088393 |
Product | hypothetical protein |
Protein accession | YP_001018865 |
Protein GI | 124024558 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTCGCC GCAGAAGAAC GGCACTTGCT GCTGCACTCT CTTTGCTGCC AATAGGACAA CCTCTGCTCC TGGGCACTCT TACTGGCACC ACAACTGCAA CCACAGCAGT CATTCTTCAA GCAGCACCAG TATTTGCTCA GGATGTTTCT GCTGTTGCCC GTATCGCCAA GGCAATCACT GTTCGCATAG AAGGTGCAAC ACAAGGTTCA GGGGTGCTCG TCAAGCAAGA AGGCAATCGC TACACGGTGC TTACGGCATG GCATGTAGTT AGTGGCAATA GACCAGGAGA AGAGGTTGGG ATCTATACCT CTGATGGGAA TGAGCACCAA CTAGAGCAAG GCAGCATCCA AAGGTTGGGA GAGGTTGATA TGGCAGTGCT CTCCTTCTCT AGTGGCAGTG CTTATGAGGT GGCAAATGTT GGTGATATCA AAAAGGTCAA GCATGATCAA CCCATTTATG TGGCAGGTTT TCCTTTAAAT AACTCACAAA ACCTTCGCTA TGAAACTGGA GAGGTTGTTG CTAATGCAGA GGTAGGAATT GATCAGGGGT ATCAACTCCT TTATGACAAC ACAACAGTCG CTGGAATGAG TGGTGGGGTG CTGCTGAATT CTGATGGAGA TTTGGTGGGA CTTCATGGCA GGGGAGAGAG AGATGAACAG GCATCAAGTG GTGAGTTAGT AATGAAGACA GGGGTGAATC AAGGCGTGCC AATTACTTAC TACAACCTCT TTGCAAGTGG TGCTCCTGTT GTTGTTGCCA AGAACACTGC AACCACTGCT GATGACTATC TGGCGCAAGC AAAAGCATCC CAGTCAAGGA AGGGAAGAGA GCAGACAGTT ATTAAGTTAA CAACCCAGGC ATTAGCATTG CGATCAAGTG GAGGAGGATA CTTTCTTCGT GCTTATGCCA AGAAGAAATT AAAAGACTAT CAAGGAGCAA TTGCTGATTA CAGCAAGGCA CTAGAGATTA ATCCGGAGGA TGCTAATACC TTCAACAACC GTGGTAATGC CAAGCATGGA TTAGGAGATT ATCAAGGAGC AATATCTGAT TACACCAAGG CAATAGAACT TGATCCACAG CATGCTCTTG CCTACGACAA CCGTGGTTAT TCCAAGCATG ACTTAAAAGA TTATCAAGCA GCAATTGCAG ATTACAACAA AGCAATAGAG ATTGATCCGC AGTATGCCAT TGCCTACAAC AACCGTGGTA CTGCTAAGGA TGATTTAAAA GATTATCAAG GAGCAATCGC TGATTACAAC AAGGCAATAG AACTTGATCC ACAGCATGCC TTTGCCTTCT CCAACCGTGG TATTACCAAG AGAAACTTAG GAGATACTCA AGGAGCAATC GCTGATTACA ACAAGGCAAT AGAGATTAAT CCGCAGAATG CCATTGCTTA CAACAACCGT GGTCTTGCTA AGAGTAATTT AGGTAGTTAT CAAGAAGCAA TCGCTGATTG CAACAAGGCA ATTCAGATTG ATCCGCAGTA TGCCGGTGCC TACAATAGCC GTGGATGGAT AAAATATCTA CAAGGAGATT TTCAAGGTGC TCTTAAGGAT GCTAACAAAG CACTAGCAAT TGCTCCAAAT GATGGTGCGA CATTAGACAC CCGTGGTCTT GCAAAACATG CGCTTGGTCA AGATAGAAGT GCCTGTAAAG ATTTAAAGAG GGCATCGTCT CTAGGTTATC AGGGAACCTC CCAATATCTA CAAAGTGAAG AAGGTGCCTG GTGCAGCAAT ATGCGATGA
|
Protein sequence | MTRRRRTALA AALSLLPIGQ PLLLGTLTGT TTATTAVILQ AAPVFAQDVS AVARIAKAIT VRIEGATQGS GVLVKQEGNR YTVLTAWHVV SGNRPGEEVG IYTSDGNEHQ LEQGSIQRLG EVDMAVLSFS SGSAYEVANV GDIKKVKHDQ PIYVAGFPLN NSQNLRYETG EVVANAEVGI DQGYQLLYDN TTVAGMSGGV LLNSDGDLVG LHGRGERDEQ ASSGELVMKT GVNQGVPITY YNLFASGAPV VVAKNTATTA DDYLAQAKAS QSRKGREQTV IKLTTQALAL RSSGGGYFLR AYAKKKLKDY QGAIADYSKA LEINPEDANT FNNRGNAKHG LGDYQGAISD YTKAIELDPQ HALAYDNRGY SKHDLKDYQA AIADYNKAIE IDPQYAIAYN NRGTAKDDLK DYQGAIADYN KAIELDPQHA FAFSNRGITK RNLGDTQGAI ADYNKAIEIN PQNAIAYNNR GLAKSNLGSY QEAIADCNKA IQIDPQYAGA YNSRGWIKYL QGDFQGALKD ANKALAIAPN DGATLDTRGL AKHALGQDRS ACKDLKRASS LGYQGTSQYL QSEEGAWCSN MR
|
| |