Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_28681 |
Symbol | |
ID | 4776250 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2538048 |
End bp | 2539667 |
Gene Length | 1620 bp |
Protein Length | 539 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 640088391 |
Product | hypothetical protein |
Protein accession | YP_001018863 |
Protein GI | 124024556 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [R] General function prediction only |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.71597 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCGCC GCACTAGTGC AATTGCTGCT GCCTTATCGC TGCTGCCAAT AGGACAACCA CTGCTATTGG GCACCCTTGG CATCACAACA GCAACCACTG CAGTCGTTCT TCAACAGACA CCAGCAATTG CTCAAGATGC TTCTGCTGTT GCACGTATCG CCAAGGCAAT CACTGTTCGC ATAGAAGGTG CCACCCAAGG TTCAGGGGTG CTCGTCAAGC AAGAAGGCAA TCGCTACACG GTGCTCACGG CATGGCATGT AGTCAGTGGC AATAGACCAG GAGAAGAGGT TGGGATCTAT ACCTCTGATG GGAATGAGCA CCAACTAGAG CAAGGCAGCA TCCAAAGGTT GGGAGAGGTT GATATGGCAG TGCTCTCCTT CTCTAGTGGC AGTGCTTATG AGGTTGCTGA AGTCGGAGAC GTCAAAAAGG TCAAGCATGA TCAACCGATT TATGTGGCAG GTTTTCCTTT AAATAACTCA CAAAACCTTC GCTATGAAAC TGGAGAGGTT GTTGCTAATG CAGAAGTAGG AATTGATCAG GGTTATCAAC TGCTATACGA CAACGAAACA GTCGCTGGAA TGAGTGGAGG CGTGCTGCTT AATGCTGATG GAGATTTGGT GGGACTTCAT GGCAGGGGAG AGAAAGATGA ACAGGCATCA AGTGGTGAGT TAGTAATGAA GACAGGAGTT AATCAAGGCG TGCCAATTAC TTACTACAAC CTCTTTGCAA GTGGTGCTCC TGTTGTTGTT GCCAAGAACA CTGCAACCAC TGCTGATGAC TATCTGGCGC AAGCAAAAGC ATCCCAGTCA AGGAAGGGAA GAGAACAGAC AGTTATTAAG TTAACAACCC AGGCATTAGC ATTGCGATCC AGTGTGGAGG GATACTTTCT TCGTGCTTAT GCCAAGTATG ACTTAAGAGA TTATCAAGAA GCAATTGCTG ATTACACAAA GACAATAGAG ATTCATCCGC AGAACACCGT TTCCTACAAT AACCGTGGTA ATGCCAAGCA GAAATTAAAA GATCATCAAG GGGCAATTGC TGATTTCAAC AAGGCAATAG CAATTGATCC GCAAAATCAC ACTGCCTACA CCAACCGCGG TAGTGCCAAG GATGATTTAG GAGATTATCA AGGGGCAATT GCTGATTACA ACAAGGCAAT AGCAATTAAT CCGCAGGATG ACGCTGCCTA CAACAACCGT GGTAATGCTA AGCAGAAATT AAAAGATCAT CAAGGAGCAA TTTCTGATTA CAGCAAGGCA ATTGCAATTA ATCCGCAGAA TGCCATTTCC TACACCAACC GTGGTAATAC CAAGGATGAT TTAGGAGATT ATCAAGGAGC AATTGCTGAT TTCAACAAGG CAATAGAAAT TAAACCAGAT TCTGCAAATG CCTACAACAA CCGTGGTAAT GCCAAGGATG ATTTAGGAGA TCATCAAGGG GCAATTGCTG ATTACAACAA GGCAATAGAG ATTAATCCGC AGGATGCCGT TTCTCACGCT AATCGTGGTA TTGCCAAGGA ATTAGTTGGA GACCTCAAAG GTGCTTGTGC TGATTGGAGA AAGGCATCCT CGCTAGGTGT TCAAGTTGTT GCTAGTTGGG TAAGAAAGCA ATGCCAATAA
|
Protein sequence | MSRRTSAIAA ALSLLPIGQP LLLGTLGITT ATTAVVLQQT PAIAQDASAV ARIAKAITVR IEGATQGSGV LVKQEGNRYT VLTAWHVVSG NRPGEEVGIY TSDGNEHQLE QGSIQRLGEV DMAVLSFSSG SAYEVAEVGD VKKVKHDQPI YVAGFPLNNS QNLRYETGEV VANAEVGIDQ GYQLLYDNET VAGMSGGVLL NADGDLVGLH GRGEKDEQAS SGELVMKTGV NQGVPITYYN LFASGAPVVV AKNTATTADD YLAQAKASQS RKGREQTVIK LTTQALALRS SVEGYFLRAY AKYDLRDYQE AIADYTKTIE IHPQNTVSYN NRGNAKQKLK DHQGAIADFN KAIAIDPQNH TAYTNRGSAK DDLGDYQGAI ADYNKAIAIN PQDDAAYNNR GNAKQKLKDH QGAISDYSKA IAINPQNAIS YTNRGNTKDD LGDYQGAIAD FNKAIEIKPD SANAYNNRGN AKDDLGDHQG AIADYNKAIE INPQDAVSHA NRGIAKELVG DLKGACADWR KASSLGVQVV ASWVRKQCQ
|
| |