Gene Information |
Locus tag | P9303_03401 |
Symbol | |
ID | 4778547 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 347511 |
End bp | 349448 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640085843 |
Product | hypothetical protein |
Protein accession | YP_001016357 |
Protein GI | 124022050 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4252] Predicted transmembrane sensor domain |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.689704 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGAAGCTCA GCCAGCGCAT CCGCGACGGT CTCGTTCAAG CCGGGCTGAT CGGCGCCGCG GCCCTGTTCC TGGGGGGACT GTCGACAACG GGAATCAGTG CATCGATCGA TTGGCTGCTG TACGACAGCG TCATCACCCT GCGTTCGCGC GATTCCGCCC AACGACATCC GGTGACGATC GTCGGAATCG ACGAAGACGA CATCAGCCAC TACGGATGGC CGATCGACGA TGCCGTGCTC TGTCGTGCCC TTCGGAACGC CCTGCAAGCC AACGCCAGTG CTATCGGCCT GGATCTCTAC CGCGACCAGG GAATTGGTCC GCAGCAGAGC TGTCTGCCGG AGCTGATCCG GCAAAACTCC GAGATCGTGG CCATCTTCAA CGCTGCCGAG GGCATCACCG CTCCACCGGG GACACCTGCC GCCCAGCAGG CCTTCAATGA CCTGGTGGTC GATGCCGATG GCGTAATCCG TCGGGACCTG ATCCATGTCA GCGGTCAGGA CGCCGCCACC GTGAGCCTGC CGGTCCGACT GATCGAGACA TCCGGTCTGC AGCCCGGCCT GCTCGATCTG CTGAAGAAGC CAGACAGAGC AGAACAACTC GGACCATGGC TGCTGCCCCA TTCCGGGGGA TACCGCGACC TCGATGCTGC CGGTTACCAG CGACTGCTGC CGTTTCATCA ACCCGGGAGC TTCCGCACCA TCAGCCTGCG TACGCTGGCT GATGGCAAAT GGGCTGCAGA GGCCCTGCAG CAGGGAGACA TTGTGCTGCT CGGGAGCACC GCGCCCAGCC TGAAAGATCT GTTCGAAATC CCGCACAGCC GTTTCAGCCA AAGCAACAAA TTCCTCATGC CCGGGGTGGA AGTGCATGCT CTGCGCGTCG CCGCCCTGCT GAATGGTCTG GATCAGCCAT GGACGCTCCG AACACTGCCG CCCTGGAACG AGCAGGGCCT GGAGCTGATC GCCATCCTCG TCGGGATCAG CCTTGGTGCC AGCTGCAGCA AGCTGCAACG CAGCATCACG ATCACCACAG TGCTAACGGT CGTTCTGGCG GGCTGCGGTG CCGCTTTGCT CTGGACGCAA GGACTTTGGA TTGGCCTCAC CCTGCCGGTG ATCTCCCTAC CCGTGATGGC TGGGGTGGGT TGGCTGCGCC GCGGAGCGCT CCTGCAACGT CAAAAACAAC AAATCGAACG CCTGCTGGGC CAAACCACCT CCCCTGCCGT GGCACAACAG TTGTGGGAAC AACGCGACTC CCTCCTGCGG GATGGCCAGT TCGAAGGGAA GCAGGTCACC GCAACAGTGT TGTTCACCGA CACCCAAGAC TTCACCAGCA TTTCCGAACA GCTGTCACCC TCCGAGCTGC TGACATGGCT CAACCGTGGC ATGAGCCTGT TGGTGCAGGA GATCACCAAC CACGGCGGCA TCATCAATAA ATTCACCGGT GATGGACTTC TGGCGGTTTT TGGAGCACCG ATCAGCCAGG GAATGGCCGT GGATGCAGGC CATGCGATTG ATGCGTCTTT GGCGATTACG GCTCGGCTAG CCGAGCTCAA TCAAGCATTG AAACTGGAGC AGGCGCCAGC CATGCGCATG CGGATCGGCA TTCATTCCGG TCCGGTGATC GCCGGCTCGA TGGGAAGCAG CGCGCGGCTG GAATTCACGG TGATGGGGGA CACGGTGAAT TGCGCGTCGC GGTTGGAAAG CCTGGCCAGG GTTCCAGCCG ACGACAGCTG CCGCACCCTC TTCAGCCAAG AGACCCTGAT GCGGTGTGAG CGCGACGACC TGCTCTGGCA TTCGGTGGGG CGATTGCAGG TGAAAGGGCG TCAGCAAGAG CTGGACGTTC TCGAACTCAA GGGCACCAAA CCAGCCGCCA ATGTCAGAAC AGGCAGCGCA CCGGCAGACG ATCGGGCCAG GAGCGCAGAT CAAGAGCTGC CAGGTTGA
|
Protein sequence | MKLSQRIRDG LVQAGLIGAA ALFLGGLSTT GISASIDWLL YDSVITLRSR DSAQRHPVTI VGIDEDDISH YGWPIDDAVL CRALRNALQA NASAIGLDLY RDQGIGPQQS CLPELIRQNS EIVAIFNAAE GITAPPGTPA AQQAFNDLVV DADGVIRRDL IHVSGQDAAT VSLPVRLIET SGLQPGLLDL LKKPDRAEQL GPWLLPHSGG YRDLDAAGYQ RLLPFHQPGS FRTISLRTLA DGKWAAEALQ QGDIVLLGST APSLKDLFEI PHSRFSQSNK FLMPGVEVHA LRVAALLNGL DQPWTLRTLP PWNEQGLELI AILVGISLGA SCSKLQRSIT ITTVLTVVLA GCGAALLWTQ GLWIGLTLPV ISLPVMAGVG WLRRGALLQR QKQQIERLLG QTTSPAVAQQ LWEQRDSLLR DGQFEGKQVT ATVLFTDTQD FTSISEQLSP SELLTWLNRG MSLLVQEITN HGGIINKFTG DGLLAVFGAP ISQGMAVDAG HAIDASLAIT ARLAELNQAL KLEQAPAMRM RIGIHSGPVI AGSMGSSARL EFTVMGDTVN CASRLESLAR VPADDSCRTL FSQETLMRCE RDDLLWHSVG RLQVKGRQQE LDVLELKGTK PAANVRTGSA PADDRARSAD QELPG
|