Gene Information |
Locus tag | P9303_21831 |
Symbol | |
ID | 4777794 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 1940907 |
End bp | 1942373 |
Gene Length | 1467 bp |
Protein Length | 488 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640087698 |
Product | putative neutral invertase-like protein |
Protein accession | YP_001018183 |
Protein GI | 124023876 |
COG category | |
COG ID | |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.28197 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCCGGAC GCTTAAACCA GCAAAACCAG AGGGTTCGTC CCAACTCCAA TGAAGACCAG GTGGTGCAGC AGGTCAAGGA GCATTTCGAA CGCACGCTGA TCGAAGTAGG CGGCACTGTG GCAGGCAGCG TGGCGGCTCT TGAGCATCAG CCCCATAACA AAGCTCTCAA TTACGGCGAG GTCTTCCTGC GCGACAACGT GCCCGTGATG ATCTACCTGC TCACCCAAAA GCGTTACAAG GAGGTGAAGC AATTCCTCAG CGTTTGCCTA GATCTGCAAA GCACCACATA TCAGACACGC GGGGTGTTCC CCACCAGCTT CGTTGAGGAA CAAGGCGAGC TCATTGCTGA CTATGGCCAG CGCTCAATCG GCAGGATCAC CTCTGTGGAC GCAAGCCTTT GGTGGCCCAT TTTGTGCTGG CTTTATGTCA AGCGCAGCGG TGACAAAAAC TTCGGCACCA ACCAAAAGGT CCAACGAGGT GTGCAGCTGA TGCTCGATCT CGTGCTGCAT CCCACCTTCG AGGGCACTCC AGTTCTATTT GTTCAGGATT GCTCATTCAT GATTGACCGC CCCATGGATG TTTGGGGGGC ACCACTTGAA GTGGAAGTAC TGCTCTATGC CTGCCTGCGC AGCTGCATCG AGCTGATGGA ACTGAGTCGC AAGAATCATG TCAGTCGGCT TCTCGACCAA CGTCTCTTAC TGACCCGCCA ATGGGTGCAT GACCTACGTC AATTCCTGCT CAAGCACTAC TGGGTCACCA GCAAGACCAT GCAGGTGCTG CGAAGAAGAC CCACAGAGCA ATACGGCGAC AACCAGCACC AAAACGAGTT CAACGTGCAG CCACAGGTCG TTCCCGACTG GCTTCAAGAC TGGCTTGAAA ACCGGGGTGG CTATCTCATC GGCAACATCC GCACCGGACG ACCTGACTTT CGTTTTTACA GTCTCGGAAA CTCCCTAGCC TGCCTGTTTG GGCTACTGAC CGCCCCACAA CAGCGCGCTC TGTTCCGACT GGTGCTTCAC AACCGACAGC ATCTGATGGC TCAGATGCCA ATGCGCATCT GTCATCCACC CATGGAAGGG GCGGAATGGC AGAACAAGAC AGGTTCAGAC CCGAAAAACT GGCCATGGAG CTATCACAAC GGCGGACACT GGCCCAGCCT GCTCTGGTTC TTCGGAGCCT CAATCTTGTT ACATGAACGG CGTCATCCTG AGGCCGACGT GCTGCTGATG GGTGAAATGC GCGCCCTACT GGAGGAGTGC TACTGGAGCC AATTAAACCA ACTGCCGCGT CAAAAATGGG CCGAATACTT TGATGGACCA ACAGGCACCT GGGTAGGACA GCAGTCAAGG ACCTACCAGA CCTGGACCAT GGTGGGCTTT CTACTTCTCC ACCACCTTCT ACGCGTCTGC CCCGACGACG TGTTGTGGTT GGATCTCGAC GAACTGCTGC CAACACATGA GGTGTAA
Protein sequence | MAGRLNQQNQ RVRPNSNEDQ VVQQVKEHFE RTLIEVGGTV AGSVAALEHQ PHNKALNYGE VFLRDNVPVM IYLLTQKRYK EVKQFLSVCL DLQSTTYQTR GVFPTSFVEE QGELIADYGQ RSIGRITSVD ASLWWPILCW LYVKRSGDKN FGTNQKVQRG VQLMLDLVLH PTFEGTPVLF VQDCSFMIDR PMDVWGAPLE VEVLLYACLR SCIELMELSR KNHVSRLLDQ RLLLTRQWVH DLRQFLLKHY WVTSKTMQVL RRRPTEQYGD NQHQNEFNVQ PQVVPDWLQD WLENRGGYLI GNIRTGRPDF RFYSLGNSLA CLFGLLTAPQ QRALFRLVLH NRQHLMAQMP MRICHPPMEG AEWQNKTGSD PKNWPWSYHN GGHWPSLLWF FGASILLHER RHPEADVLLM GEMRALLEEC YWSQLNQLPR QKWAEYFDGP TGTWVGQQSR TYQTWTMVGF LLLHHLLRVC PDDVLWLDLD ELLPTHEV
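The fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source record: the function names are hypothetical, and it assumes the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in permitted start codons), so a standard codon table is used here.

```python
# Illustrative consistency check for the record above.
# All helper names are hypothetical; values are copied from the record.

START_BP = 1940907
END_BP = 1942373

# Standard codon table in NCBI order (bases T, C, A, G). Translation
# table 11 uses the same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop."""
    return gene_len_bp // 3 - 1


def translate(dna):
    """Translate a DNA string, ignoring whitespace, up to the first stop."""
    dna = "".join(dna.split()).upper()
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1467, matching "Gene Length"
    print(protein_length(length))  # 488, matching "Protein Length"
    # First three 10-base groups of the gene sequence field:
    print(translate("ATGGCCGGAC GCTTAAACCA GCAAAACCAG"))  # MAGRLNQQNQ
```

Passing the full Gene sequence field to `translate` should reproduce the Protein sequence field once the spaces between 10-character groups are removed from both.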