Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_27951 |
Symbol | mmsB |
ID | 4778819 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | - |
Start bp | 2461719 |
End bp | 2462624 |
Gene Length | 906 bp |
Protein Length | 301 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 640088318 |
Product | putative 3-hydroxyisobutyrate dehydrogenase |
Protein accession | YP_001018790 |
Protein GI | 124024483 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2084] 3-hydroxyisobutyrate dehydrogenase and related beta-hydroxyacid dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAAAG CTCAGCTAAG CCCAAGGTCT AGCCTTGCCT TTGTTGGCCT CGGCGCATTG GGTTTACCAA TGGCTGCCAA TCTGCTGGCT GCGGGCTATG AGCTCAAGGT TCACAGCCGC AGTCGTGCGG CTGAACTGAA TTTGGCTTTA AAAGGTAGCC GACCCTGCAG CTCACCAGCA GAAGCTGCAG CAGATAGCCA GGCTCTATTG ATTTGCGTTA GCGATGATGA AGCAGTAGAA GCGGTTCTTT TTGGTTCTCA GGGAGCAGCT TCTAAGCTCA GCTCAGGCGC CGTTGTGGTG GATTTCTCGA CGATCGCCCC AGCAACGTCG ATCACCCTTG CTGAGCGACT TGCTCAGCAA GGGGTGACCT ATCTGGATGC ACCGGTAACC GGAGGCACTG AAGGGGCCAG AGCAGGAACT CTTACCGTTC TAGTGGGTGG CAACACACAA GCTCTGGCAA GGGTGCAGCC ACTGCTGGAG GTGATCGGTG AGAGCATTCA TCACTTCGGT TCGGTGGGAC GGGGCCAGCA GGTGAAGGCG CTCAACCAAG TGCTGGTGGC AGGTAGCTAT GCCGCATTAG CAGAAGCAAT CGCGCTAGGG CAGCAGCTGG GTCTCCCTAT GCCGGAGGTG ATCACAGCCT TACAGCATGG TGCCGCAGGC TCATGGGCCC TTCAACACCG ATCAACAGCG ATGCTGGAAG ATCACTATCC GCTTGGCTTC AAGCTGGCTT TGCATCACAA GGATCTCGGC ATTGCCCTGG AAACGGCAGA ACGCGTGGGA CTACAACTGC CAATCACAAG CAAAGTGAAG TCCATGGAGG CAAACCTGAT TGAACTCGGC CATAGCGAGG AAGATGTATC AGTGCTGAGG CGGTGGTTTG ATCAGCAACA GGCAGAATTT CAGTAA
|
Protein sequence | MSKAQLSPRS SLAFVGLGAL GLPMAANLLA AGYELKVHSR SRAAELNLAL KGSRPCSSPA EAAADSQALL ICVSDDEAVE AVLFGSQGAA SKLSSGAVVV DFSTIAPATS ITLAERLAQQ GVTYLDAPVT GGTEGARAGT LTVLVGGNTQ ALARVQPLLE VIGESIHHFG SVGRGQQVKA LNQVLVAGSY AALAEAIALG QQLGLPMPEV ITALQHGAAG SWALQHRSTA MLEDHYPLGF KLALHHKDLG IALETAERVG LQLPITSKVK SMEANLIELG HSEEDVSVLR RWFDQQQAEF Q
|
| |