Gene Information |
Locus tag | Mkms_3858 |
Symbol | mhpB |
ID | 4611793 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Mycobacterium sp. KMS |
Kingdom | Bacteria |
Replicon accession | NC_008705 |
Strand | - |
Start bp | 4074026 |
End bp | 4074976 |
Gene Length | 951 bp |
Protein Length | 316 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 639793538 |
Product | 3-(2,3-dihydroxyphenyl)propionate dioxygenase |
Protein accession | YP_939841 |
Protein GI | 119869889 |
COG category | [S] Function unknown |
COG ID | [COG3384] Uncharacterized conserved protein |
TIGRFAM ID | |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0874943 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | 24 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTCACCA TGGCGCAGAT CGCCCTGTGC TGCACGTCGC ACAGTCCGCT ACTGAACCTT CCGGGACCGT CGCGGGAACT CCTTGACGAC ATCGGGTCGG CGTTGGCCGT CGCCCGTGAT TTCGTCACCG AATTCGACCC GGACCTGGTG GTCACCTTCT CGCCGGACCA CTACAACGGG TTCTTCTACA AGGTCATGCC CCCGTTCTGC GTCGGCACCT CCGCGCAGGG CGTCGGGGAC TACGGCACCC ACGCCGGACC GCTCGACGTG CCGGAGGACC TCGCGAACGA GCTGGCCACC GCAGTACTGG AAGCCGGTGT GGACGTGGCG ATCTCGGCGA GTATGGACGT CGACCACGGC ACCGTGCAGC CGCTGCAGAA TCTGTTCGGC GATGCGATGG CCCGTCCCGT CATCCCGGTG TTCATCAACT CCGTCGCCAC CCCGCTGGGC CCGCTGCGGC GCACCCGTGC GCTGGGTACG GCGATCGGAC GGTACCTCGC GACCCTCGAC AAGCGTGTCC TGGTGATCGG TTCGGGTGGG TTGTCCCACG ACCCGCCGGT GCCCACCCTG GCGACCGCCC CGCCGGCGGC GCTCGACCGC ATCGTGCACG GCGCACCGAT GAGCACCGAA CAGCGGATGG CCCGGCAGTC CGCGGTGATC GACGCGGCGC ACGCGTTCGC GCACGGGGAA AGCCCCCTGC AGCCGCTCAA TCCGGCGTGG GATGCGACGT TCCTGGAGAT CCTCGACGAG GGCTGGCTCT CGGATCTCGA CGGCTGGTCC AACGCGTTCA TCGCCCGCGA GGGTGGGAAC TCCGCACACG AGATCCGCAC CTGGGTAGCG GCTTTCGCGG CGCTGGCGGC CGGCGGTGAC TACCGCACCG GCCTGCGCTT CTACCGTGCC GCACCCGAGT TGATCGCGGG TTTCGCGATC CGGACGGCGG TGCTCGCGTG A
Protein sequence | MVTMAQIALC CTSHSPLLNL PGPSRELLDD IGSALAVARD FVTEFDPDLV VTFSPDHYNG FFYKVMPPFC VGTSAQGVGD YGTHAGPLDV PEDLANELAT AVLEAGVDVA ISASMDVDHG TVQPLQNLFG DAMARPVIPV FINSVATPLG PLRRTRALGT AIGRYLATLD KRVLVIGSGG LSHDPPVPTL ATAPPAALDR IVHGAPMSTE QRMARQSAVI DAAHAFAHGE SPLQPLNPAW DATFLEILDE GWLSDLDGWS NAFIAREGGN SAHEIRTWVA AFAALAAGGD YRTGLRFYRA APELIAGFAI RTAVLA
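The coordinate and length fields in this record are related by simple arithmetic, so they can be cross-checked. The following is a minimal sketch of such a check, not part of the database record itself. The function names (`span_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers introduced here for illustration. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
# Values copied from the record above.
START_BP = 4074026
END_BP = 4074976
GENE_LENGTH_BP = 951
PROTEIN_LENGTH_AA = 316


def span_length(start: int, end: int) -> int:
    """Length of a closed interval (assumes 1-based, inclusive coordinates)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (an assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """GC percentage of a nucleotide string; spaces between blocks are ignored."""
    bases = seq.replace(" ", "").upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return 100.0 * gc_count / len(bases)


# Cross-check the record's fields against each other.
coords_ok = span_length(START_BP, END_BP) == GENE_LENGTH_BP
protein_ok = expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
print("coordinates consistent with gene length:", coords_ok)
print("gene length consistent with protein length:", protein_ok)
```

To compare against the GC content field, pass the full gene sequence string from the record to `gc_percent`; the space-separated 10-base blocks can be pasted as-is because spaces are stripped.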