Gene Information |
Locus tag | BcerKBAB4_0221 |
Symbol | |
ID | 5840245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus weihenstephanensis KBAB4 |
Kingdom | Bacteria |
Replicon accession | NC_010184 |
Strand | + |
Start bp | 230362 |
End bp | 231534 |
Gene Length | 1173 bp |
Protein Length | 390 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 641375362 |
Product | homogentisate 1,2-dioxygenase |
Protein accession | YP_001643117 |
Protein GI | 163938233 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
|
|
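The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon. The record does not state either convention, and the helper names `feature_length` and `protein_length` are hypothetical.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1  # drop the terminal stop codon


# Values copied from the record above.
gene_len = feature_length(230362, 231534)
print(gene_len)                   # 1173
print(protein_length(gene_len))   # 390
```

Under these assumptions the computed values agree with the Gene Length (1173 bp) and Protein Length (390 aa) fields.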
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTTATC GTCACATGGG GGAGCTACCT CATAAACGAC ATGTACAATT TCGTAAAAAA GATGGATCGC TTTATCGTGA GCAGGTAATG GGGACAAAAG GTTTCTCTGG TACCCAATCT ATTTTGTATC ATCATTATAT GCCAACAGAA GTAGGGCATG CGGCATTATC GCATTCTTGT CAGTTACAGT ATGAAGAGGA TGCCCTTCTT GCTCATCGCC ACTTCCGTAC GAAAGAAAAT AAAAAAACTG GTGATGCAGT AAGCGGAAGA AACTTTATGC TTGGGAATGA GGATTTATTA ATTGGAGTAG TGAGTCCAAC AGAAAAAATG GATTATTTTT ACCGTAATGG TGATGGCGAT GAAATGTTAT TTGTTCATTA TGGAACAGGA AAAATTGAAA CGATGTTTGG AACGATTCAC TATAGAAAAG GTGATTATGT AACAATTCCA ATTGGAACGA TTTATCGTGT TATTCCAGAT GAGGGAGAGA CTAAGTTTCT TGTTGTAGAG GCAAATAGCC AAATTACAAC ACCAAGTCGC TATCGCAATG AATATGGGCA ATTGTTAGAA CATAGTCCGT TTTGTGAAAG AGATATGCGT GGACCGGAGA AATTGGAGAC ATATGATGAA AAAGGTGAGT TTGTCGTAAT GACAAAGTCG AGAGGTTATA TGCACAAGCA TGTGTTAGGG CACCATCCGT TAGATGTTGT TGGATGGGAT GGGTATTTAT ATCCGTGGGT CTTTAATGTG GAGGATTTTG AACCAATTAC AGGTCGTATT CATCAGCCAC CACCAGTACA TCAAACATTC GAAGGGCATA ATTTCGTCAT TTGCTCTTTC GTACCGCGTT TATACGATTA TCATCCAGAA TCTATTCCGG CACCATATTA TCATAGTAAT GTGAATAGTG ATGAGGTACT ATACTATGTA GAAGGTAATT TTATGAGCCG AAAAGGTGTA GAAGAAGGTT CTATTACGCT TCATCCAAGC GGGATCCCGC ATGGGCCACA TCCAGGGAAA ACAGAAGCGA GTATAGGGAA GAAAGAGACA CTTGAACTGG CTGTTATGAT AGATACATTC CGTCCGCTTC GTATTGTAAA ACAAGCACAT GAGACGGAAG ATGAAAAATA TATGTATAGC TGGATTGAAG AGGGTTCATA TACTGTGAAA TAA
|
Protein sequence | MFYRHMGELP HKRHVQFRKK DGSLYREQVM GTKGFSGTQS ILYHHYMPTE VGHAALSHSC QLQYEEDALL AHRHFRTKEN KKTGDAVSGR NFMLGNEDLL IGVVSPTEKM DYFYRNGDGD EMLFVHYGTG KIETMFGTIH YRKGDYVTIP IGTIYRVIPD EGETKFLVVE ANSQITTPSR YRNEYGQLLE HSPFCERDMR GPEKLETYDE KGEFVVMTKS RGYMHKHVLG HHPLDVVGWD GYLYPWVFNV EDFEPITGRI HQPPPVHQTF EGHNFVICSF VPRLYDYHPE SIPAPYYHSN VNSDEVLYYV EGNFMSRKGV EEGSITLHPS GIPHGPHPGK TEASIGKKET LELAVMIDTF RPLRIVKQAH ETEDEKYMYS WIEEGSYTVK
|
| |
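The sequences above are printed in space-separated blocks of ten, so the spacing has to be removed before any processing. The following is a minimal sketch of that step, translating only the first 30 bases of the gene sequence. The codon table is deliberately partial (it covers just the codons in this excerpt, which have the same meaning in translation table 11 as in the standard code), and the names `clean` and `translate` are hypothetical helpers, not part of any database tool.

```python
# Partial codon table: only the codons that occur in the excerpt below.
CODONS = {
    "ATG": "M", "TTT": "F", "TAT": "Y", "CGT": "R", "CAC": "H",
    "GGG": "G", "GAG": "E", "CTA": "L", "CCT": "P",
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 1; raises KeyError on unknown codons."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks of the gene sequence, copied from the record.
excerpt = "ATGTTTTATC GTCACATGGG GGAGCTACCT"
print(translate(excerpt))  # MFYRHMGELP
```

The result matches the first block of the protein sequence. Translating the full gene would need a complete codon table for translation table 11, for example from an established bioinformatics library.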