Gene Information |
Locus tag | Bcer98_0971 |
Symbol | |
ID | 5346539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cytotoxicus NVH 391-98 |
Kingdom | Bacteria |
Replicon accession | NC_009674 |
Strand | + |
Start bp | 1092955 |
End bp | 1093746 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 40% |
IMG OID | 640838569 |
Product | 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, HpaG1 subunit |
Protein accession | YP_001374297 |
Protein GI | 152974780 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) |
TIGRFAM ID | [TIGR02305] 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, N-terminal subunit |
|
|
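The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 1092955
END_BP = 1093746


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 792, matching "Gene Length"
print(protein_length(length_bp))  # 263, matching "Protein Length"
```

Under these assumptions both computed values agree with the table: 792 bp gives 264 codons, one of which is the stop codon.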
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGGAAAA TACAGTTTAA ATGTTATGGC CGATCCCAAA TAGAAGAAGC GGAGCTACAT ATAACAGAAG ATATGGTCAT ATGGAACGGA AAAGAGTATA AAAGTCACGA GCTTGCATTG GATATTCCAA CTTCAGGAAA TATTTACGGG ACATTGCTCA ATTATAAAGG AGCACTTGCC GCATTAGGAA ATTCGGTACA TGAATTGCCA TATAAACAAG CGCCAATGGC TCCCATTTTA TATATCAAAC CGATAAACAC AATGATTGCC CGGGGAATGC CGATTCCGTT ACCGAGTGAA GAAAGGGAGT TAGAAGTTGG GGCAGCACTA GGAATTGTCA TCGGAAAAAG AGCGACGAAA GTAAGGGAAG AAGAAGCGTT AACATATATT CAAGGATATA CGATTGTAAA TGACATCAGC ATACCTCATG AAAGCGTGTA TCGCCCAGCG ATTAAGCAAA AAGCACGCGA TGGATTTTGT CCAGTTGGCC CATGGGTGAT AGAGAAAGGG GCTATTCAAA ACCCAAATGA TGTAAGCATT CAAGTATATG TGAACGGTAT ATTGCGGCAA GAAAATCATA CGAAAAACTT AATTAGACCA GTGGAACGAC TTATCGCAGA TGTAACAGAA TTTATGACTT TATATGAAGG AGATATACTG CTTGTTGGTG TCCCGGAAAA TCCACCACTC GTAAAAAATG GAGACCGCAT TCGAATTGAA ATCGAAGGAA TTGGCAGCTT AGAAAATCAA GTCGTTTTAG AAAAAGAACT TGTGAGAGGA GGAGTACGAT GA
|
Protein sequence | MRKIQFKCYG RSQIEEAELH ITEDMVIWNG KEYKSHELAL DIPTSGNIYG TLLNYKGALA ALGNSVHELP YKQAPMAPIL YIKPINTMIA RGMPIPLPSE ERELEVGAAL GIVIGKRATK VREEEALTYI QGYTIVNDIS IPHESVYRPA IKQKARDGFC PVGPWVIEKG AIQNPNDVSI QVYVNGILRQ ENHTKNLIRP VERLIADVTE FMTLYEGDIL LVGVPENPPL VKNGDRIRIE IEGIGSLENQ VVLEKELVRG GVR
|
| |
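The sequences above are displayed in space-separated blocks of 10 characters. The following is a minimal sketch of how such a field might be cleaned and sanity-checked before further use. The helper names are hypothetical, and the stop codon set is the one used by translation table 11 (TAA, TAG, TGA). Only the first two and the last two blocks of the gene sequence are used as sample input here; pasting the full "Gene sequence" field in their place would allow the result of `gc_percent` to be compared against the reported GC content of 40%.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the whitespace separating the display blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


# Sample input: first two and last two blocks of the gene sequence above.
head = clean_sequence("ATGAGGAAAA TACAGTTTAA")
tail = clean_sequence("GGAGTACGAT GA")

print(head)                # blocks joined into one string
print(tail[-3:])           # final codon of the gene
print(tail[-3:] in STOP_CODONS)
```

The same `clean_sequence` helper applies to the protein sequence field, since it only strips whitespace.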