Gene Bcer98_0971 details

Gene Information

Locus tag: Bcer98_0971
Symbol:
ID: 5346539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cytotoxicus NVH 391-98
Kingdom: Bacteria
Replicon accession: NC_009674
Strand:
Start bp: 1092955
End bp: 1093746
Gene Length: 792 bp
Protein Length: 263 aa
Translation table: 11
GC content: 40%
IMG OID: 640838569
Product: 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, HpaG1 subunit
Protein accession: YP_001374297
Protein GI: 152974780
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0179] 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway)
TIGRFAM ID: [TIGR02305] 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, N-terminal subunit
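
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the stop codon counts toward the gene length but not the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 1092955
end_bp = 1093746

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and contributes no residue.
protein_length = gene_length // 3 - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the listed 792 bp and 263 aa.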


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGAAAA TACAGTTTAA ATGTTATGGC CGATCCCAAA TAGAAGAAGC GGAGCTACAT 
ATAACAGAAG ATATGGTCAT ATGGAACGGA AAAGAGTATA AAAGTCACGA GCTTGCATTG
GATATTCCAA CTTCAGGAAA TATTTACGGG ACATTGCTCA ATTATAAAGG AGCACTTGCC
GCATTAGGAA ATTCGGTACA TGAATTGCCA TATAAACAAG CGCCAATGGC TCCCATTTTA
TATATCAAAC CGATAAACAC AATGATTGCC CGGGGAATGC CGATTCCGTT ACCGAGTGAA
GAAAGGGAGT TAGAAGTTGG GGCAGCACTA GGAATTGTCA TCGGAAAAAG AGCGACGAAA
GTAAGGGAAG AAGAAGCGTT AACATATATT CAAGGATATA CGATTGTAAA TGACATCAGC
ATACCTCATG AAAGCGTGTA TCGCCCAGCG ATTAAGCAAA AAGCACGCGA TGGATTTTGT
CCAGTTGGCC CATGGGTGAT AGAGAAAGGG GCTATTCAAA ACCCAAATGA TGTAAGCATT
CAAGTATATG TGAACGGTAT ATTGCGGCAA GAAAATCATA CGAAAAACTT AATTAGACCA
GTGGAACGAC TTATCGCAGA TGTAACAGAA TTTATGACTT TATATGAAGG AGATATACTG
CTTGTTGGTG TCCCGGAAAA TCCACCACTC GTAAAAAATG GAGACCGCAT TCGAATTGAA
ATCGAAGGAA TTGGCAGCTT AGAAAATCAA GTCGTTTTAG AAAAAGAACT TGTGAGAGGA
GGAGTACGAT GA
 
Protein sequence
MRKIQFKCYG RSQIEEAELH ITEDMVIWNG KEYKSHELAL DIPTSGNIYG TLLNYKGALA 
ALGNSVHELP YKQAPMAPIL YIKPINTMIA RGMPIPLPSE ERELEVGAAL GIVIGKRATK
VREEEALTYI QGYTIVNDIS IPHESVYRPA IKQKARDGFC PVGPWVIEKG AIQNPNDVSI
QVYVNGILRQ ENHTKNLIRP VERLIADVTE FMTLYEGDIL LVGVPENPPL VKNGDRIRIE
IEGIGSLENQ VVLEKELVRG GVR
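
The gene and protein sequences above can be checked against each other, and against the listed GC content, with a short script. This is a minimal sketch using only the Python standard library: the codon table is the standard amino-acid assignment, which translation table 11 shares (the two differ in which codons may act as start codons), and the helper names `clean`, `translate`, and `gc_fraction` are illustrative rather than taken from any tool.

```python
from itertools import product

GENE = """
ATGAGGAAAA TACAGTTTAA ATGTTATGGC CGATCCCAAA TAGAAGAAGC GGAGCTACAT
ATAACAGAAG ATATGGTCAT ATGGAACGGA AAAGAGTATA AAAGTCACGA GCTTGCATTG
GATATTCCAA CTTCAGGAAA TATTTACGGG ACATTGCTCA ATTATAAAGG AGCACTTGCC
GCATTAGGAA ATTCGGTACA TGAATTGCCA TATAAACAAG CGCCAATGGC TCCCATTTTA
TATATCAAAC CGATAAACAC AATGATTGCC CGGGGAATGC CGATTCCGTT ACCGAGTGAA
GAAAGGGAGT TAGAAGTTGG GGCAGCACTA GGAATTGTCA TCGGAAAAAG AGCGACGAAA
GTAAGGGAAG AAGAAGCGTT AACATATATT CAAGGATATA CGATTGTAAA TGACATCAGC
ATACCTCATG AAAGCGTGTA TCGCCCAGCG ATTAAGCAAA AAGCACGCGA TGGATTTTGT
CCAGTTGGCC CATGGGTGAT AGAGAAAGGG GCTATTCAAA ACCCAAATGA TGTAAGCATT
CAAGTATATG TGAACGGTAT ATTGCGGCAA GAAAATCATA CGAAAAACTT AATTAGACCA
GTGGAACGAC TTATCGCAGA TGTAACAGAA TTTATGACTT TATATGAAGG AGATATACTG
CTTGTTGGTG TCCCGGAAAA TCCACCACTC GTAAAAAATG GAGACCGCAT TCGAATTGAA
ATCGAAGGAA TTGGCAGCTT AGAAAATCAA GTCGTTTTAG AAAAAGAACT TGTGAGAGGA
GGAGTACGAT GA
"""

PROTEIN = """
MRKIQFKCYG RSQIEEAELH ITEDMVIWNG KEYKSHELAL DIPTSGNIYG TLLNYKGALA
ALGNSVHELP YKQAPMAPIL YIKPINTMIA RGMPIPLPSE ERELEVGAAL GIVIGKRATK
VREEEALTYI QGYTIVNDIS IPHESVYRPA IKQKARDGFC PVGPWVIEKG AIQNPNDVSI
QVYVNGILRQ ENHTKNLIRP VERLIADVTE FMTLYEGDIL LVGVPENPPL VKNGDRIRIE
IEGIGSLENQ VVLEKELVRG GVR
"""

# Standard amino-acid assignments, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(text):
    """Remove the spaces and line breaks used to lay out the sequence."""
    return "".join(text.split())


def translate(dna):
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


gene = clean(GENE)
protein = clean(PROTEIN)
translated = translate(gene)

print(f"Gene length: {len(gene)} bp")
print(f"Translated length: {len(translated)} aa")
print(f"Matches listed protein: {translated == protein}")
print(f"GC content: {gc_fraction(gene):.1%}")
```

The translated length should agree with the 263 aa given under Gene Information, and the GC fraction should land close to the listed 40%, which is presumably a rounded figure.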