Gene Information |
Locus tag | BCB4264_A0257 |
Symbol | |
ID | 7097609 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Bacillus cereus B4264 |
Kingdom | Bacteria |
Replicon accession | NC_011725 |
Strand | + |
Start bp | 226047 |
End bp | 227225 |
Gene Length | 1179 bp |
Protein Length | 392 aa |
Translation table | 11 |
GC content | 38% |
IMG OID | 643467813 |
Product | putative homogentisate 1,2-dioxygenase |
Protein accession | YP_002365069 |
Protein GI | 218231675 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG3508] Homogentisate 1,2-dioxygenase |
TIGRFAM ID | [TIGR01015] homogentisate 1,2-dioxygenase |
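The coordinate and length fields above are mutually consistent, which a short check can confirm. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that Protein Length excludes the stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information table above.
START_BP = 226047
END_BP = 227225
GENE_LENGTH_BP = 1179
PROTEIN_LENGTH_AA = 392


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Both checks pass for this record: 227225 - 226047 + 1 = 1179, and 1179 / 3 - 1 = 392.
assert gene_length(START_BP, END_BP) == GENE_LENGTH_BP
assert protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA
```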
| |
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000636938 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCATGT TTTATCGTCA CATGGGAGAA CTACCTCATA AACGACATGT ACAATTTCGT AAAAAAGATG GATCGCTTTA TCGTGAACAG GTAATGGGAA CAAAAGGTTT TTCTGGTACG CAGTCTATTT TGTATCATCA TTATATGCCG ACAGAAGTAG GTCATTCTGC ATTATCGCAT TCTTGTCAGT TGCAGTATGA AGAGGATGTT GCTCTTTCTC ATCGCCACTT TCGAACGAAA GAAAGTAAAA AAAGTGGTGA TGCAATAAGT GGACGGAATT TCATACTTGG AAATGAAGAT TTATTAATTG GAGTAGTGAG TCCAACAGAA AAAATGGATT ATTTCTATCG TAATGGTGAT GGCGACGAAA TGTTATTTGT TCATTATGGA ACAGGGAAAA TTGAAACGAT GTTTGGAACG ATTCACTATA GAAAAGGCGA CTATGTAACG ATTCCAATTG GAACAATTTA TCGTGTTATT CCAGATGAAG GAGAGACTAA GTTTCTTGTT GTAGAGGCGA ATAGCCAAAT TACAACGCCG CGTCGTTATC GAAATGAATA CGGACAATTG TTAGAGCATA GTCCGTTTTG TGAAAGAGAT CTTCGTGGTC CAGAAAAATT AGAGACATAT GATGAAAAAG GCGAGTTTGT CGTAATGACA AAATCAAGAG GCTATATGCA TAAACATGTT TTAGGACACC ACCCGTTAGA TGTTGTGGGA TGGGATGGCT ATTTATATCC GTGGGTATTT AATGTAGAGG ATTTTGAACC AATTACAGGG CGCATTCATC AGCCACCGCC AGTACATCAA ACATTTGAAG GACATAATTT TGTTATTTGC TCTTTCGTAC CACGTTTATA CGATTATCAT CCAGAGTCAA TTCCGGCACC ATATTATCAT AGTAATGTTA ATAGTGATGA AGTTCTTTAC TATGTAGAAG GAAACTTTAT GAGTCGCAAA GGTGTGGAAG AAGGTTCTAT TACACTTCAT CCGAGCGGGA TTCCCCATGG GCCGCATCCG GGGAAAACAG AGGCAAGTAT AGGGAAGAAA GAGACGCTTG AATTAGCTGT TATGATAGAC ACATTCCGTC CGCTTCGTAT TGTAAAACAA GCACATGAAA CAGAAGATGA AAAGTATATG TATAGCTGGA TTGAACAAGG TTCATATACT GTGAAATAA
Protein sequence | MGMFYRHMGE LPHKRHVQFR KKDGSLYREQ VMGTKGFSGT QSILYHHYMP TEVGHSALSH SCQLQYEEDV ALSHRHFRTK ESKKSGDAIS GRNFILGNED LLIGVVSPTE KMDYFYRNGD GDEMLFVHYG TGKIETMFGT IHYRKGDYVT IPIGTIYRVI PDEGETKFLV VEANSQITTP RRYRNEYGQL LEHSPFCERD LRGPEKLETY DEKGEFVVMT KSRGYMHKHV LGHHPLDVVG WDGYLYPWVF NVEDFEPITG RIHQPPPVHQ TFEGHNFVIC SFVPRLYDYH PESIPAPYYH SNVNSDEVLY YVEGNFMSRK GVEEGSITLH PSGIPHGPHP GKTEASIGKK ETLELAVMID TFRPLRIVKQ AHETEDEKYM YSWIEQGSYT VK
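The relationship between the two sequences above can be sketched as a codon-by-codon translation. This is a minimal illustration, not the database's own pipeline: it uses the standard codon assignments, which translation table 11 shares for elongation (table 11 differs in which codons may act as start codons, and this gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored so the 10-base blocks used in the record
    can be pasted in unchanged.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":  # stop codon: end of the protein
            break
        residues.append(amino_acid)
    return "".join(residues)


# First seven codons of the gene sequence above.
print(translate("ATGGGCATGT TTTATCGTCA C"))  # MGMFYRH
# Last three codons, ending in the TAA stop.
print(translate("GTGAAATAA"))  # VK
```

The two outputs match the first seven and last two residues of the listed protein sequence. Passing the full gene sequence to `translate` and comparing it with the protein sequence (spaces removed) extends the same check to the whole record.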