Gene BCG9842_B5068 details

Gene Information

Locus tag: BCG9842_B5068
Symbol:
ID: 7183662
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus G9842
Kingdom: Bacteria
Replicon accession: NC_011772
Strand:
Start bp: 218030
End bp: 219202
Gene Length: 1173 bp
Protein Length: 390 aa
Translation table: 11
GC content: 39%
IMG OID: 643548021
Product: putative homogentisate 1,2-dioxygenase
Protein accession: YP_002443765
Protein GI: 218895354
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG3508] Homogentisate 1,2-dioxygenase
TIGRFAM ID: [TIGR01015] homogentisate 1,2-dioxygenase
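
The start, end, gene length and protein length listed above agree with one another, and the check is simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(218030, 219202)
print(length, protein_length(length))  # 1173 390
```

With the values from this record, 219202 - 218030 + 1 gives 1173 bp, and 1173 / 3 - 1 gives 390 aa.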


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00282868
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 62
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTTATC GTCACATGGG AGAACTACCT CATAAACGAC ATGTACAATT TCGTAAAAAA 
GATGGATCGC TTTATCGTGA ACAGGTAATG GGAACAAAAG GTTTTTCTGG TACGCAGTCT
ATTTTGTATC ATCACTATAT GCCAACAGAA GTAGGTCATT CTGCATTATC GCATTCTTGT
CAGTTGCAGT ATGAAGAGGA TGTTGCTCTT TCTCATCGCC ACTTCCGAAC GAAAGAAAAT
AAAAAAAGTG GTGATGCAAT AAGTGGACGA AATTTCATAC TTGGAAATGA AGATTTGTTA
ATTGGAGTAG TGAGCCCAAC AGAAAAAATG GATTATTTCT ATCGTAATGG TGATGGCGAC
GAAATGTTAT TTGTTCATTA TGGAACAGGG AAAATTGAAA CGATGTTTGG AACGATTCAC
TATAGAAAAG GCGACTATGT AACGATCCCA ATTGGAACGA TTTATCGTGT TATTCCAGAT
GAAGGAGAGA CTAAGTTTCT TGTTGTAGAG GCGAATAGCC AAATTACAAC GCCGCGTCGT
TATCGAAATG AATACGGACA ATTGTTAGAG CATAGTCCGT TTTGTGAAAG AGATCTTCGT
GGTCCAGAAA AATTAGAGAC CTATGATGAA AAAGGCGATT TTGTCGTAAT GACAAAATCA
AGAGGTTATA TGCACAAACA TGTTTTAGGA CACCACCCGT TAGATGTTGT TGGATGGGAT
GGCTATTTGT ATCCGTGGGT ATTTAATGTA GAGGATTTTG AACCAATTAC AGGGCGCATT
CATCAGCCGC CGCCAGTACA TCAAACATTT GAAGGGCATA ATTTTGTTAT TTGCTCTTTC
GTACCACGTT TATACGATTA TCATCCAGAG TCAATTCCGG CACCATATTA TCATAGTAAT
GTTAATAGTG ATGAAGTTCT TTACTATGTA GAAGGAAACT TTATGAGTCG CAAAGGTGTG
GAAGAAGGTT CTATTACACT TCATCCGAGC GGGATTCCCC ATGGGCCGCA TCCGGGGAAA
ACAGAGGCAA GTATAGGGAA GAAAGAGACA CTTGAATTAG CTGTTATGAT AGACACATTC
CGTCCGCTTC GTATTGTAAA ACAAGCACAT GAAACAGAAG ATGAAAAGTA TATGTATAGC
TGGATTGAAC AAGGTTCATA TACTGTGAAA TAA
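
A GC content figure like the one reported for this gene is the fraction of G and C bases in the sequence. The following is a minimal sketch of that calculation, assuming the sequence is pasted in the 10-base grouped layout used here; `gc_content` is a hypothetical helper, and the example covers only the first 30 bases, not the whole gene.

```python
def gc_content(seq: str) -> float:
    """Fraction of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First three 10-base groups of the gene sequence, as laid out in the record.
first_30 = "ATGTTTTATC GTCACATGGG AGAACTACCT"
print(f"{gc_content(first_30):.0%}")  # 40%
```

Passing the full gene sequence to the same function would give the gene-wide value.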
 
Protein sequence
MFYRHMGELP HKRHVQFRKK DGSLYREQVM GTKGFSGTQS ILYHHYMPTE VGHSALSHSC 
QLQYEEDVAL SHRHFRTKEN KKSGDAISGR NFILGNEDLL IGVVSPTEKM DYFYRNGDGD
EMLFVHYGTG KIETMFGTIH YRKGDYVTIP IGTIYRVIPD EGETKFLVVE ANSQITTPRR
YRNEYGQLLE HSPFCERDLR GPEKLETYDE KGDFVVMTKS RGYMHKHVLG HHPLDVVGWD
GYLYPWVFNV EDFEPITGRI HQPPPVHQTF EGHNFVICSF VPRLYDYHPE SIPAPYYHSN
VNSDEVLYYV EGNFMSRKGV EEGSITLHPS GIPHGPHPGK TEASIGKKET LELAVMIDTF
RPLRIVKQAH ETEDEKYMYS WIEQGSYTVK
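
The protein sequence can be reproduced from the gene sequence by codon-by-codon translation. The sketch below is one way to do that, under two assumptions: translation table 11 assigns the same amino acid to each codon as the standard code (it differs in which codons may act as start codons, which does not matter here because the gene begins with ATG), and the input contains only A, C, G and T. `CODON_TABLE` and `translate` are illustrative names.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate DNA to protein, stopping at the first stop codon.

    Whitespace and case are ignored; a codon with a base other than
    A, C, G or T raises KeyError.
    """
    dna = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 33 bases of the gene sequence in this record.
print(translate("ATGTTTTATC GTCACATGGG AGAACTACCT"))       # MFYRHMGELP
print(translate("TGGATTGAAC AAGGTTCATA TACTGTGAAA TAA"))   # WIEQGSYTVK
```

The two outputs match the first and last ten residues of the protein sequence listed above.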