Gene pE33L466_0302 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagpE33L466_0302 
SymboliolD 
ID3399736 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_007103 
Strand
Start bp301311 
End bp303245 
Gene Length1935 bp 
Protein Length644 aa 
Translation table11 
GC content40% 
IMG OID637660126 
Productmyo-inositol catabolism protein 
Protein accessionYP_245790 
Protein GI67078170 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones35 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAACTG TTAGAATGAC GACGGCGCAA GCATTGGTGA AATTTTTGAA TCAACAGTAC 
ATAGAGTTTG ATGGAGAGCA ACAAAAGTTT ATTAAGGGGA TATTTACTAT TTTTGGTCAT
GGAAATGTAG TAGGACTTGG TCAAGCTTTA GAAGAAGACG CAGGAGAATT AGAAGTATAT
CAAGGTAGAA ATGAACAAGG AATGGCAAAT GCTGCGATGG CTTTTGCAAA ACAAAAACAT
AGAAAACAAA TTATGGCATG TACTTCTTCT GTGGGTCCTG GATCAGCAAA TATGATTACC
TCTGCAGCAA CAGCTTCTGC AAATAATATT CCCGTTTTAT TACTTCCAGG AGATGTATTT
GCAACGAGAC AACCTGATCC TGTTCTTCAA CAAATTGAAC AAACACATGA CTTATCTATT
TCTACAAATG ATGCTTTCCG TGCAGTAAGT AAGTACTGGG ACAGGATAAA TCGTCCTGAA
CAATTGATGA CAGCTATGAT TCAAGCAATG CGTGTTTTAA CGAATCCGGC AGATACAGGG
GCTGTAACAA TTTGCTTACC ACAAGATGTC CAAGGAGAAG CGTGGGATTT TCCAAGTTAC
TTCTTCCAAA AGTGCGTTCA CCGTATTGAG CGTCGTCTGC CTACAAGAGC CAGCTTAGCG
GATGCGGTTG AAATGATTAA GAGAAAGAAA AAGCCAGTTA TGATTTGCGG TGGGGGTGTA
AGATACGCAG AAGCGGCAGA GGAACTCAAA CAGTTCGCTG AAGCATTCCG TATTCCATTT
GGAGAAACAC AAGCTGGGAA AAGCGCGATT GAAAGCAGCC ATCCATACAA TCTTGGCGGC
ATTGGGGTAA CTGGGAATCT AGCAGCTAAT ACAATTGCAA AGGAAGCGGA TCTTGTTATT
GGGATTGGGA CGAGATTTAC TGATTTTACA ACGGCATCGA AACAATTATT TCAAAATGAG
GAAGTTGAGT TTGTAAACAT CAACATTTCA GAATTTCATG CGAACAAGCT TGATGCGTTG
AAGGTTATAG CAGATGCGAA AGAAGCACTT CTTGCTCTAA TAAATGAACT GCAAGCAATT
GAGTATCGAT CTAGTTACAC AGTAGAAATT GCTGCTGCAA AAGAGTTTTG GGAAACAGAA
TTAGCACGTT TACATAATAT TCGCTTTACA GGTCAAGATT TTAAACCAGA AGTTGAAGGT
CATTTTGATG ATAATTTAAA TGAGTATGTG GATGCGCTTG GTACACAATT AACACAGACT
GCAGTTATTG GAGAAATGAA CACATTACTT GATGAAGATG CAATTATCGT TGGTGCGGCA
GGAAGTCTTC CGGGTGATTT ACAAAGAATG TGGACATCGC GAAAACCAAA TACATACCAC
ATGGAGTATG GATATTCTTG TATGGGCTAC GAGGTTGCAG GAGCACTTGG TGCGAAGCTA
GCTGAGCCAT CGAAGGAAGT CTATGCGATG GTAGGGGATG GCAGTTACCA AATGCTTCAT
TCTGAGCTCG TCACAAGCCT TCAAGAAAAC AAAAAAATTA ACGTCTTATT GTTTGATAAC
TCCGGATTTG GTTGCATTAA TAACTTACAA ATGGGTAACG GAATGGGGAG CTTTGGAACA
GAGTTTCGCT ATCGAAACGA GGAAACTCGT AAGTTAAACG GGGCTATTAT GAAAATTGAT
TTTGCAGCTA GCGCGGCTGG ATACGGTGTG AAAACGTATC GTGTTACATC GGTGGAACAA
TTACAGGAAG CGCTTAAAGA TGCGAAAAAA CAAACGGTCT CTACATTGAT TGATATTAAA
GTATTACCAA AAACAATGAC AAATGGATAC GAGTCATGGT GGCATGTAGG TGTTGCAGAA
GTATCTAATA GTCAAAGTGT ACAAGCTGCA TATGAGAGTA AAGTAAGTAA CTTGCAAAAG
GCGAGATCTT ATTAG
 
Protein sequence
MQTVRMTTAQ ALVKFLNQQY IEFDGEQQKF IKGIFTIFGH GNVVGLGQAL EEDAGELEVY 
QGRNEQGMAN AAMAFAKQKH RKQIMACTSS VGPGSANMIT SAATASANNI PVLLLPGDVF
ATRQPDPVLQ QIEQTHDLSI STNDAFRAVS KYWDRINRPE QLMTAMIQAM RVLTNPADTG
AVTICLPQDV QGEAWDFPSY FFQKCVHRIE RRLPTRASLA DAVEMIKRKK KPVMICGGGV
RYAEAAEELK QFAEAFRIPF GETQAGKSAI ESSHPYNLGG IGVTGNLAAN TIAKEADLVI
GIGTRFTDFT TASKQLFQNE EVEFVNINIS EFHANKLDAL KVIADAKEAL LALINELQAI
EYRSSYTVEI AAAKEFWETE LARLHNIRFT GQDFKPEVEG HFDDNLNEYV DALGTQLTQT
AVIGEMNTLL DEDAIIVGAA GSLPGDLQRM WTSRKPNTYH MEYGYSCMGY EVAGALGAKL
AEPSKEVYAM VGDGSYQMLH SELVTSLQEN KKINVLLFDN SGFGCINNLQ MGNGMGSFGT
EFRYRNEETR KLNGAIMKID FAASAAGYGV KTYRVTSVEQ LQEALKDAKK QTVSTLIDIK
VLPKTMTNGY ESWWHVGVAE VSNSQSVQAA YESKVSNLQK ARSY