Gene pE33L466_0134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagpE33L466_0134 
Symbol 
ID3399633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBacillus cereus E33L 
KingdomBacteria 
Replicon accessionNC_007103 
Strand
Start bp139337 
End bp140779 
Gene Length1443 bp 
Protein Length480 aa 
Translation table11 
GC content39% 
IMG OID637659968 
Productglucose-methanol-choline (GMC) oxidoreductase 
Protein accessionYP_245632 
Protein GI67078012 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2303] Choline dehydrogenase and related flavoproteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTGGGTTT ACCCAAATAA TTGGATTCCA ACTAAATCCG TGGAACAAAT GGCAAATACA 
ACATATGATG TTCTAATTGT TGGAAGTGGT GCTGGTGGGG GAGCAATGCT TTATCGATTA
TGCGAAATGT GGAAAAAACA AGGTGCTAAA CGCATTGGTA TACTAGAAAA AGGGGATAAA
TTATTCCACT CGCACGCTTT AAATATTCCC ACTATGAATG ATGATCGAAT GCGTACTCAA
TTATGGCCGG ATAATTCCAC TCCTGTTGGT GAACGTTTAC CTGAGTTTTC TGGTGCTAGA
CATACGTATG CTCTTGGTGG AAGAACCCTT TTCTGGAATG CAGGTTGCCC ACGTCCACCG
CTTTTTGAAA TACAAAAGTG GCCTGTTAAC CCGGGGGAAA TGGATGTATA TTATCGATTA
GCAGAGAAAG TAATGAATGT CACAACTTCT TATGCAAAAG GGTCATCTAT GCAAAGTGTT
TTTATGAGAA GATTGTTGTC AAAGGGAATT CCAGAAGCAA CAGATTTTCC ACTTGCTTGT
GATATGAGAG GTCCTCGTTT TGGAGAAATA CATTCTAATT CTTGGTTTAG TTCGGTTTAC
TTTTTAGCTT ATGCATTAAA CGATCGCCCA TATGATTTAG CATTAAATGC ATATACTAAT
AGGGTCTTTA TTGAGAATGG AAAGGCAACA GGGGTTAATG TGATTACTAA TGATCAAAAA
ACATACAACA TCCGTGCAAA AAATGTGGTA GTTTCTGCTA ATACATTGGA AACACCAAGA
ATCTTACTCA ATTCAGGTAT TAGTAATCAA CCAATAGGAC GTTATTTGAC CAACCATTCA
TTCCTTCTTG GAACTGGAAA AATAAGCCGA AAACAGTTCG CTGATAATGT CGGTAACTTA
GCAATTTTAA AGCGAGAAAC CAAAAGAGCT CCTTATCAAA TCCAAATATT AGGACCTGGT
CAGTATTATT CATACCAACA ATTTGAAGAA AAAATACTTT CTGAAGAATT ACCTATAATT
TTTGCAACCT TTGGTAGAGT AGAACCCCGT CCTGAAAACA GAGTTTTTAT AGACCCTTCT
GCGCGAGATC AATATGGTGT GCCACTGAAT CAAGTTAGCT TTTCCTATAA CGATAGAGAC
AAGGCAGTGA TGAACCAAAT GCGCCAAGGC ATTATACAAT CAGCTCAATC TATGGAAGTA
AAGTTAGATG GTGAACCTAC TTTATATCCA CCAGGATCTG ATGTTCATGA GTCTTGTACA
TGTCGAATGG GTAATGATCC TGGAACTTCA GCAACTAACC GTTTTGGTCA AATTCATGGA
GTTCAGGGAC TTTATGTAGC GGATAATAGC GTTCTTCCAT CATTGGCGGC TGCTAACCCG
ACTCTTTCGA CTGTAGCTTT AGCAATAAGA ACGGCGGATT ATATTGTTCG ACAAAGTGGT
TAA
 
Protein sequence
MWVYPNNWIP TKSVEQMANT TYDVLIVGSG AGGGAMLYRL CEMWKKQGAK RIGILEKGDK 
LFHSHALNIP TMNDDRMRTQ LWPDNSTPVG ERLPEFSGAR HTYALGGRTL FWNAGCPRPP
LFEIQKWPVN PGEMDVYYRL AEKVMNVTTS YAKGSSMQSV FMRRLLSKGI PEATDFPLAC
DMRGPRFGEI HSNSWFSSVY FLAYALNDRP YDLALNAYTN RVFIENGKAT GVNVITNDQK
TYNIRAKNVV VSANTLETPR ILLNSGISNQ PIGRYLTNHS FLLGTGKISR KQFADNVGNL
AILKRETKRA PYQIQILGPG QYYSYQQFEE KILSEELPII FATFGRVEPR PENRVFIDPS
ARDQYGVPLN QVSFSYNDRD KAVMNQMRQG IIQSAQSMEV KLDGEPTLYP PGSDVHESCT
CRMGNDPGTS ATNRFGQIHG VQGLYVADNS VLPSLAAANP TLSTVALAIR TADYIVRQSG