Gene pE33L466_0299 details

Gene Information

Locus tag: pE33L466_0299
Symbol: iolG
ID: 3399733
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Bacillus cereus E33L
Kingdom: Bacteria
Replicon accession: NC_007103
Strand:
Start bp: 297501
End bp: 298526
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 38%
IMG OID: 637660123
Product: myo-inositol 2-dehydrogenase
Protein accession: YP_245787
Protein GI: 67078167
COG category: [R] General function prediction only
COG ID: [COG0673] Predicted dehydrogenases and related proteins
TIGRFAM ID:
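
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming inclusive 1-based coordinates and a single terminal stop codon that is not counted in the protein length; the function names are hypothetical and not part of the database.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino acid count for a CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values copied from the record above.
START_BP, END_BP = 297501, 298526

gene_length = span_length(START_BP, END_BP)
print(gene_length)                           # 1026
print(expected_protein_length(gene_length))  # 341
```

Both results agree with the Gene Length and Protein Length fields of the record.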


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000146369
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGTTT TAACGATTGG GATTATTGGT GCTGGACGAA TTGGGAAACT GCATGTTGAT 
AATTTGCGGC TGATGCCACA AGTAAAAATT AAAGCAGTTT CAGATGTAGT AATCAGTCAT
CTAGAAAAGT GGGCTCAAGA TAAAGGGATT TCCACTCTGA CTACAAACTA TCAGGATTTA
TTAGCAGATC CAGAAATTGA TGCTGTATTT ATTTGTTCAC CAACAAATAC ACATGCGCAA
ATTATTAAAG AAGCGGCTCT TGCGAAAAAA CATATTTTCT GCGAAAAGCC TGTTAGTTTC
TCGGTAGAAG AAACATTAGA AGCATTAGAG GTGGTAAAAG AACAAGGAGT ATCTCTTCAA
GTAGGTTTTA ACCGCCGTTT CGATCCTAAC TTCAGAAAGG TCTATGATCT TATTCAACAA
GGAGAAGTGG GACAGCCACA TATTTTAAAA ATTACGTCTA GAGATCCACA ACCACCAAGT
ATAGAGTATG TTCGTTCTTC AGGTGGATTG TTTATGGATA TGATGATTCA TGACTTTGAT
ATGGCTAGGT ATGTGATGAA TAGTGAAGTT GTTGAAGTAT TTGCATATGG AACAACATTA
ATTGATCCGT CCATTCAGGA AGTAAATGAT GTTGATACAG CAATTGTCAC ATTGAAATTT
GCGAATGGAG CTTTAGGGGT AATTGATAAT AGCCGCCAAG CTGTTTATGG ATATGACCAG
CGTGTTGAAG TGTTTGGTGA AAAAGGCGCA GTCGCTGCGG AGAATTGCTG CCCGACAACA
GTACAAGTTT CAAAAACAGA AGGTGTTGTA AAAGATAAGC CGCTTTATTT CTTCTTAGAG
CGCTATACGC AGGCTTACAT TGAAGAAGTA ACACAATTTA CAAAGTCAAT TATAAAAGGA
CAAGCTGTTA TTTGCAGTGG TAATGATGGG TTACAAGCAG AACGAATTGC GAAAGCTGCC
AAGGAATCCT TACTAACAGG AAAACCCGTT CAAATTGAAC ATAAACAACC TGCATTAAAT
CAGTAA
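
The gene sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before computing anything from it. The following is a minimal sketch of how a GC percentage like the one reported in the record could be computed; the helper names are hypothetical, and the exact method the database uses (for example, its rounding) is an assumption.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence above, used as a short demo.
fragment = "ATGAATGTTT TAACGATTGG"
print(len(clean_sequence(fragment)))  # 20
print(gc_percent(fragment))           # 30.0
```

Passing the full gene sequence to gc_percent gives the value for the whole gene.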
 
Protein sequence
MNVLTIGIIG AGRIGKLHVD NLRLMPQVKI KAVSDVVISH LEKWAQDKGI STLTTNYQDL 
LADPEIDAVF ICSPTNTHAQ IIKEAALAKK HIFCEKPVSF SVEETLEALE VVKEQGVSLQ
VGFNRRFDPN FRKVYDLIQQ GEVGQPHILK ITSRDPQPPS IEYVRSSGGL FMDMMIHDFD
MARYVMNSEV VEVFAYGTTL IDPSIQEVND VDTAIVTLKF ANGALGVIDN SRQAVYGYDQ
RVEVFGEKGA VAAENCCPTT VQVSKTEGVV KDKPLYFFLE RYTQAYIEEV TQFTKSIIKG
QAVICSGNDG LQAERIAKAA KESLLTGKPV QIEHKQPALN Q
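
The protein sequence corresponds to a translation of the gene sequence with translation table 11. The following is a minimal sketch of that translation without external libraries. Table 11 assigns the same amino acids to codons as the standard code and differs in its permitted start codons; because this gene starts with ATG, the sketch does not special-case alternative starts. It handles only unambiguous A, C, G, and T bases, and the names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons, ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Start of the gene sequence above; matches the start of the protein.
print(translate("ATGAATGTTT TAACGATTGG G"))  # MNVLTIG
```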