Gene ECH74115_3605 details


Gene Information

Locus tag: ECH74115_3605
Symbol: oxc
ID: 6971322
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli O157:H7 str. EC4115
Kingdom: Bacteria
Replicon accession: NC_011353
Strand:
Start bp: 3322505
End bp: 3324199
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 47%
IMG OID: 643387400
Product: putative oxalyl-CoA decarboxylase
Protein accession: YP_002271859
Protein GI: 209399559
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID: [TIGR03254] oxalyl-CoA decarboxylase
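
The coordinate and length fields above can be cross-checked with simple arithmetic. The Python sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that contributes 3 bp but no residue. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3322505
end_bp = 3324199
gene_length_bp = 1695
protein_length_aa = 564

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: one terminal stop codon adds 3 bp but no amino acid.
codons = gene_length_bp // 3
residues = codons - 1

print(span_bp == gene_length_bp)       # coordinates agree with Gene Length
print(gene_length_bp % 3 == 0)         # length is a whole number of codons
print(residues == protein_length_aa)   # codons minus stop agree with Protein Length
```

Under these assumptions all three checks print True: 3324199 - 3322505 + 1 = 1695 bp, and 1695 / 3 = 565 codons, one of which is the stop.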


Plasmid Coverage information

Num covering plasmid clones: 30
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.376687
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGATC AACTTCAAAT GACAGATGGT ATGCATATCA TCGTTGAAGC ATTAAAACAG 
AATAATATTG ACACTATTTA TGGTGTTGTA GGTATTCCTG TGACGGATAT GGCGCGCCAT
GCCCAGGCGG AAGGCATTCG TTATATTGGT TTTCGTCATG AGCAGTCGGC AGGCTATGCC
GCTGCGGCAA GCGGTTTTCT TACCCAAAAA CCAGGGATCT GCCTGACAGT GTCTGCGCCA
GGATTCCTCA ATGGTTTGAC CGCATTGGCC AACGCAACGG TAAATGGTTT TCCGATGATC
ATGATTAGCG GCTCCAGCGA CCGCGCGATC GTCGACCTAC AGCAAGGTGA TTATGAAGAG
CTGGACCAAA TGAATGCGGC AAAACCGTAT GCCAAAGCAG CATTTCGCGT TAATCAGCCG
CAGGATCTTG GCATTGCATT GGCACGCGCT ATCCGGGTCT CAGTATCGGG TCGCCCTGGC
GGAGTTTATC TTGATTTGCC AGCAAATGTC CTGGCCGCGA CGATGGAAAA AGACGAAGCG
TTAACCACGA TTGTAAAAGT TGAAAATCCG TCGCCAGCAT TATTGCCATG CCCGAAGTCA
GTCACTAGCG CAATTTCGCT TTTAGCAAAA GCTGAACGGC CATTAATTAT CCTTGGCAAA
GGCGCGGCGT ATTCACAAGC TGACGAACAG CTTCGTGAAT TTATTGAAAG TGCCCAGATT
CCATTCCTGC CAATGTCTAT GGCGAAAGGG ATCCTCGAAG ATACGCATCC ACTTTCTGCG
GCAGCTGCGC GTTCGTTTGC CCTGGCAAAT GCTGACGTTG TCATGCTTGT TGGTGCACGA
CTGAATTGGT TACTGGCACA CGGTAAAAAA GGATGGGCGG CAGATACACA GTTTATTCAA
CTGGATATTG AACCGCAGGA AATTGACAGC AACCGCCCCA TTGCTGTGCC AGTCGTTGGC
GATATTGCAT CCAGTATGCA AGGTATGCTG GCAGAGCTGA AACAAAACAC ATTTACGACT
CCACTGGTAT GGCGCGATAT TTTAAATATC CACAAGCAGC AAAATGCACA AAAAATGCAT
GAAAAATTAA GTACAGATAC CCAACCATTA AATTACTTTA ATGCATTAAG TGCTGTGCGC
GATGTATTGC GTGAGAACCA GGATATTTAT TTAGTTAATG AAGGTGCAAA TACCCTGGAT
AATGCACGAA ATATTATTGA TATGTATAAA CCACGTCGTC GTCTGGATTG TGGTACCTGG
GGTGTCATGG GCATCGGTAT GGGCTATGCC ATCGGTGCTA GCGTGACCTC TGGTTCTCCG
GTTGTCGCCA TTGAAGGTGA TAGTGCTTTT GGTTTCAGTG GGATGGAAAT TGAAACGATT
TGTCGATATA ACCTGCCGGT AACGATCGTT ATTTTTAATA ATGGCGGCAT CTACAGAGGA
GACGGTGTTG ATCTCAGTGG CGCTGGTGCA CCATCACCAA CAGATCTGTT GCACCATGCA
AGGTATGACA AATTAATGGA TGCGTTTCGT GGCGTTGGCT ATAACGTCAC CACGACAGAT
GAACTTCGTC ATGCTTTAAC CACCGGTATT CAGTCGCGCA AACCGACCAT TATTAATGTG
GTCATCGACC CTGCAGCAGG AACTGAAAGT GGCCATATTA CCAAACTTAA CCCAAAACAA
GTCGCTGGTA ATTAA
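
The GC content field in this record is presumably the share of G and C bases in the gene sequence. A minimal Python sketch of that calculation follows, assuming the sequence is pasted exactly as printed, in whitespace-separated blocks of 10 bases. `gc_percent` is a hypothetical helper name, and only the first printed line of the sequence is used here for brevity, so the result is not the whole-gene figure.

```python
def gc_percent(seq_text: str) -> float:
    """Return the percentage of G and C bases in a whitespace-formatted sequence."""
    # Drop the spaces and newlines used to lay the sequence out in blocks.
    seq = "".join(seq_text.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First printed line of the gene sequence (60 bases).
first_line = "ATGTCAGATC AACTTCAAAT GACAGATGGT ATGCATATCA TCGTTGAAGC ATTAAAACAG"
print(f"{gc_percent(first_line):.1f}% GC in the first 60 bases")
```

Passing the full gene sequence text to the same function would give the value to compare against the reported GC content.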
 
Protein sequence
MSDQLQMTDG MHIIVEALKQ NNIDTIYGVV GIPVTDMARH AQAEGIRYIG FRHEQSAGYA 
AAASGFLTQK PGICLTVSAP GFLNGLTALA NATVNGFPMI MISGSSDRAI VDLQQGDYEE
LDQMNAAKPY AKAAFRVNQP QDLGIALARA IRVSVSGRPG GVYLDLPANV LAATMEKDEA
LTTIVKVENP SPALLPCPKS VTSAISLLAK AERPLIILGK GAAYSQADEQ LREFIESAQI
PFLPMSMAKG ILEDTHPLSA AAARSFALAN ADVVMLVGAR LNWLLAHGKK GWAADTQFIQ
LDIEPQEIDS NRPIAVPVVG DIASSMQGML AELKQNTFTT PLVWRDILNI HKQQNAQKMH
EKLSTDTQPL NYFNALSAVR DVLRENQDIY LVNEGANTLD NARNIIDMYK PRRRLDCGTW
GVMGIGMGYA IGASVTSGSP VVAIEGDSAF GFSGMEIETI CRYNLPVTIV IFNNGGIYRG
DGVDLSGAGA PSPTDLLHHA RYDKLMDAFR GVGYNVTTTD ELRHALTTGI QSRKPTIINV
VIDPAAGTES GHITKLNPKQ VAGN
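
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The Python sketch below illustrates this on the first and last printed blocks only. It assumes the listed gene sequence is already in the coding orientation, and it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs in the set of permitted start codons, which this sketch ignores). `translate` and the constant names are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq_text: str) -> str:
    """Translate a whitespace-formatted DNA sequence up to the first stop codon."""
    seq = "".join(seq_text.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break  # stop codon ends the protein
        protein.append(aa)
    return "".join(protein)


# First 30 bases and final 15 bases of the gene sequence as printed.
print(translate("ATGTCAGATC AACTTCAAAT GACAGATGGT"))  # MSDQLQMTDG
print(translate("GTCGCTGGTA ATTAA"))                  # VAGN
```

The two outputs match the first ten residues and the last four residues of the protein sequence above; the final TAA codon is the stop.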