Gene EcSMS35_2523 details

Gene Information

Locus tag: EcSMS35_2523
Symbol:
ID: 6144288
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2581075
End bp: 2582769
Gene Length: 1695 bp
Protein Length: 564 aa
Translation table: 11
GC content: 47%
IMG OID: 641617395
Product: putative oxalyl-CoA decarboxylase
Protein accession: YP_001744566
Protein GI: 170682647
COG category: [E] Amino acid transport and metabolism; [H] Coenzyme transport and metabolism
COG ID: [COG0028] Thiamine pyrophosphate-requiring enzymes [acetolactate synthase, pyruvate dehydrogenase (cytochrome), glyoxylate carboligase, phosphonopyruvate decarboxylase]
TIGRFAM ID: [TIGR03254] oxalyl-CoA decarboxylase
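
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The record does not state either convention, and the function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; when includes_stop is True,
    the final codon is treated as a stop and not counted as a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


# Values taken from the record above.
length = gene_length_bp(2581075, 2582769)
print(length, protein_length_aa(length))
```

Under these assumptions the computed values agree with the Gene Length (1695 bp) and Protein Length (564 aa) fields.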


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 66
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCAGATC AACTTCAAAT GACAGATGGT ATGCATATCA TCGTTGAAGC ATTAAAACAG 
AATAATATTG ACACTATTTA TGGTGTTGTA GGTATTCCTG TGACGGATAT GGCACGCCAT
GCCCAGGCGG AAGGCATTCG TTATATTGGT TTTCGTCATG AGCAGTCGGC AGGCTATGCC
GCTGCGGCAA GCGGTTTTCT TACCCAAAAA CCAGGGATCT GCCTGACAGT GTCTGCGCCA
GGATTCCTCA ATGGTTTGAC CGCATTGGCC AACGCAACGG TAAATGGTTT TCCGATGATC
ATGATTAGCG GCTCCAGCGA CCGCGCGATC GTCGACCTAC AGCAAGGTGA TTATGAAGAG
CTGGACCAAA TGAATGCGGC AAAACCGTAT GCCAAAGCAG CATTCCGCGT TAATCAGCCG
CAGGATCTTG GCATTGCATT GGCACGCGCT ATCCGGGTCT CAGTATCGGG TCGCCCTGGC
GGAGTTTATC TTGATTTGCC AGCAAATGTC CTGGCCGCGA CGATGGAAAA AGACGAAGCG
TTAACCACGA TTGTAAAAGT TGAAAATCCG TCGCCAGCAT TATTGCCATG CCCGAAGTCA
GTCGCTAGCG CAATTTCGCT TTTAGCAAAA GCTGAACGAC CATTAGTTAT CCTTGGCAAA
GGCGCGGCGT ATTCACAAGC TGATGAACAG CTTCGTGAAT TTATTGAAAG TGCCCAGATT
CCATTCCTGC CAATGTCTAT GGCGAAAGGG ATCCTGGAAG ATACACATCC ACTTTCTGCG
GCAACTGCGC GTTCGTTTGC CCTGGCAAAT GCTGACGTTG TCATGCTTGT TGGTGCACGA
CTGAATTGGT TACTGGCACA CGGTAAAAAA GGATGGGCGG CAGATACACA GTTTATTCAA
CTGGATATTG AACCGCAGGA AATTGACAGC AACCGCCCCA TTGCTGTGCC AGTCGTTGGC
GATATTGCAT CCAGTATGCA AGGTATGCTG GCAGAACTGA AACAAAACAC ATTTACGACT
CCACTGGTAT GGCGCGATAT TTTAAATATC CACAAGCAGC AAAATGCACA AAAAATGCAT
GAAAAGTTAA GTACAGATAC CCAACCATTA AATTACTTTA ATGCATTGAG TGCCGTGCGC
GATGTTTTGC GCGAGAACCA GGATATTTAT TTAGTTAATG AAGGGGCAAA TACCCTGGAT
AATGCACGAA ATATTATTGA TATGTATAAA CCACGTCGTC GTCTGGATTG TGGCACCTGG
GGTGTCATGG GCATCGGTAT GGGCTATGCC ATCGGTGCTA GCGTGACCTC TGGTTCTCCG
GTTGTCGCCA TTGAAGGTGA TAGTGCTTTT GGTTTCAGTG GGATGGAGAT TGAAACGATT
TGTCGATATA ACCTGCCGGT GACGATCGTT ATTTTTAATA ATGGCGGCAT CTACAGAGGA
GACGGTGTTG ATCTCAGTGG CGCTGGTGCA CCATCACCAA CAGATCTGTT GCACCATGCA
AGGTATGACA AATTAATGGA TGCGTTTCGT GGCGTTGGCT ATAACGTCAC CACGACAGAT
GAACTTCGTC ATGCTTTAAC CACCGGTATT CAGTCGCGCA AACCGACCAT TATTAATGTG
GTCATCGACC CTGCAGCAGG AACTGAAAGT GGCCATATTA CCAAACTTAA CCCAAAACAA
GTCGCTGGTA ATTAA
 
Protein sequence
MSDQLQMTDG MHIIVEALKQ NNIDTIYGVV GIPVTDMARH AQAEGIRYIG FRHEQSAGYA 
AAASGFLTQK PGICLTVSAP GFLNGLTALA NATVNGFPMI MISGSSDRAI VDLQQGDYEE
LDQMNAAKPY AKAAFRVNQP QDLGIALARA IRVSVSGRPG GVYLDLPANV LAATMEKDEA
LTTIVKVENP SPALLPCPKS VASAISLLAK AERPLVILGK GAAYSQADEQ LREFIESAQI
PFLPMSMAKG ILEDTHPLSA ATARSFALAN ADVVMLVGAR LNWLLAHGKK GWAADTQFIQ
LDIEPQEIDS NRPIAVPVVG DIASSMQGML AELKQNTFTT PLVWRDILNI HKQQNAQKMH
EKLSTDTQPL NYFNALSAVR DVLRENQDIY LVNEGANTLD NARNIIDMYK PRRRLDCGTW
GVMGIGMGYA IGASVTSGSP VVAIEGDSAF GFSGMEIETI CRYNLPVTIV IFNNGGIYRG
DGVDLSGAGA PSPTDLLHHA RYDKLMDAFR GVGYNVTTTD ELRHALTTGI QSRKPTIINV
VIDPAAGTES GHITKLNPKQ VAGN
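
The gene and protein sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a block could be cleaned, translated, and checked for GC content follows. It is not the database's own pipeline: the helper names are hypothetical, and the codon table is built by hand from the standard amino-acid assignments, which translation table 11 shares with the standard code (the two differ in their permitted start codons, which this sketch ignores).

```python
from itertools import product

# Codons in NCBI order (T, C, A, G at each position) paired with the
# amino-acid assignments of the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases and last 15 bases of the gene sequence shown above.
print(translate("ATGTCAGATC AACTTCAAAT GACAGATGGT"))  # MSDQLQMTDG
print(translate("GTCGCTGGTA ATTAA"))                  # VAGN
```

The two fragments reproduce the first ten and last four residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the whole translation and the GC content field to be checked the same way.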