Gene ECH74115_0043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0043 
SymbolcaiA 
ID6968409 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp43651 
End bp44793 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content54% 
IMG OID643384124 
Productcrotonobetainyl-CoA dehydrogenase 
Protein accessionYP_002268647 
Protein GI209399530 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones70 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTA ATTTAAATGA TGAGCAGGAA CTGTTTGTCG CCGGTATCCG CGAACTGATG 
GCCAGCGAAA ACTGGGAGGC CTATTTTGCC GAGTGCGACC GTGACAGCGT CTACCCGGAA
CGTTTTGTCA AAGCACTGGC GGATATGGGT ATCGACAGTC TGCTGATCCC TGAAGAGCAC
GGTGGTCTGG ACGCGGGGTT TGTTACTCTC GCCGCCGTGT GGATGGAGCT GGGACGTCTG
GGGGCACCAA CCTATGTACT GTACCAGTTG CCGGGCGGGT TCAATACCTT CCTGCGCGAA
GGCACACAAG AGCAGATCGA CAAGATTATG GCTTTCCGCG GCACCGGTAA GCAGATGTGG
AACTCAGCGA TTACCGAACC GGGCGCGGGC TCCGACGTGG GTAGCCTGAA AACGACTTAT
ACCCGTAGAA ATGGTAAGAT TTATCTTAAT GGTAGTAAGT GTTTTATTAC CAGTAGCGCC
TACACCCCGT ACATCGTGGT GATGGCGCGC GACGGGGCTT CTCCGGACAA ACCTGTCTAC
ACCGAATGGT TTGTTGATAT GAGCAAACCG GGCATCAAAG TGACCAAACT TGAGAAGCTC
GGTCTGCGTA TGGATAGCTG CTGTGAAATC ACCTTTGACG ATGTGGAACT GGACGAGAAA
GACATGTTCG GTCGGGAAGG TAACGGCTTT AACCGCGTCA AAGAAGAGTT CGACCATGAA
CGTTTCCTGG TAGCCCTGAC CAACTACGGT ACGGCGATGT GCGCCTTTGA AGATGCGGCG
CGCTACGCCA ACCAGCGCGT GCAGTTTGGC GAGGCTATTG GTCGTTTCCA GTTGATTCAG
GAAAAATTCG CCCACATGGC GATCAAATTA AACTCCATGA AAAACATGCT GTATGAAGCA
GCGTGGAAAG CAGACAACGG CACCATCACC TCTGGCGATG CAGCGATGTG CAAATACTTC
TGCGCCAATG CGGCATTTGA AGTGGTGGAT AGCGCAATGC AGGTGCTGGG CGGTGTCGGG
ATTGCGGGCA ACCACCGCAT CAGCCGCTTC TGGCGTGACC TGCGTGTAGA CCGCGTTTCC
GGAGGATCTG ACGAAATGCA GATCCTGACG CTGGGTCGTG CGGTGCTGAA GCAATACCGC
TAA
 
Protein sequence
MDFNLNDEQE LFVAGIRELM ASENWEAYFA ECDRDSVYPE RFVKALADMG IDSLLIPEEH 
GGLDAGFVTL AAVWMELGRL GAPTYVLYQL PGGFNTFLRE GTQEQIDKIM AFRGTGKQMW
NSAITEPGAG SDVGSLKTTY TRRNGKIYLN GSKCFITSSA YTPYIVVMAR DGASPDKPVY
TEWFVDMSKP GIKVTKLEKL GLRMDSCCEI TFDDVELDEK DMFGREGNGF NRVKEEFDHE
RFLVALTNYG TAMCAFEDAA RYANQRVQFG EAIGRFQLIQ EKFAHMAIKL NSMKNMLYEA
AWKADNGTIT SGDAAMCKYF CANAAFEVVD SAMQVLGGVG IAGNHRISRF WRDLRVDRVS
GGSDEMQILT LGRAVLKQYR