Gene EcHS_A0043 details


Gene Information

Locus tag: EcHS_A0043
Symbol: caiA
ID: 5591853
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 42502
End bp: 43644
Gene Length: 1143 bp
Protein Length: 380 aa
Translation table: 11
GC content: 54%
IMG OID: 640919231
Product: crotonobetainyl-CoA dehydrogenase
Protein accession: YP_001456826
Protein GI: 157159508
COG category: [I] Lipid transport and metabolism
COG ID: [COG1960] Acyl-CoA dehydrogenases
TIGRFAM ID:
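
The coordinates and lengths listed above can be cross-checked with a few lines of arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 42502
end_bp = 43644

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1143 380
```

Under these assumptions the result agrees with the listed Gene Length (1143 bp) and Protein Length (380 aa).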


Plasmid Coverage information

Num covering plasmid clones: 62
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGATTTTA ATTTAAATGA TGAGCAGGAA CTGTTTGTCG CCGGTATCCG CGAACTGATG 
GCCAGCGAAA ACTGGGAGGC CTATTTTGCC GAGTGCGACC GTGACAGCGT CTACCCGGAA
CGTTTTGTCA AAGCACTGGC GGATATGGGT ATCGACAGTC TGCTGATCCC TGAAGAGCAC
GGTGGTCTGG ACGCGGGGTT TGTTACTCTC GCCGCCGTGT GGATGGAGCT GGGACGTCTG
GGGGCACCAA CCTATGTACT GTACCAGTTG CCGGGCGGGT TCAACACCTT CCTGCGCGAA
GGCACACAAG AGCAGATCGA CAAGATTATG GCTTTCCGCG GCACCGGTAA GCAGATGTGG
AACTCAGCGA TTACCGAACC GGGCGCGGGC TCCGACGTGG GTAGCCTGAA AACGACTTAT
ACCCGTAGAA ATGGTAAGAT TTATCTTAAT GGTAGTAAGT GTTTTATTAC CAGCAGCGCC
TACACCCCGT ACATCGTGGT GATGGCGCGC GACGGGGCTT CTCCGGACAA ACCTGTCTAC
ACCGAATGGT TTGTTGATAT GAGCAAACCG GGCATCAAAG TGACCAAACT TGAAAAGCTC
GGTCTGCGTA TGGATAGCTG CTGTGAAATC ACCTTTGACG ACGTGGAACT GGACGAGAAA
GACATGTTCG GTCGGGAAGG TAACGGCTTT AACCGCGTCA AAGAAGAGTT CGACCATGAA
CGTTTCCTGG TAGCCCTGAC CAACTACGGT ACGGCGATGT GCGCCTTTGA AGATGCGGCG
CGCTACGCCA ACCAGCGCGT GCAGTTTGGC GAGGCTATTG GTCGTTTCCA GTTGATTCAG
GAAAAATTCG CCCACATGGC GATCAAATTA AACTCCATGA AAAACATGCT GTATGAAGCA
GCGTGGAAAG CAGACAACGG CACCATCACC TCTGGCGATG CAGCGATGTG CAAATACTTC
TGCGCCAATG CGGCATTTGA AGTGGTGGAT AGCGCAATGC AGGTGCTGGG CGGTGTCGGG
ATTGCGGGCA ACCACCGCAT CAGCCGCTTC TGGCGTGACC TGCGTGTAGA CCGCGTCTCC
GGGGGATCTG ACGAAATGCA GATCCTGACG CTGGGTCGTG CGGTGCTGAA GCAATACCGC
TAA
 
Protein sequence
MDFNLNDEQE LFVAGIRELM ASENWEAYFA ECDRDSVYPE RFVKALADMG IDSLLIPEEH 
GGLDAGFVTL AAVWMELGRL GAPTYVLYQL PGGFNTFLRE GTQEQIDKIM AFRGTGKQMW
NSAITEPGAG SDVGSLKTTY TRRNGKIYLN GSKCFITSSA YTPYIVVMAR DGASPDKPVY
TEWFVDMSKP GIKVTKLEKL GLRMDSCCEI TFDDVELDEK DMFGREGNGF NRVKEEFDHE
RFLVALTNYG TAMCAFEDAA RYANQRVQFG EAIGRFQLIQ EKFAHMAIKL NSMKNMLYEA
AWKADNGTIT SGDAAMCKYF CANAAFEVVD SAMQVLGGVG IAGNHRISRF WRDLRVDRVS
GGSDEMQILT LGRAVLKQYR