Gene EcSMS35_0040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0040 
SymbolcaiA 
ID6146162 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp44423 
End bp45565 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content54% 
IMG OID641614941 
Productcrotonobetainyl-CoA dehydrogenase 
Protein accessionYP_001742157 
Protein GI170680462 
COG category[I] Lipid transport and metabolism 
COG ID[COG1960] Acyl-CoA dehydrogenases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones53 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATTTTA ATTTAAATGA TGAGCAGGAA CTGTTTGTCG CCGGTATCCG CGAACTGATG 
GCCAGCGAAA ACTGGGAGGC CTATTTTGCC GAGTGCGACC GTGACAGCGT CTACCCGGAA
CGTTTTGTCA AAGCACTGGC GGATATGGGT ATCGACAGTC TGCTGATCCC TGAAGAGCAC
GGTGGTCTGG ACGCGGGGTT TGTTACTCTC GCCGCCGTGT GGATGGAGCT GGGACGTCTG
GGGGCACCAA CCTATGTGCT GTACCAGTTG CCGGGCGGGT TCAATACCTT CCTGCGCGAA
GGCACACAAG AGCAGATCGA CAAGATTATG GCTTTCCGCG GCACCGGTAA GCAGATGTGG
AACTCAGCGA TTACCGAACC GGGTGCGGGC TCCGACGTGG GTAGCCTGAA AACGACTTAT
ACCCGTAGAA ATGGTAAGAT TTATCTTAAT GGTAGTAAGT GTTTTATTAC CAGTAGCGCC
TACACCCCGT ACATCGTGGT GATGGCGCGC GACGGGGCTT CTCCGGACAA ACCTGTCTAC
ACCGAATGGT TTGTTGATAT GAGCAAACCG GGCATCAAAG TGACCAAACT TGAAAAGCTC
GGTCTGCGTA TGGATAGCTG CTGTGAAATC ACTTTTGACG ACGTGGAACT GGACGAGAAA
GACATGTTCG GTCGGGAAGG TAACGGCTTT AACCGCGTCA AAGAAGAGTT CGACCATGAA
CGTTTCCTGG TAGCCCTGAC CAACTACGGT ACGGCGATGT GCGCCTTTGA AGATGCGGCG
CGCTACGCCA ACCAGCGTGT GCAGTTTGGC GAGGCTATTG GTCGTTTCCA GTTGATTCAG
GAAAAATTCG CCCACATGGC GATCAAATTA AACTCCATGA AAAACATGCT GTATGAAGCA
GCGTGGAAAG CAGACAACGG CACCATCACC TCTGGCGATG CAGCGATGTG CAAATACTTC
TGCGCCAATG CGGCATTTGA AGTGGTGGAT AGCGCAATGC AGGTGCTGGG CGGTGTCGGG
ATTGCGGGCA ACCACCGCAT CAGCCGCTTC TGGCGTGACC TGCGTGTAGA CCGCGTTTCC
GGAGGATCTG ACGAAATGCA GATCCTGACG CTGGGTCGCG CGGTGCTGAA GCAATACCGC
TAA
 
Protein sequence
MDFNLNDEQE LFVAGIRELM ASENWEAYFA ECDRDSVYPE RFVKALADMG IDSLLIPEEH 
GGLDAGFVTL AAVWMELGRL GAPTYVLYQL PGGFNTFLRE GTQEQIDKIM AFRGTGKQMW
NSAITEPGAG SDVGSLKTTY TRRNGKIYLN GSKCFITSSA YTPYIVVMAR DGASPDKPVY
TEWFVDMSKP GIKVTKLEKL GLRMDSCCEI TFDDVELDEK DMFGREGNGF NRVKEEFDHE
RFLVALTNYG TAMCAFEDAA RYANQRVQFG EAIGRFQLIQ EKFAHMAIKL NSMKNMLYEA
AWKADNGTIT SGDAAMCKYF CANAAFEVVD SAMQVLGGVG IAGNHRISRF WRDLRVDRVS
GGSDEMQILT LGRAVLKQYR