Gene EcSMS35_4678 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4678 
Symbol 
ID6145723 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4776394 
End bp4777935 
Gene Length1542 bp 
Protein Length513 aa 
Translation table11 
GC content53% 
IMG OID641619494 
ProductCoA-transferase 
Protein accessionYP_001746602 
Protein GI170684002 
COG category[I] Lipid transport and metabolism 
COG ID[COG4670] Acyl CoA:acetate/3-ketoacid CoA transferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.696656 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTAAGA TAACAACCGC CGAGGCACTG GCGGCGCAAA TTCAGGACGG TGCGACTATT 
GCTATTAGCG GTAACGGCGG CGGTATGGTG GAAGCCGACC ATATTCTGGC TGCTATTGAA
GCGCGTTTCC TGCAAACCGG ACACCCGCGC GATCTGACAT TAATCCACTC GCTGGGTATT
GGCGATCGCG ACAGCAAAGG CACTAACCGT TTTGCTCACG CCGAAATGCT CAAACGCATT
ATTGCCGGAC ATTTTACCTG GTCGCCCAAG ATGCAGGAAC TGGTGAAAAG CAATGCTATT
GAAGCCTACT GTTTCCCTGG TGGCGTGATT CAGGCGTTGC TACGGGAGAT CGGTGCCGGA
CGTCCGGGGC TTTTTACCCA CGTTGGGCTG GGATCGTTTG TTGATCCACG CAATGGCGGC
GGTAAGTCGA ATGAATGCAC TACCACCGAC CTGGTAGAAC TGATTGAAAT CGATGGTGAA
ACCAAACTTC GTTATCGCCC TTTCAAAGTG GATTACGCGA TTTTGCGTGG CACTTATGCC
GATCCTCGAG GCAACGTCAG CCTTGAAGAA GAAGCGATTG ATATGGATAG CTATTCCATG
GCGCTGGCAG CACACAACAG CGGCGGCAAA GTGTTCGTAC AGGTACGCGA TGTGCTGGAA
GCTGGCGCGA TTGAACCACG TCGGGTCAAA TTACCGGGGA TTCTGGTTGA TGGCATCGTT
GAGCACCGCG AACAACCGCA AACCTATCTT GGTGGTTACG ACCTGACCAT TAGCGGTCAA
CATCGCCGTC TAAGTTCTAA CGACGCTATT GAACTGGTTA GTCATCCGGT GCGTCGCCTG
ATTGCCCGTC GGGCAGCACG GGAACTGGTG GCAGGCGCTT CAACCAACTT TGGCTTTGGT
ATTCCGGGCG GTATTCCAGG CGTAGCGCTG CGCGAAGGCG TGCCTTATCA AAGTTTGTGG
CTGAGTGTAG AACAGGGTGT ACATAACGGC ATGATGCTGG ATGATGCTCT GTTCGGCTGC
GCCCGTAACG CCGATGCCAT TATTCCATCA CTCGATCAAT TCGAATTCTA CAGTGGCGGA
GGGATCGATA TCACCTTCCT CGGCATGGGA GAGATGGATC AGTACGGTAA CGTCAACGTC
TCCCATCTCA ATGGCAATCT GATTGGCCCC GGCGGATTTC TCGAAATTGC GCAAAACGCC
CGTAAAGTGG TGTTCTGCGG CACGTTCGAC GCCAAAGGTA GCAAGATTGA TGTAACGCCA
GATGGCTTGC ATATCGCCCA GTCAGGTCAA ATCCCTAAAC TGGTTACCCA GGTGGAAAAA
ATCACTTTTA GCGCCGCCTA CGCACAGCAA AGTGGTCAGG AAGTGTTGTA TATCACTGAA
CGTGCAGTAT TCCAGTTAAC GGCAGAAGGC GTTGAATTAA TTGAAATCGC ACCAGGTGTG
GAGATTGAGC GCGACATTCT GCCGTATATG GCCTTCCGTC CAATTATCAA TCAGCCACGC
CTGATGGAAA GTAGCCTGTT TACGCCGATG GAGGATGCAT GA
 
Protein sequence
MRKITTAEAL AAQIQDGATI AISGNGGGMV EADHILAAIE ARFLQTGHPR DLTLIHSLGI 
GDRDSKGTNR FAHAEMLKRI IAGHFTWSPK MQELVKSNAI EAYCFPGGVI QALLREIGAG
RPGLFTHVGL GSFVDPRNGG GKSNECTTTD LVELIEIDGE TKLRYRPFKV DYAILRGTYA
DPRGNVSLEE EAIDMDSYSM ALAAHNSGGK VFVQVRDVLE AGAIEPRRVK LPGILVDGIV
EHREQPQTYL GGYDLTISGQ HRRLSSNDAI ELVSHPVRRL IARRAARELV AGASTNFGFG
IPGGIPGVAL REGVPYQSLW LSVEQGVHNG MMLDDALFGC ARNADAIIPS LDQFEFYSGG
GIDITFLGMG EMDQYGNVNV SHLNGNLIGP GGFLEIAQNA RKVVFCGTFD AKGSKIDVTP
DGLHIAQSGQ IPKLVTQVEK ITFSAAYAQQ SGQEVLYITE RAVFQLTAEG VELIEIAPGV
EIERDILPYM AFRPIINQPR LMESSLFTPM EDA