Gene ECH74115_0041 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_0041 
SymbolcaiC 
ID6969473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp40678 
End bp42231 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content51% 
IMG OID643384122 
Productputative crotonobetaine/carnitine-CoA ligase 
Protein accessionYP_002268645 
Protein GI209400477 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATCA TTGGCGGACA ACATCTACGT CAAATGTGGG ACGATCTGGC GGACGTTTAC 
GGTCATAAAA CGGCGCTGAT TTGTGAATCC AGCGGCGGAG TCGTTAACCG GTATAGTTAT
CTTGAGTTAA ATCAGGAGAT TAACCGCACG GCAAACCTGT TTTATACGCT GGGGATTCGC
AAAGGCGACA AGGTTGCACT ACATCTCGAC AACTGCCCGG AATTTATCTT TTGCTGGTTC
GGGCTGGCAA AAATTGGCGC GATTATGGTG CCGATTAACG CCCGCCTGTT ACGCGAGGAA
AGTGCGTGGA TCCTGCAAAA TAGCCAGGCG TGCCTGCTGG TGACCAGTGC GCAATTCTAT
CCTATGTATC AACAGATTCA GCAAGAAGAT GCCACTCAAT TGCGGCACAT TTGCCTGACA
GATGTGGCAC TTCCCGCTGA TGATGGCGTG AGTTCGTTTA CTCAACTGAA AAATCAACAA
CCTGCCACCT TGTGCTATGC ACCGCCGCTA TCGACTGACG ATACGGCGGA AATTCTCTTC
ACCTCCGGCA CCACCTCCCG ACCGAAAGGT GTGGTGATTA CCCATTACAA CCTGCGCTTC
GCTGGATATT ACTCCGCCTG GCAGTGTGCA CTGCGTGACG ATGACGTCTA CCTGACGGTA
ATGCCTGCGT TTCATATCGA TTGCCAGTGT ACTGCGGCGA TGGCGGCGTT TTCTGCCGGG
GCCACCTTTG TGCTGGTCGA GAAATACAGC GCCCGCGCCT TCTGGGGACA GGTACAGAAG
TACCGCGCCA CCATTACCGA ATGTATTCCG ATGATGATTC GTACGTTGAT GGTGCAGCCG
CCTTCAGCGA ACGATCGGCA ACACCGCCTG CGGGAAGTGA TGTTTTATCT CAACTTGTCG
GAGCAGGAAA AAGACACATT TTGTGAACGC TTCGGTGTTC GCTTGCTGAC GTCTTATGGG
ATGACGGAAA CCATTGTGGG CATTATCGGC GATCGCCCTG GCGATAAACG ACGCTGGCCG
TCGATTGGTC GGGCGGGGTT TTGCTACGAC GCGGAGATCC GCGACGATCA CAATCGCCCG
CTCCCGGCAG GTGAGATCGG TGAAATCTGT ATTAAAGGCG TACCAGGGAA AACCATCTTC
AAAGAGTATT TTCTCAACCC GAAAGCCACT GCAAAAGTGC TGGAAGCTGA TGGCTGGCTG
CATACCGGCG ATACCGGATA CCGCGACGAA GAAGGCTTTT TTTATTTCAT CGATCGCCGC
TGCAATATGA TCAAACGCGG TGGCGAGAAT GTCTCCTGCG TGGAGCTGGA AAATATCATC
GCCACCCATC CAAAAATTCA GGATATCGTG GTTGTGGGTA TTAAAGATTC GATTCGCGAT
GAAGCCATCA AAGCATTTGT GGTGCTGAAT GAAGGTGAAA CATTGAGCGA AGAGGAATTT
TTCCGCTTCT GCGAACAAAA TATGGCGAAA TTTAAAGTGC CCTCTTATCT GGAGATCAGA
AAAGATCTGC CACGTAATTG CTCGGGGAAA ATAATTAGAA AGAATCTGAA ATAA
 
Protein sequence
MDIIGGQHLR QMWDDLADVY GHKTALICES SGGVVNRYSY LELNQEINRT ANLFYTLGIR 
KGDKVALHLD NCPEFIFCWF GLAKIGAIMV PINARLLREE SAWILQNSQA CLLVTSAQFY
PMYQQIQQED ATQLRHICLT DVALPADDGV SSFTQLKNQQ PATLCYAPPL STDDTAEILF
TSGTTSRPKG VVITHYNLRF AGYYSAWQCA LRDDDVYLTV MPAFHIDCQC TAAMAAFSAG
ATFVLVEKYS ARAFWGQVQK YRATITECIP MMIRTLMVQP PSANDRQHRL REVMFYLNLS
EQEKDTFCER FGVRLLTSYG MTETIVGIIG DRPGDKRRWP SIGRAGFCYD AEIRDDHNRP
LPAGEIGEIC IKGVPGKTIF KEYFLNPKAT AKVLEADGWL HTGDTGYRDE EGFFYFIDRR
CNMIKRGGEN VSCVELENII ATHPKIQDIV VVGIKDSIRD EAIKAFVVLN EGETLSEEEF
FRFCEQNMAK FKVPSYLEIR KDLPRNCSGK IIRKNLK