Gene EcSMS35_0038 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0038 
SymbolcaiC 
ID6143120 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp41450 
End bp43003 
Gene Length1554 bp 
Protein Length517 aa 
Translation table11 
GC content51% 
IMG OID641614939 
Productputative crotonobetaine/carnitine-CoA ligase 
Protein accessionYP_001742155 
Protein GI170681586 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.565789 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones52 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATATCA TTGGCGGACA ACATCTACGT CAAATGTGGG ACGATCTGGC GGACGTTTAC 
GGTCATAAAA CGGCGCTGAT TTGTGAATCC AGCGGCGGAG TCGTTAACCG GTATAGTTAT
CTTGAGTTAA ATCAGGAGAT TAACCGCACG GCAAACCTGT TTTATACGCT GGGGATTCGC
AAAGGCGACA AGGTTGCTCT ACATCTCGAC AACTGCCCGG AATTTATCTT TTGCTGGTTC
GGGCTGGCAA AAATTGGCGC GATTATGGTG CCGATTAACG CCCGCCTGTT ACGCGAGGAA
AGCGCGTGGA TCCTGCAAAA TAGCCAGGCG TGCCTGCTGG TGACCAGTGC GCAATTCTAT
CCCATGTATC AACAGATTCA GCAGGAAGAT GCCACGCAGT TGCGGCACAT TTGCCTGACA
GATGTGGCAC TTCCCGCTGA TGATGGCGTG AGTTCGTTTA CTCAACTGAA AAATCAACAA
CCTGCCACCT TGTGCTATGC ACCGCCGCTA TCGACTGACG ATACGGCGGA AATTCTCTTC
ACCTCCGGCA CTACCTCCCG ACCGAAAGGT GTGGTGATTA CCCATTACAA CCTGCGCTTC
GCTGGATATT ACTCCGCCTG GCAGTGTGCA CTGCGTGACG ATGACGTCTA CCTGACGGTA
ATGCCTGCGT TTCATATCGA TTGCCAGTGT ACTGCGGCGA TGGCGGCGTT TTCTGCCGGG
GCCACCTTTG TGCTGGTCGA GAAATACAGC GCCCGCGCCT TCTGGGGACA GGTGCAGAAG
TACCGCGCCA CCATTACCGA ATGTATTCCG ATGATGATCC GTACCTTGAT GGTGCAACCG
CCTTCTGCGA ACGATCAGCA ACACCGCCTG CGGGAAGTGA TGTTTTATCT CAACTTGTCG
GCGCAGGAAA AAGACGCATT TTGTGAACGC TTCGGCGTTC GCTTGCTGAC GTCTTATGGG
ATGACGGAAA CCATTGTGGG CATTATCGGC GATCGCCCTG GCGATAAACG GCGCTGGCCG
TCGATTGGTC GGGCGGGCTT TTGCTACGAA GCGGAGATCC GCGACGATCA CAATCGCCCG
CTCCCGGCAG GTGAGATCGG TGAAATCTGT ATTAAAGGCG TACCAGGGAA AACCATCTTC
AAAGAGTATT TTCTCAACCC GAAAGCCACT GCGAAAGTGC TGGAAGCCGA CGGCTGGCTA
CATACCGGCG ATACCGGATA CCGCGACGAA GAGGGCTTTT TTTATTTCGT CGATCGCCGC
TGCAATATGA TCAAACGCGG TGGCGAGAAT GTCTCCTGCG TGGAGCTCGA AAATATCATC
GCCACCCATC CAAAAATTCA GGATATCGTG GTTGTGGGAA TTAAAGATTC GATACGCGAT
GAAGCCATCA AAGCATTTGT GGTGCTGAAT GAAGGTGAAA CATTGAGCGA AGAGGAATTT
TTCCGCTTCT GCGAACAAAA TATGGCGAAA TTTAAAGTGC CCTCTTATCT GGAGATCAGA
AAAGATCTGC CACGTAATTG CTCGGGGAAA ATAATTAGAA AGAATCTGAA ATAA
 
Protein sequence
MDIIGGQHLR QMWDDLADVY GHKTALICES SGGVVNRYSY LELNQEINRT ANLFYTLGIR 
KGDKVALHLD NCPEFIFCWF GLAKIGAIMV PINARLLREE SAWILQNSQA CLLVTSAQFY
PMYQQIQQED ATQLRHICLT DVALPADDGV SSFTQLKNQQ PATLCYAPPL STDDTAEILF
TSGTTSRPKG VVITHYNLRF AGYYSAWQCA LRDDDVYLTV MPAFHIDCQC TAAMAAFSAG
ATFVLVEKYS ARAFWGQVQK YRATITECIP MMIRTLMVQP PSANDQQHRL REVMFYLNLS
AQEKDAFCER FGVRLLTSYG MTETIVGIIG DRPGDKRRWP SIGRAGFCYE AEIRDDHNRP
LPAGEIGEIC IKGVPGKTIF KEYFLNPKAT AKVLEADGWL HTGDTGYRDE EGFFYFVDRR
CNMIKRGGEN VSCVELENII ATHPKIQDIV VVGIKDSIRD EAIKAFVVLN EGETLSEEEF
FRFCEQNMAK FKVPSYLEIR KDLPRNCSGK IIRKNLK