Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0038 |
Symbol | caiC |
ID | 6143120 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 41450 |
End bp | 43003 |
Gene Length | 1554 bp |
Protein Length | 517 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641614939 |
Product | putative crotonobetaine/carnitine-CoA ligase |
Protein accession | YP_001742155 |
Protein GI | 170681586 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.565789 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATCA TTGGCGGACA ACATCTACGT CAAATGTGGG ACGATCTGGC GGACGTTTAC GGTCATAAAA CGGCGCTGAT TTGTGAATCC AGCGGCGGAG TCGTTAACCG GTATAGTTAT CTTGAGTTAA ATCAGGAGAT TAACCGCACG GCAAACCTGT TTTATACGCT GGGGATTCGC AAAGGCGACA AGGTTGCTCT ACATCTCGAC AACTGCCCGG AATTTATCTT TTGCTGGTTC GGGCTGGCAA AAATTGGCGC GATTATGGTG CCGATTAACG CCCGCCTGTT ACGCGAGGAA AGCGCGTGGA TCCTGCAAAA TAGCCAGGCG TGCCTGCTGG TGACCAGTGC GCAATTCTAT CCCATGTATC AACAGATTCA GCAGGAAGAT GCCACGCAGT TGCGGCACAT TTGCCTGACA GATGTGGCAC TTCCCGCTGA TGATGGCGTG AGTTCGTTTA CTCAACTGAA AAATCAACAA CCTGCCACCT TGTGCTATGC ACCGCCGCTA TCGACTGACG ATACGGCGGA AATTCTCTTC ACCTCCGGCA CTACCTCCCG ACCGAAAGGT GTGGTGATTA CCCATTACAA CCTGCGCTTC GCTGGATATT ACTCCGCCTG GCAGTGTGCA CTGCGTGACG ATGACGTCTA CCTGACGGTA ATGCCTGCGT TTCATATCGA TTGCCAGTGT ACTGCGGCGA TGGCGGCGTT TTCTGCCGGG GCCACCTTTG TGCTGGTCGA GAAATACAGC GCCCGCGCCT TCTGGGGACA GGTGCAGAAG TACCGCGCCA CCATTACCGA ATGTATTCCG ATGATGATCC GTACCTTGAT GGTGCAACCG CCTTCTGCGA ACGATCAGCA ACACCGCCTG CGGGAAGTGA TGTTTTATCT CAACTTGTCG GCGCAGGAAA AAGACGCATT TTGTGAACGC TTCGGCGTTC GCTTGCTGAC GTCTTATGGG ATGACGGAAA CCATTGTGGG CATTATCGGC GATCGCCCTG GCGATAAACG GCGCTGGCCG TCGATTGGTC GGGCGGGCTT TTGCTACGAA GCGGAGATCC GCGACGATCA CAATCGCCCG CTCCCGGCAG GTGAGATCGG TGAAATCTGT ATTAAAGGCG TACCAGGGAA AACCATCTTC AAAGAGTATT TTCTCAACCC GAAAGCCACT GCGAAAGTGC TGGAAGCCGA CGGCTGGCTA CATACCGGCG ATACCGGATA CCGCGACGAA GAGGGCTTTT TTTATTTCGT CGATCGCCGC TGCAATATGA TCAAACGCGG TGGCGAGAAT GTCTCCTGCG TGGAGCTCGA AAATATCATC GCCACCCATC CAAAAATTCA GGATATCGTG GTTGTGGGAA TTAAAGATTC GATACGCGAT GAAGCCATCA AAGCATTTGT GGTGCTGAAT GAAGGTGAAA CATTGAGCGA AGAGGAATTT TTCCGCTTCT GCGAACAAAA TATGGCGAAA TTTAAAGTGC CCTCTTATCT GGAGATCAGA AAAGATCTGC CACGTAATTG CTCGGGGAAA ATAATTAGAA AGAATCTGAA ATAA
|
Protein sequence | MDIIGGQHLR QMWDDLADVY GHKTALICES SGGVVNRYSY LELNQEINRT ANLFYTLGIR KGDKVALHLD NCPEFIFCWF GLAKIGAIMV PINARLLREE SAWILQNSQA CLLVTSAQFY PMYQQIQQED ATQLRHICLT DVALPADDGV SSFTQLKNQQ PATLCYAPPL STDDTAEILF TSGTTSRPKG VVITHYNLRF AGYYSAWQCA LRDDDVYLTV MPAFHIDCQC TAAMAAFSAG ATFVLVEKYS ARAFWGQVQK YRATITECIP MMIRTLMVQP PSANDQQHRL REVMFYLNLS AQEKDAFCER FGVRLLTSYG MTETIVGIIG DRPGDKRRWP SIGRAGFCYE AEIRDDHNRP LPAGEIGEIC IKGVPGKTIF KEYFLNPKAT AKVLEADGWL HTGDTGYRDE EGFFYFVDRR CNMIKRGGEN VSCVELENII ATHPKIQDIV VVGIKDSIRD EAIKAFVVLN EGETLSEEEF FRFCEQNMAK FKVPSYLEIR KDLPRNCSGK IIRKNLK
|
| |