Gene Information |
Locus tag | EcHS_A0041 |
Symbol | caiC |
ID | 5591851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 39529 |
End bp | 41097 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640919229 |
Product | putative crotonobetaine/carnitine-CoA ligase |
Protein accession | YP_001456824 |
Protein GI | 157159506 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II |
TIGRFAM ID | |
|
|
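The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon and that the gene length includes the terminal stop codon; the record does not state either convention, and the function name `cds_lengths` is hypothetical.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based inclusive coordinates and a single stop codon
    that is counted in the gene length but not in the protein.
    """
    gene_length = end_bp - start_bp + 1      # inclusive span
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_length = gene_length // 3 - 1    # drop the stop codon
    return gene_length, protein_length


# Values taken from the record for locus EcHS_A0041 (caiC)
gene_len, protein_len = cds_lengths(39529, 41097)
print(gene_len, protein_len)  # 1569 522
```

The strand value does not affect this calculation, since the span is the same whichever strand the gene is read from.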
Plasmid Coverage information |
Num covering plasmid clones | 68 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGATAGAG GTGCAATGGA TATCATTGGC GGACAACATC TACGTCAAAT GTGGGACGAT CTTGCGGACG TTTACGGTCA TAAAACGGCG CTGATTTGTG AATCCAGCGG CGGAGTCGTT AACCGGTATA GTTATCTTGA GTTAAATCAG GAGATTAACC GCACGGCAAA CCTGTTTTAT ACGCTGGGGA TTCGCAAAGG CGACAAGGTT GCACTACATC TCGACAACTG CCCGGAATTT ATCTTTTGCT GGTTCGGGCT GGCAAAAATT GGCGCGATTA TGGTGCCGAT TAACGCCCGC CTGTTACGCG AGGAAAGCGC GTGGATCCTG CAAAATAGCC AGGCGTGCCT GCTGGTGACC AGTGCGCAAT TCTATCCTAT GTATCAACAG ATTCAGCAGG AAGATGCCAC TCAATTGCGG CACATTTGCC TGACAGATGT GGCACTTCCC GCTGATGATG GCGTGAGTTC GTTTACTCAA CTGAAAAATC AACAACCTGC CACCTTGTGC TATGCACCGC CGCTATCGAC TGACGATACG GCGGAAATTC TCTTCACCTC CGGCACCACC TCCCGACCGA AAGGTGTGGT GATTACCCAT TACAACCTGC GCTTCGCTGG ATATTACTCC GCCTGGCAGT GTGCACTGCG TGACGATGAC GTCTACCTGA CGGTAATGCC TGCGTTTCAT ATCGATTGCC AGTGTACTGC GGCGATGGCG GCGTTTTCTG CCGGGGCCAC CTTTGTGCTG GTCGAGAAAT ACAGCGCCCG CGCCTTCTGG GGACAGGTGC AGAAGTACCG CGCCACCATT ACCGAATGTA TTCCGATGAT GATCCGTACG TTGATGGTGC AGCCGCCTTC AGCGAACGAT CGGCAACACC GCCTGCGGGA AGTGATGTTT TATCTCAACT TGTCGGAGCA GGAAAAAGAT GCGTTTTGTG AACGCTTCGG CGTTCGCTTG CTGACGTCTT ATGGGATGAC GGAAACCATT GTGGGCATTA TCGGCGATCG CCCTGGCGAT AAACGACGCT GGCCGTCGAT TGGTCGGGCG GGGTTTTGCT ACGAAGCGGA GATCCGCGAC GATCACAATC GCCCGCTCCC GGCAGGTGAG ATCGGTGAAA TCTGTATTAA AGGCGTACCA GGGAAAACCA TCTTCAAAGA GTATTTTCTC AACCCGAAAG CCACTGCGAA AGTGCTGGAA GCCGACGGCT GGCTACATAC CGGCGATACC GGATACTGCG ACGAAGAGGG CTTTTTTTAT TTCGTCGATC GCCGCTGCAA TATGATCAAA CGCGGTGGCG AGAATGTCTC CTGCGTGGAG CTGGAAAATA TCATCGCCAC CCATCCAAAA ATTCAGGATA TCGTGGTTGT GGGTATTAAA GATTCGATTC GCGATGAAGC CATCAAAGCA TTTGTGGTGC TGAATGAAGG TGAAACATTG AGCGAAGAGG AATTTTTCCG CTTCTGCGAA CAAAATATGG CGAAATTTAA AGTGCCCTCT TATCTGGAGA TCAGAAAAGA TCTGCCACGT AATTGCTCGG GGAAAATAAT TAGAAAGAAT CTGAAATAA
|
Protein sequence | MDRGAMDIIG GQHLRQMWDD LADVYGHKTA LICESSGGVV NRYSYLELNQ EINRTANLFY TLGIRKGDKV ALHLDNCPEF IFCWFGLAKI GAIMVPINAR LLREESAWIL QNSQACLLVT SAQFYPMYQQ IQQEDATQLR HICLTDVALP ADDGVSSFTQ LKNQQPATLC YAPPLSTDDT AEILFTSGTT SRPKGVVITH YNLRFAGYYS AWQCALRDDD VYLTVMPAFH IDCQCTAAMA AFSAGATFVL VEKYSARAFW GQVQKYRATI TECIPMMIRT LMVQPPSAND RQHRLREVMF YLNLSEQEKD AFCERFGVRL LTSYGMTETI VGIIGDRPGD KRRWPSIGRA GFCYEAEIRD DHNRPLPAGE IGEICIKGVP GKTIFKEYFL NPKATAKVLE ADGWLHTGDT GYCDEEGFFY FVDRRCNMIK RGGENVSCVE LENIIATHPK IQDIVVVGIK DSIRDEAIKA FVVLNEGETL SEEEFFRFCE QNMAKFKVPS YLEIRKDLPR NCSGKIIRKN LK
|
| |
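The gene sequence above is shown in coding orientation (it begins with ATG even though the gene lies on the minus strand), so it can be translated directly. The following is a minimal sketch in plain Python, assuming the space-separated 10-base blocks are purely cosmetic and using the amino acid assignments of translation table 11, which match the standard genetic code. The names `CODON_TABLE` and `translate` are illustrative and not part of the record.

```python
from itertools import product

# Codon table built in NCBI order (bases T, C, A, G; third position varies fastest).
# Translation table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()   # remove the display spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":                    # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in the record
print(translate("ATGGATAGAG GTGCAATGGA TATCATTGGC"))  # MDRGAMDIIG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed in the record once its spaces are removed.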