Gene EcolC_1828 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1828 
Symbol 
ID6065267 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2026877 
End bp2028628 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content51% 
IMG OID641601242 
Productlong-chain-fatty-acid--CoA ligase 
Protein accessionYP_001724804 
Protein GI170019850 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.325094 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTAACGG CATGTATATC ATTTGGGGTT GCGATGACGA CGAACACGCA TTTTAGAGGT 
GAAGAATTGA AGAAGGTTTG GCTTAACCGT TATCCCGCGG ACGTTCCGAC GGAGATCAAC
CCTGACCGTT ATCAATCTCT GGTAGATATG TTTGAGCAGT CGGTCGCGCG CTACGCCGAT
CAACCTGCGT TTGTGAATAT GGGGGAGGTA ATGACCTTCC GCAAGCTGGA AGAACGCAGT
CGCGCGTTTG CCGCTTATTT GCAACAAGGG TTGGGGCTGA AGAAAGGCGA TCGCGTTGCG
TTGATGATGC CTAATTTATT GCAATATCCG GTGGCGCTGT TTGGCATTTT GCGTGCCGGG
ATGATCGTCG TAAACGTTAA CCCGTTGTAT ACCCCGCGTG AGCTTGAGCA TCAGCTTAAC
GATAGCGGCG CATCGGCGAT TGTTATCGTG TCTAACTTTG CTCACACACT GGAAAAAGTG
GTTGATAAAA CCGCCGTTCA GCACGTAATT CTGACCCGTA TGGGCGATCA GCTATCTACG
GCAAAAGGCA CGGTAGTCAA TTTCGTTGTT AAATACATCA AGCGTTTGGT GCCGAAATAC
CATCTGCCAG ATGCCATTTC ATTTCGTAGC GCACTGCATA ACGGCTACCG GATGCAGTAC
GTCAAACCCG AACTGGTGCC GGAAGATTTA GCTTTTCTGC AATACACCGG CGGCACCACT
GGTGTGGCGA AAGGCGCGAT GCTGACTCAC CGCAATATGC TGGCGAACCT GGAACAGGTT
AACGCGACCT ATGGTCCGCT GTTGCATCCG GGCAAAGAGC TGGTGGTGAC GGCGCTGCCG
CTGTATCACA TTTTTGCCCT GACCATTAAC TGCCTGCTGT TTATCGAACT GGGTGGGCAG
AACCTGCTTA TCACTAACCC GCGCGATATT CCAGGGTTGG TAAAAGAGTT AGCGAAATAT
CCGTTTACCG CTATCACGGG CGTTAACACC TTGTTCAATG CGTTGCTGAA CAATAAAGAG
TTCCAGCAGC TGGATTTCTC CAGTCTGCAT CTTTCCGCAG GCGGTGGGAT GCCAGTGCAG
CAAGTGGTGG CAGAGCGTTG GGTGAAACTG ACCGGACAGT ATCTGCTGGA AGGCTATGGC
CTTACCGAGT GTGCGCCGCT GGTCAGCGTT AACCCATATG ATATTGATTA TCATAGTGGT
AGCATCGGTT TGCCGGTGCC GTCGACGGAA GCCAAACTGG TGGATGATGA TGATAATGAA
GTACCACCAG GTCAACCGGG TGAGCTTTGT GTCAAAGGAC CGCAGGTGAT GCTGGGTTAC
TGGCAGCGTC CCGATGCTAC CGATGAAATC ATCAAAAATG GCTGGTTACA CACCGGCGAC
ATCGCGGTAA TGGATGAAGA AGGATTCCTG CGCATTGTCG ATCGTAAAAA AGACATGATT
CTGGTTTCCG GTTTTAACGT CTATCCCAAC GAGATTGAAG ATGTCGTCAT GCAGCATCCT
GGCGTACAGG AAGTCGCGGC TGTTGGCGTA CCTTCCGGCT CCAGTGGTGA AGCGGTGAAA
ATCTTCGTAG TGAAAAAAGA TCCATCGCTT ACCGAAGAGT CACTGGTGAC TTTTTGCCGC
CGTCAGCTCA CGGGATACAA AGTACCGAAG CTGGTGGAGT TTCGTGATGA GTTACCGAAA
TCTAACGTCG GAAAAATTTT GCGACGAGAA TTACGTGACG AAGCGCGCGG CAAAGTGGAC
AATAAAGCCT GA
 
Protein sequence
MLTACISFGV AMTTNTHFRG EELKKVWLNR YPADVPTEIN PDRYQSLVDM FEQSVARYAD 
QPAFVNMGEV MTFRKLEERS RAFAAYLQQG LGLKKGDRVA LMMPNLLQYP VALFGILRAG
MIVVNVNPLY TPRELEHQLN DSGASAIVIV SNFAHTLEKV VDKTAVQHVI LTRMGDQLST
AKGTVVNFVV KYIKRLVPKY HLPDAISFRS ALHNGYRMQY VKPELVPEDL AFLQYTGGTT
GVAKGAMLTH RNMLANLEQV NATYGPLLHP GKELVVTALP LYHIFALTIN CLLFIELGGQ
NLLITNPRDI PGLVKELAKY PFTAITGVNT LFNALLNNKE FQQLDFSSLH LSAGGGMPVQ
QVVAERWVKL TGQYLLEGYG LTECAPLVSV NPYDIDYHSG SIGLPVPSTE AKLVDDDDNE
VPPGQPGELC VKGPQVMLGY WQRPDATDEI IKNGWLHTGD IAVMDEEGFL RIVDRKKDMI
LVSGFNVYPN EIEDVVMQHP GVQEVAAVGV PSGSSGEAVK IFVVKKDPSL TEESLVTFCR
RQLTGYKVPK LVEFRDELPK SNVGKILRRE LRDEARGKVD NKA