Gene EcSMS35_1494 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1494 
SymbolfadK 
ID6143828 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1478190 
End bp1479830 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content50% 
IMG OID641616372 
Productshort chain acyl-CoA synthetase 
Protein accessionYP_001743552 
Protein GI170680187 
COG category[I] Lipid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value0.433242 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAGTGA CATTAACGTT TAACGAACAA CGTCGTGCGG CGTATCGTCA GCAAGGGCTA 
TGGGGCGATG CTTCGCTGGC CGATTACTGG CAGCAGACCG CTCGTGCGAT GCCAGACAAA
ATTGCCGTGG TCGATAATCA TGGTGCATCA TACACCTATA GCGCGCTCGA TCACGCCGCG
AGCTGTCTGG CAAACTGGAT GTTGACGAAG GGTATTGAAT CAGGCGATCG CATCGCATTT
CAACTGCCTG GCTGGTGTGA ATTTACCGTT ATCTATCTTG CCTGCCTGAA AATCGGTGCG
GTTTCCGTAC CGCTGTTGCC TTCCTGGCGG GAAGCAGAAC TAGTATGGGT ACTCAATAAG
TGTCAGGCAA AAATGTTCTT TGCACCGACG TTGTTTAAAC AAACGCGTCC GGTAGATTTA
ATCCTGCCGC TGCAAAATCA GCTTCCACAA CTACAACAAA TTGTCGGCGT GGACAAACTG
GCTCCCGCCA CCTCTTCCCT CTCATTAAGT CAGATTCTCG CCGACAATAC CCCACTGACT
ACGGCGATAA CGACCCACGG CGATGAATTA GCCGCTGTGC TATTTACCTC CGGAACCGAG
GGTCTGCCAA AGGGCGTGAT GCTAACGCAT AACAATATTC TCGCCAGTGA GCGGGCTTAT
TGCGCGCGGC TGAATCTGAC CTGGCAGGAT GTCTTTATGA TGCCTGCGCC ACTTGGTCAC
GCAACGGGCT TTCTGCATGG CGTAACAGCA CCATTTTTAA TTGGTGCTCG CAGCGTGTTG
TTAGATATTT TCACTCCTGC TGCGTGTCTC GCGCTGCTTG AGCAGCAGCG TTGCACCTGT
ATGCTCGGCG CAACGCCGTT TGTCTATGAT CTTTTGAATT TACTAGAGAA ACAGCCCGCA
GACCTTTCAG CGCTGCGTTT CTTTCTTTGT GGCGGTACCA CAATCCCCAA AAAAGTGGCG
CGTGAATGCC AGCAGCGCGG CATTAAATTA TTAAGTGTTT ATGGTTCCAC AGAAAGTTCG
CCGCATGCGG TGGTGAATCT CGATGATCCT TTGTCGCGCT TTATGCACAC CGATGGTTAC
GCTGCCGCAG GTGTAGAGAT TAAAGTGGTC GATGGCGCAC GCAAGACCTT ACCGCCAGGT
TGCGAAGGTG AAGAAGCCTC GCGTGGCCCC AATGTGTTTA TGGGGTATTT TGATGAACCT
GAATTAACCG CCCATGCCCT GGATGAAGAA GGCTGGTATT ACAGCGGCGA TCTCTGCCGC
ATGGATGAGG CTGGCTATAT AAAAATAACC GGGCGCAAGA AAGATATTAT TGTCCGCGGC
GGCGAAAATA TTAGCAGCCG TGAAGTGGAA GATATTTTAT TACAGCATCC TAAAATTCAC
GATGCTTGTG TGGTTGCGAT GCCCGATGAA CGCTTAGGTG AACGTTCATG CGCTTATGTC
GTGCTGAAAG CACCGCATCA TTCATTATCG CTGGAAGATG TAGTGGCATT TTTTAGCCGT
AAACGGGTCG CGAAATATAA ATATCCTGAA CATATCGTGG TAATCGAAAA ACTACCGCGC
ACTGCCTCCG GTAAAATACA AAAATTTTTG TTACGTAAAG ATATTCTTCA ACGGCTGGAA
CAAACATGCG TTGAGGCATA A
 
Protein sequence
MKVTLTFNEQ RRAAYRQQGL WGDASLADYW QQTARAMPDK IAVVDNHGAS YTYSALDHAA 
SCLANWMLTK GIESGDRIAF QLPGWCEFTV IYLACLKIGA VSVPLLPSWR EAELVWVLNK
CQAKMFFAPT LFKQTRPVDL ILPLQNQLPQ LQQIVGVDKL APATSSLSLS QILADNTPLT
TAITTHGDEL AAVLFTSGTE GLPKGVMLTH NNILASERAY CARLNLTWQD VFMMPAPLGH
ATGFLHGVTA PFLIGARSVL LDIFTPAACL ALLEQQRCTC MLGATPFVYD LLNLLEKQPA
DLSALRFFLC GGTTIPKKVA RECQQRGIKL LSVYGSTESS PHAVVNLDDP LSRFMHTDGY
AAAGVEIKVV DGARKTLPPG CEGEEASRGP NVFMGYFDEP ELTAHALDEE GWYYSGDLCR
MDEAGYIKIT GRKKDIIVRG GENISSREVE DILLQHPKIH DACVVAMPDE RLGERSCAYV
VLKAPHHSLS LEDVVAFFSR KRVAKYKYPE HIVVIEKLPR TASGKIQKFL LRKDILQRLE
QTCVEA