Gene EcHS_A3155 details

Gene Information

Locus tag: EcHS_A3155
Symbol:
ID: 5593625
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3168187
End bp: 3169917
Gene Length: 1731 bp
Protein Length: 576 aa
Translation table: 11
GC content: 53%
IMG OID: 640922275
Product: acyl-CoA synthetase
Protein accession: YP_001459773
Protein GI: 157162455
COG category: [I] Lipid transport and metabolism
              [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0318] Acyl-CoA synthetases (AMP-forming)/AMP-acid ligases II
TIGRFAM ID:
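
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the numbers in this record); the variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3168187
end_bp = 3169917
gene_length_bp = 1731
protein_length_aa = 576

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS is a whole number of codons; the final codon is the stop codon
# and does not contribute a residue to the protein.
codons, remainder = divmod(span, 3)
expected_protein_length = codons - 1

print(span == gene_length_bp)                        # span matches Gene Length
print(remainder == 0)                                # length is a multiple of 3
print(expected_protein_length == protein_length_aa)  # matches Protein Length
```

Under that assumption all three checks agree with the record: 1731 bp is 577 codons, that is, 576 residues plus one stop codon.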


Plasmid Coverage information

Num covering plasmid clones: 53
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGGTGTATA TGTCTAATAA AATCTTTACG CATTCCCTAC CTATGCGCTA TGCCGATTTT 
CCAACGCTGG TTGATGCTTT GGACTACGCC GCTCTGAGTA GCGCCGGAAT GAATTTTTAT
GACAGACGTT GCCAACTTGA AGATCAACTG GAATATCAGA CGTTAAAAGC ACGTGCCGAA
GCTGTTGCGA AGCGGTTGTT ATCGCTGAAC CTGAAAAAAG GCGATCGCGT GGCACTGATT
GCCGAAACAA GTAGCGGGTT CGTAGAGGCT TTTTTTGTCT GCCAGTATGC CGGCTTAGTC
GCCGTCCCGT TGGCGATTCC AATGGGCGTT GGTCAGCGGG ATTCCTGGAG CGCCAAATTG
CAGGGTTTAC TGGCAAGTTG CCAGCCCGCA GCCATTATCA CTGGTGATGA GTGGTTGCCA
CTGGTCAATG CCGCGACGCA TGACAACCCC GAATTACATG TTTTAAGCCA CGCCTGGTTT
AAGGCATTAC CGGAAGCCGA TGTTGCGCTC CAGCGTCCAG TTCCGAACGA TATCGCCTAC
CTCCAGTACA CCTCCGGCAG CACCCGTTTT CCCCGTGGCG TCATTATCAC CCATCGCGAA
GTAATGGCTA ATCTACGTGC TATAAGCCAC GACGGCATTA AATTACGCCC TGGCGACCGC
TGCGTCTCCT GGCTGCCTTT CTACCATGAT ATGGGACTGG TCGGCTTTCT CCTGACCCCC
GTCGCCACGC AGCTTTCAGT AGATTATTTG CGCACTCAGG ATTTTGCCAT GCGTCCTCTG
CAATGGCTTA AATTGATCAG TAAAAATCGC GGCACCGTTT CCGTTGCGCC GCCGTTTGGC
TATGAATTGT GCCAGCGCCG CGTGAATGAA AAAGATCTCG CTGAACTGGA TCTTTCCTGC
TGGCGCGTCG CTGGTATTGG TGCTGAGCCG ATCTCCGCAG AACAACTCCA TCAATTCGCT
GAATGTTTCC GTCAGGTTAA CTTTGACGAT AAAACGTTCA TGCCGTGCTA CGGACTGGCA
GAAAATGCGC TGGCTGTCAG CTTCTCTGAT GAAGCCTCCG GGGTTGTGGT TAACGAAGTG
GATCGCGACA TCCTCGAATA TCAGGGCAAA GCCGTCGCGC CGGGTGCAGA GACACGCGCC
GTATCGACTT TCGTCAACTG CGGCAAAGCG TTGCCGGAAC ATGGTATTGA AATCCGCAAT
GAAGCAGGTA TGCCGGTCGC GGAACGTGTG GTAGGCCATA TTTGCATCTC CGGTCCCAGT
CTGATGAGCG GTTACTTTGG CGACCAGGTT TCGCAAGACG AGATTGCCGC GACGGGCTGG
TTAGACACCG GCGACCTCGG TTATCTGCTG GACGGTTATC TGTATGTCAC CGGACGCATT
AAAGATCTGA TTATTATTCG TGGCCGTAAT ATCTGGCCGC AGGATATTGA ATATATTGCG
GAACAAGAAC CGGAAATTCA TTCTGGCGAT GCGATTGCTT TTGTTACCGC CCAGGAAAAA
ATCATTTTGC AGATCCAGTG TCGGATCAGC GACGAAGAAC GTCGCGGGCA GCTTATCCAC
GCGCTGGCGG CACGGATCCA AAGCGAATTT GGCGTGACCG CGGCTATCGA TCTGTTGCCG
CCCCACAGTA TTCCCCGAAC GTCCTCCGGC AAGCCTGCCC GTGCGGAAGC GAAAAAACGT
TATCAGAAGG CTTATGCTGC CAGTCTTAAT GTGCAGGAAT CCCTGGCATG A
 
Protein sequence
MVYMSNKIFT HSLPMRYADF PTLVDALDYA ALSSAGMNFY DRRCQLEDQL EYQTLKARAE 
AVAKRLLSLN LKKGDRVALI AETSSGFVEA FFVCQYAGLV AVPLAIPMGV GQRDSWSAKL
QGLLASCQPA AIITGDEWLP LVNAATHDNP ELHVLSHAWF KALPEADVAL QRPVPNDIAY
LQYTSGSTRF PRGVIITHRE VMANLRAISH DGIKLRPGDR CVSWLPFYHD MGLVGFLLTP
VATQLSVDYL RTQDFAMRPL QWLKLISKNR GTVSVAPPFG YELCQRRVNE KDLAELDLSC
WRVAGIGAEP ISAEQLHQFA ECFRQVNFDD KTFMPCYGLA ENALAVSFSD EASGVVVNEV
DRDILEYQGK AVAPGAETRA VSTFVNCGKA LPEHGIEIRN EAGMPVAERV VGHICISGPS
LMSGYFGDQV SQDEIAATGW LDTGDLGYLL DGYLYVTGRI KDLIIIRGRN IWPQDIEYIA
EQEPEIHSGD AIAFVTAQEK IILQIQCRIS DEERRGQLIH ALAARIQSEF GVTAAIDLLP
PHSIPRTSSG KPARAEAKKR YQKAYAASLN VQESLA
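
The gene sequence starts with GTG while the protein starts with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the alternative initiation codons and is read as methionine at the start position. The sketch below illustrates this on the first line of the gene sequence. It is an illustration rather than the tool used to produce this record: the helper names `clean` and `translate` are hypothetical, and the codon table is built by hand from the standard NCBI base ordering (T, C, A, G).

```python
from itertools import product

# Amino acids for translation table 11, in NCBI order (codon bases cycle T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Initiation codons listed for table 11; each is read as M when it starts the CDS.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to format the sequence in 10-letter blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # alternative start codons still initiate with methionine
        else:
            aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGGTGTATA TGTCTAATAA AATCTTTACG CATTCCCTAC CTATGCGCTA TGCCGATTTT"
print(translate(first_line))  # MVYMSNKIFTHSLPMRYADF
```

The printed residues match the first 20 residues of the protein sequence above. Note that the second GTG, which is not at the start position, is translated as V as usual.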