Gene EcSMS35_4531 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4531 
SymbolacsA 
ID6142923 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4631470 
End bp4633428 
Gene Length1959 bp 
Protein Length652 aa 
Translation table11 
GC content57% 
IMG OID641619347 
Productacetyl-CoA synthetase 
Protein accessionYP_001746459 
Protein GI170683866 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02188] acetate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAAA TTCACAAACA CACCATTCCT GCCAACATCG CAGACCGTTG CCTGATAAAC 
CCTCAGCAGT ACGAGGCGAT GTATCAACAA TCTATTAACG CACCAGATAC CTTCTGGGGC
GAACAGGGAA AAATTCTCGA CTGGATCAAA CCTTACCAGA AGGTGAAAAA CACCTCCTTT
GCCCCCGGTA ATGTGTCCAT TAAATGGTAC GAGGACGGCA CGCTGAATCT GGCGGCAAAC
TGCCTTGACC GCCATCTGCA AGAAAACGGC GATCGTACCG CCATCATCTG GGAAGGCGAC
GACGCCAGCC AGAGCAAACA TATCAGCTAT AAAGAGCTGC ACCGCGACGT CTGCCGCTTC
GCCAATACCC TGCTCGAGCT GGGCATTAAA AAAGGTGATG TGGTGGCGAT TTATATGCCG
ATGGTGCCGG AAGCCGCGGT CGCGATGCTG GCCTGCGCCC GCATTGGCGC GGTGCATTCG
GTAATTTTCG GCGGCTTCTC GCCGGAAGCC GTTGCCGGGC GCATTATCGA TTCTAACTCA
CGGTTGGTGA TCACTTCCGA CGAAGGCGTG CGCGCCGGGC GTAGTATTCC GCTGAAGAAA
AACGTTGATG ACGCGCTGAA AAACCCGAAC GTCACCAGCG TAGAGCATGT GGTGGTACTG
AAGCGTACTG GCGGAAAAAT TGACTGGCAG GAAGGGCGCG ACCTGTGGTG GCACGACCTG
GTTGAGCAAG CGAGCGATCA GCACCAGGCG GAAGAGATGA ACGCCGAAGA TCCGCTGTTT
ATTCTCTATA CCTCCGGTTC TACCGGTAAG CCGAAAGGCG TACTGCACAC TACCGGCGGT
TATCTGGTGT ACGCGGCGCT GACCTTTAAA TATGTCTTTG ATTATCATCC GGGCGATATC
TACTGGTGCA CCGCCGATGT GGGTTGGGTT ACCGGACACA GTTATTTGCT GTACGGCCCG
TTGGCCTGCG GCGCGACCAC GCTGATGTTT GAAGGCGTAC CGAACTGGCC GACGCCTGCC
CGTATGGCGC AGGTGGTGGA CAAGCATCAG GTCAATATTC TCTATACCGC GCCCACGGCG
ATTCGCGCGC TGATGGCGGA AGGCGATAAA GCGATTGAAG GCACCGATCG ATCGTCGCTG
CGCATTCTCG GTTCCGTGGG CGAGCCAATC AACCCGGAAG CGTGGGAGTG GTACTGGAAG
AAGATCGGCA ACGAGAAATG CCCAGTGGTC GATACCTGGT GGCAGACCGA AACTGGCGGT
TTCATGATCA CCCCACTGCC TGGCGCTACC GAGCTGAAAG CCGGTTCGGC AACACGTCCG
TTCTTCGGCG TACAACCGGC GCTGGTCGAT AACGAAGGTC ATCCGCTGGA GGGGGCCGCC
GAAGGCAGTC TGGTGATCAC CGACTCCTGG CCTGGTCAGG CGCGTACATT GTTTGGCGAT
CACGAACGCT TTGAACAGAC CTACTTCTCC ACCTTCAAAA ATATGTATTT CAGCGGCGAC
GGCGCGCGTC GTGATGAAGA TGGCTATTAC TGGATCACCG GGCGCGTGGA CGACGTGCTG
AACGTCTCGG GTCACCGTCT GGGAACAGCG GAGATTGAGT CGGCGCTGGT GGCGCATCCG
AAGATTGCCG AAGCCGCTGT CGTCGGTATT CCGCACAATA TTAAAGGTCA GGCGATCTAC
GCCTACGTCA CGCTTAATCA CGGGGAGGAA CCGTCACCAG AACTGTACGC AGAAGTCCGC
AACTGGGTGC GTAAAGAGAT TGGCCCGCTG GCGACGCCAG ACGTGCTGCA CTGGACCGAC
TCCCTGCCTA AAACCCGCTC CGGCAAAATT ATGCGCCGTA TTCTGCGCAA AATTGCGGCG
GGCGATACCA GCAACCTGGG CGATACCTCG ACGCTTGCCG ATCCTGGCGT AGTCGAGAAG
CTGCTTGAAG AGAAGCAGGC TATCGCGATG CCATCGTAA
 
Protein sequence
MSQIHKHTIP ANIADRCLIN PQQYEAMYQQ SINAPDTFWG EQGKILDWIK PYQKVKNTSF 
APGNVSIKWY EDGTLNLAAN CLDRHLQENG DRTAIIWEGD DASQSKHISY KELHRDVCRF
ANTLLELGIK KGDVVAIYMP MVPEAAVAML ACARIGAVHS VIFGGFSPEA VAGRIIDSNS
RLVITSDEGV RAGRSIPLKK NVDDALKNPN VTSVEHVVVL KRTGGKIDWQ EGRDLWWHDL
VEQASDQHQA EEMNAEDPLF ILYTSGSTGK PKGVLHTTGG YLVYAALTFK YVFDYHPGDI
YWCTADVGWV TGHSYLLYGP LACGATTLMF EGVPNWPTPA RMAQVVDKHQ VNILYTAPTA
IRALMAEGDK AIEGTDRSSL RILGSVGEPI NPEAWEWYWK KIGNEKCPVV DTWWQTETGG
FMITPLPGAT ELKAGSATRP FFGVQPALVD NEGHPLEGAA EGSLVITDSW PGQARTLFGD
HERFEQTYFS TFKNMYFSGD GARRDEDGYY WITGRVDDVL NVSGHRLGTA EIESALVAHP
KIAEAAVVGI PHNIKGQAIY AYVTLNHGEE PSPELYAEVR NWVRKEIGPL ATPDVLHWTD
SLPKTRSGKI MRRILRKIAA GDTSNLGDTS TLADPGVVEK LLEEKQAIAM PS