Gene Information |
Locus tag | EcSMS35_4531 |
Symbol | acsA |
ID | 6142923 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4631470 |
End bp | 4633428 |
Gene Length | 1959 bp |
Protein Length | 652 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641619347 |
Product | acetyl-CoA synthetase |
Protein accession | YP_001746459 |
Protein GI | 170683866 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases |
TIGRFAM ID | [TIGR02188] acetate--CoA ligase |
| |
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAAA TTCACAAACA CACCATTCCT GCCAACATCG CAGACCGTTG CCTGATAAAC CCTCAGCAGT ACGAGGCGAT GTATCAACAA TCTATTAACG CACCAGATAC CTTCTGGGGC GAACAGGGAA AAATTCTCGA CTGGATCAAA CCTTACCAGA AGGTGAAAAA CACCTCCTTT GCCCCCGGTA ATGTGTCCAT TAAATGGTAC GAGGACGGCA CGCTGAATCT GGCGGCAAAC TGCCTTGACC GCCATCTGCA AGAAAACGGC GATCGTACCG CCATCATCTG GGAAGGCGAC GACGCCAGCC AGAGCAAACA TATCAGCTAT AAAGAGCTGC ACCGCGACGT CTGCCGCTTC GCCAATACCC TGCTCGAGCT GGGCATTAAA AAAGGTGATG TGGTGGCGAT TTATATGCCG ATGGTGCCGG AAGCCGCGGT CGCGATGCTG GCCTGCGCCC GCATTGGCGC GGTGCATTCG GTAATTTTCG GCGGCTTCTC GCCGGAAGCC GTTGCCGGGC GCATTATCGA TTCTAACTCA CGGTTGGTGA TCACTTCCGA CGAAGGCGTG CGCGCCGGGC GTAGTATTCC GCTGAAGAAA AACGTTGATG ACGCGCTGAA AAACCCGAAC GTCACCAGCG TAGAGCATGT GGTGGTACTG AAGCGTACTG GCGGAAAAAT TGACTGGCAG GAAGGGCGCG ACCTGTGGTG GCACGACCTG GTTGAGCAAG CGAGCGATCA GCACCAGGCG GAAGAGATGA ACGCCGAAGA TCCGCTGTTT ATTCTCTATA CCTCCGGTTC TACCGGTAAG CCGAAAGGCG TACTGCACAC TACCGGCGGT TATCTGGTGT ACGCGGCGCT GACCTTTAAA TATGTCTTTG ATTATCATCC GGGCGATATC TACTGGTGCA CCGCCGATGT GGGTTGGGTT ACCGGACACA GTTATTTGCT GTACGGCCCG TTGGCCTGCG GCGCGACCAC GCTGATGTTT GAAGGCGTAC CGAACTGGCC GACGCCTGCC CGTATGGCGC AGGTGGTGGA CAAGCATCAG GTCAATATTC TCTATACCGC GCCCACGGCG ATTCGCGCGC TGATGGCGGA AGGCGATAAA GCGATTGAAG GCACCGATCG ATCGTCGCTG CGCATTCTCG GTTCCGTGGG CGAGCCAATC AACCCGGAAG CGTGGGAGTG GTACTGGAAG AAGATCGGCA ACGAGAAATG CCCAGTGGTC GATACCTGGT GGCAGACCGA AACTGGCGGT TTCATGATCA CCCCACTGCC TGGCGCTACC GAGCTGAAAG CCGGTTCGGC AACACGTCCG TTCTTCGGCG TACAACCGGC GCTGGTCGAT AACGAAGGTC ATCCGCTGGA GGGGGCCGCC GAAGGCAGTC TGGTGATCAC CGACTCCTGG CCTGGTCAGG CGCGTACATT GTTTGGCGAT CACGAACGCT TTGAACAGAC CTACTTCTCC ACCTTCAAAA ATATGTATTT CAGCGGCGAC GGCGCGCGTC GTGATGAAGA TGGCTATTAC TGGATCACCG GGCGCGTGGA CGACGTGCTG AACGTCTCGG GTCACCGTCT GGGAACAGCG GAGATTGAGT CGGCGCTGGT GGCGCATCCG AAGATTGCCG AAGCCGCTGT CGTCGGTATT CCGCACAATA TTAAAGGTCA GGCGATCTAC GCCTACGTCA CGCTTAATCA CGGGGAGGAA CCGTCACCAG AACTGTACGC AGAAGTCCGC AACTGGGTGC GTAAAGAGAT TGGCCCGCTG GCGACGCCAG ACGTGCTGCA CTGGACCGAC 
TCCCTGCCTA AAACCCGCTC CGGCAAAATT ATGCGCCGTA TTCTGCGCAA AATTGCGGCG GGCGATACCA GCAACCTGGG CGATACCTCG ACGCTTGCCG ATCCTGGCGT AGTCGAGAAG CTGCTTGAAG AGAAGCAGGC TATCGCGATG CCATCGTAA
|
Protein sequence | MSQIHKHTIP ANIADRCLIN PQQYEAMYQQ SINAPDTFWG EQGKILDWIK PYQKVKNTSF APGNVSIKWY EDGTLNLAAN CLDRHLQENG DRTAIIWEGD DASQSKHISY KELHRDVCRF ANTLLELGIK KGDVVAIYMP MVPEAAVAML ACARIGAVHS VIFGGFSPEA VAGRIIDSNS RLVITSDEGV RAGRSIPLKK NVDDALKNPN VTSVEHVVVL KRTGGKIDWQ EGRDLWWHDL VEQASDQHQA EEMNAEDPLF ILYTSGSTGK PKGVLHTTGG YLVYAALTFK YVFDYHPGDI YWCTADVGWV TGHSYLLYGP LACGATTLMF EGVPNWPTPA RMAQVVDKHQ VNILYTAPTA IRALMAEGDK AIEGTDRSSL RILGSVGEPI NPEAWEWYWK KIGNEKCPVV DTWWQTETGG FMITPLPGAT ELKAGSATRP FFGVQPALVD NEGHPLEGAA EGSLVITDSW PGQARTLFGD HERFEQTYFS TFKNMYFSGD GARRDEDGYY WITGRVDDVL NVSGHRLGTA EIESALVAHP KIAEAAVVGI PHNIKGQAIY AYVTLNHGEE PSPELYAEVR NWVRKEIGPL ATPDVLHWTD SLPKTRSGKI MRRILRKIAA GDTSNLGDTS TLADPGVVEK LLEEKQAIAM PS
|
| |
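
The length fields in this record can be cross-checked against the coordinates with a short Python sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon. The function names are hypothetical and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS of this length, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def normalize_sequence(blocked: str) -> str:
    """Strip the spaces and line breaks used to display sequence in blocks."""
    return "".join(blocked.split())


# Values taken from the record above.
length = gene_length(4631470, 4633428)
print(length)                   # 1959
print(protein_length(length))   # 652

# First two display blocks of the gene sequence, joined into one string.
print(normalize_sequence("ATGAGTCAAA TTCACAAACA"))  # ATGAGTCAAATTCACAAACA
```

Both computed values agree with the Gene Length (1959 bp) and Protein Length (652 aa) fields. `normalize_sequence` also joins sequence text that has been wrapped across lines.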