Gene EcolC_3958 details


Gene Information

Locus tag: EcolC_3958
Symbol:
ID: 6064481
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 4346478
End bp: 4348436
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 57%
IMG OID: 641603371
Product: acetyl-CoA synthetase
Protein accession: YP_001726886
Protein GI: 170021932
COG category: [I] Lipid transport and metabolism
COG ID: [COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases
TIGRFAM ID: [TIGR02188] acetate--CoA ligase
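
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon that is not counted in the protein length; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 4346478
end_bp = 4348436

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not part of the protein.
protein_length = gene_length // 3 - 1

print(gene_length)     # 1959
print(protein_length)  # 652
```

Under these assumptions the computed values agree with the listed Gene Length (1959 bp) and Protein Length (652 aa).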


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value: 0.524009
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCCAAA TTCACAAACA CACCATTCCT GCCAACATCG CAGACCGTTG CCTGATAAAC 
CCTCAGCAGT ACGAGGCGAT GTATCAACAA TCTATTAACG TACCTGATAC CTTCTGGGGC
GAACAGGGAA AAATTCTTGA CTGGATCAAA CCTTACCAGA AGGTGAAAAA CACCTCCTTT
GCCCCCGGTA ATGTGTCCAT TAAATGGTAC GAGGACGGCA CGCTGAATCT GGCGGCAAAC
TGCCTTGACC GCCATCTGCA AGAAAACGGC GATCGTACCG CCATCATCTG GGAAGGCGAC
GACGCCAGCC AGAGCAAACA TATCAGCTAT AAAGAGCTGC ACCGCGACGT CTGCCGCTTC
GCCAATACCC TGCTCGAGCT GGGCATTAAA AAAGGTGATG TGGTGGCGAT TTATATGCCG
ATGGTGCCGG AAGCCGCGGT TGCGATGCTG GCCTGCGCCC GCATTGGCGC GGTGCATTCG
GTGATTTTCG GCGGCTTCTC GCCGGAAGCC GTTGCCGGGC GCATTATTGA TTCCAACTCA
CGACTGGTGA TCACTTCCGA CGAAGGTGTG CGTGCCGGGC GCAGTATTCC GCTGAAGAAA
AACGTTGATG ACGCGCTGAA AAACCCGAAC GTCACCAGCG TAGAGCATGT GGTGGTACTG
AAGCGTACTG GCGGGAAAAT TGACTGGCAG GAAGGGCGCG ACCTGTGGTG GCACGACCTG
GTTGAGCAAG CGAGCGATCA GCACCAGGCG GAAGAGATGA ACGCCGAAGA TCCGCTGTTT
ATTCTCTACA CCTCCGGTTC TACCGGTAAG CCAAAAGGTG TGCTGCATAC TACCGGCGGT
TATCTGGTGT ACGCGGCGCT GACCTTTAAA TATGTCTTTG ATTATCATCC GGGTGATATC
TACTGGTGCA CCGCCGATGT GGGCTGGGTG ACCGGACACA GTTACTTGCT GTACGGCCCG
CTGGCCTGCG GTGCGACCAC GCTGATGTTT GAAGGCGTAC CGAACTGGCC GACGCCTGCC
CGTATGGCGC AGGTGGTGGA CAAGCATCAG GTCAATATTC TCTATACCGC GCCCACGGCG
ATTCGCGCGC TGATGGCGGA AGGGGATAAA GCGATTGAAG GCACCGACCG ATCGTCGCTG
CGCATTCTCG GTTCCGTGGG CGAGCCAATC AACCCGGAAG CGTGGGAGTG GTACTGGAAA
AAAATCGGCA ACGAGAAATG TCCGGTGGTC GATACCTGGT GGCAGACCGA AACTGGCGGT
TTCATGATCA CGCCGCTGCC TGGCGCTACC GAGCTGAAAG CCGGTTCGGC AACACGTCCG
TTCTTCGGCG TGCAACCGGC GCTGGTCGAT AACGAAGGTA ACCCGCTGGA AGGCGCTACC
GAAGGCAGCC TGGTGATCAC CGACTCCTGG CCGGGTCAGG CGCGTACGCT GTTTGGCGAT
CACGAACGTT TTGAACAGAC CTACTTCTCC ACCTTCAAAA ATATGTATTT CAGCGGCGAC
GGCGCGCGTC GCGATGAAGA TGGCTATTAC TGGATAACCG GGCGTGTGGA CGATGTGCTG
AACGTCTCCG GTCACCGTCT GGGAACGGCG GAGATTGAGT CGGCGCTGGT GGCGCATCCG
AAGATTGCCG AAGCCGCCGT AGTAGGTATT CCGCACAATA TTAAAGGTCA GGCGATCTAC
GCCTACGTCA CGCTTAATCA CGGGGAGGAA CCGTCACCAG AACTGTACGC AGAAGTCCGC
AACTGGGTGC GTAAAGAGAT TGGCCCGCTG GCGACGCCAG ACGTGCTGCA CTGGACCGAC
TCCCTGCCTA AAACCCGCTC CGGCAAAATT ATGCGCCGTA TTCTGCGCAA AATTGCGGCG
GGCGATACCA GCAACCTGGG CGATACCTCG ACGCTTGCCG ATCCTGGCGT AGTCGAGAAG
CTGCTTGAAG AGAAGCAGGC TATCGCGATG CCATCGTAA
 
Protein sequence
MSQIHKHTIP ANIADRCLIN PQQYEAMYQQ SINVPDTFWG EQGKILDWIK PYQKVKNTSF 
APGNVSIKWY EDGTLNLAAN CLDRHLQENG DRTAIIWEGD DASQSKHISY KELHRDVCRF
ANTLLELGIK KGDVVAIYMP MVPEAAVAML ACARIGAVHS VIFGGFSPEA VAGRIIDSNS
RLVITSDEGV RAGRSIPLKK NVDDALKNPN VTSVEHVVVL KRTGGKIDWQ EGRDLWWHDL
VEQASDQHQA EEMNAEDPLF ILYTSGSTGK PKGVLHTTGG YLVYAALTFK YVFDYHPGDI
YWCTADVGWV TGHSYLLYGP LACGATTLMF EGVPNWPTPA RMAQVVDKHQ VNILYTAPTA
IRALMAEGDK AIEGTDRSSL RILGSVGEPI NPEAWEWYWK KIGNEKCPVV DTWWQTETGG
FMITPLPGAT ELKAGSATRP FFGVQPALVD NEGNPLEGAT EGSLVITDSW PGQARTLFGD
HERFEQTYFS TFKNMYFSGD GARRDEDGYY WITGRVDDVL NVSGHRLGTA EIESALVAHP
KIAEAAVVGI PHNIKGQAIY AYVTLNHGEE PSPELYAEVR NWVRKEIGPL ATPDVLHWTD
SLPKTRSGKI MRRILRKIAA GDTSNLGDTS TLADPGVVEK LLEEKQAIAM PS
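
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It is a minimal example, not the method used to produce this record: the helper name `translate` and the constants are hypothetical, and the codon table is built from the standard amino-acid assignments, which translation table 11 shares for internal codons. Table 11's alternative start codons are not handled here; this gene begins with ATG, so the first residue is M either way.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in TCAG order ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    # Remove the spaces and line breaks used to group the sequence in blocks of ten.
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the gene sequence above.
print(translate("ATGAGCCAAA TTCACAAACA C"))  # MSQIHKH
# Last four codons of the gene sequence above, ending in the TAA stop codon.
print(translate("ATGCCATCGTAA"))  # MPS
```

Both fragments match the start (MSQIHKH) and end (MPS) of the listed protein sequence. Passing the full gene sequence to the same function should reproduce the whole protein, though that is not shown here.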