Gene EcDH1_3923 details


Gene Information

Locus tag: EcDH1_3923
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4227099
End bp: 4229057
Gene Length: 1959 bp
Protein Length: 652 aa
Translation table: 11
GC content: 57%
IMG OID:
Product: acetate/CoA ligase
Protein accession: ACX41523
Protein GI: 260451101
COG category:
COG ID:
TIGRFAM ID:
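
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal Python sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, excluding the terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4227099, 4229057)
print(length, protein_length(length))  # 1959 652
```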


Plasmid Coverage information

Num covering plasmid clones: 47
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCCAAA TTCACAAACA CACCATTCCT GCCAACATCG CAGACCGTTG CCTGATAAAC 
CCTCAGCAGT ACGAGGCGAT GTATCAACAA TCTATTAACG TACCTGATAC CTTCTGGGGC
GAACAGGGAA AAATTCTTGA CTGGATCAAA CCTTACCAGA AGGTGAAAAA CACCTCCTTT
GCCCCCGGTA ATGTGTCCAT TAAATGGTAC GAGGACGGCA CGCTGAATCT GGCGGCAAAC
TGCCTTGACC GCCATCTGCA AGAAAACGGC GATCGTACCG CCATCATCTG GGAAGGCGAC
GACGCCAGCC AGAGCAAACA TATCAGCTAT AAAGAGCTGC ACCGCGACGT CTGCCGCTTC
GCCAATACCC TGCTCGAGCT GGGCATTAAA AAAGGTGATG TGGTGGCGAT TTATATGCCG
ATGGTGCCGG AAGCCGCGGT TGCGATGCTG GCCTGCGCCC GCATTGGCGC GGTGCATTCG
GTGATTTTCG GCGGCTTCTC GCCGGAAGCC GTTGCCGGGC GCATTATTGA TTCCAACTCA
CGACTGGTGA TCACTTCCGA CGAAGGTGTG CGTGCCGGGC GCAGTATTCC GCTGAAGAAA
AACGTTGATG ACGCGCTGAA AAACCCGAAC GTCACCAGCG TAGAGCATGT GGTGGTACTG
AAGCGTACTG GCGGGAAAAT TGACTGGCAG GAAGGGCGCG ACCTGTGGTG GCACGACCTG
GTTGAGCAAG CGAGCGATCA GCACCAGGCG GAAGAGATGA ACGCCGAAGA TCCGCTGTTT
ATTCTCTACA CCTCCGGTTC TACCGGTAAG CCAAAAGGTG TGCTGCATAC TACCGGCGGT
TATCTGGTGT ACGCGGCGCT GACCTTTAAA TATGTCTTTG ATTATCATCC GGGTGATATC
TACTGGTGCA CCGCCGATGT GGGCTGGGTG ACCGGACACA GTTACTTGCT GTACGGCCCG
CTGGCCTGCG GTGCGACCAC GCTGATGTTT GAAGGCGTAC CCAACTGGCC GACGCCTGCC
CGTATGGCGC AGGTGGTGGA CAAGCATCAG GTCAATATTC TCTATACCGC ACCCACGGCG
ATCCGCGCGC TGATGGCGGA AGGCGATAAA GCGATCGAAG GCACCGACCG TTCGTCGCTG
CGCATTCTCG GTTCCGTGGG CGAGCCAATT AACCCGGAAG CGTGGGAGTG GTACTGGAAA
AAAATCGGCA ACGAGAAATG TCCGGTGGTC GATACCTGGT GGCAGACCGA AACCGGCGGT
TTCATGATCA CCCCGCTGCC TGGCGCTACC GAGCTGAAAG CCGGTTCGGC AACACGTCCG
TTCTTCGGCG TGCAACCGAC GCTGGTCGAT AACGAAGGTA ACCCGCTGGA GGGGGCCACC
GAAGGTAGCC TGGTAATCAC CGACTCCTGG CCGGGTCAGG CGCGTACGCT GTTTGGCGAT
CACGAACGTT TTGAACAGAC CTACTTCTCC ACCTTCAAAA ATATGTATTT CAGCGGCGAC
GGCGCGCGTC GCGATGAAGA TGGCTATTAC TGGATAACCG GGCGTGTGGA CGACGTGCTG
AACGTCTCCG GTCACCGTCT GGGGACGGCA GAGATTGAGT CGGCGCTGGT GGCGCATCCG
AAGATTGCCG AAGCCGCCGT AGTAGGTATT CCGCACAATA TTAAAGGTCA GGCGATCTAC
GCCTACGTCA CGCTTAATCA CGGGGAGGAA CCGTCACCAG AACTGTACGC AGAAGTCCGC
AACTGGGTGC GTAAAGAGAT TGGCCCGCTG GCGACGCCAG ACGTGCTGCA CTGGACCGAC
TCCCTGCCTA AAACCCGCTC CGGCAAAATT ATGCGCCGTA TTCTGCGCAA AATTGCGGCG
GGCGATACCA GCAACCTGGG CGATACCTCG ACGCTTGCCG ATCCTGGCGT AGTCGAGAAG
CTGCTTGAAG AGAAGCAGGC TATCGCGATG CCATCGTAA
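
The gene sequence is printed in blocks of ten bases, so the grouping has to be removed before any analysis. The sketch below shows one way to do that and to compute a GC percentage in Python. The helper names are hypothetical, and only the first line of the sequence is used as input. Run on the full sequence, the result can be compared with the GC content field in the Gene Information section.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above, used as a small input.
first_line = "ATGAGCCAAA TTCACAAACA CACCATTCCT GCCAACATCG CAGACCGTTG CCTGATAAAC"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```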
 
Protein sequence
MSQIHKHTIP ANIADRCLIN PQQYEAMYQQ SINVPDTFWG EQGKILDWIK PYQKVKNTSF 
APGNVSIKWY EDGTLNLAAN CLDRHLQENG DRTAIIWEGD DASQSKHISY KELHRDVCRF
ANTLLELGIK KGDVVAIYMP MVPEAAVAML ACARIGAVHS VIFGGFSPEA VAGRIIDSNS
RLVITSDEGV RAGRSIPLKK NVDDALKNPN VTSVEHVVVL KRTGGKIDWQ EGRDLWWHDL
VEQASDQHQA EEMNAEDPLF ILYTSGSTGK PKGVLHTTGG YLVYAALTFK YVFDYHPGDI
YWCTADVGWV TGHSYLLYGP LACGATTLMF EGVPNWPTPA RMAQVVDKHQ VNILYTAPTA
IRALMAEGDK AIEGTDRSSL RILGSVGEPI NPEAWEWYWK KIGNEKCPVV DTWWQTETGG
FMITPLPGAT ELKAGSATRP FFGVQPTLVD NEGNPLEGAT EGSLVITDSW PGQARTLFGD
HERFEQTYFS TFKNMYFSGD GARRDEDGYY WITGRVDDVL NVSGHRLGTA EIESALVAHP
KIAEAAVVGI PHNIKGQAIY AYVTLNHGEE PSPELYAEVR NWVRKEIGPL ATPDVLHWTD
SLPKTRSGKI MRRILRKIAA GDTSNLGDTS TLADPGVVEK LLEEKQAIAM PS
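
The protein sequence is the translation of the gene sequence under translation table 11. A minimal Python sketch of that translation follows. It assumes a plain codon lookup with no special handling of alternative start codons, which is sufficient here because the gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order. Translation table 11 assigns
# the same amino acids as the standard code (it differs in start codons).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons and last five codons of the gene sequence above.
print(translate("ATGAGCCAAA TTCACAAACA C"))  # MSQIHKH
print(translate("GCGATG CCATCGTAA"))         # AMPS
```

The two outputs match the first seven and last four residues of the protein sequence listed above.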