Gene EcDH1_3032 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3032 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3253439 
End bp3255049 
Gene Length1611 bp 
Protein Length536 aa 
Translation table11 
GC content56% 
IMG OID 
Product2,3-dihydroxybenzoate-AMP ligase 
Protein accessionACX40660 
Protein GI260450238 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.14201 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCATTC CATTCACCCG CTGGCCGGAA GAGTTTGCCC GTCGCTATCG GGAAAAAGGC 
TACTGGCAGG ATTTGCCGCT GACCGACATT CTGACGCGAC ATGCTGCGAG TGACAGCATC
GCGGTTATCG ACGGCGAGCG ACAGTTGAGT TATCGGGAGC TGAATCAGGC GGCGGATAAC
CTCGCGTGTA GTTTACGCCG TCAGGGCATT AAACCTGGTG AAACCGCGCT GGTACAACTG
GGTAACGTCG CTGAATTGTA TATTACCTTT TTCGCGCTGC TGAAACTGGG CGTTGCGCCG
GTGCTGGCGT TGTTCAGCCA TCAGCGTAGT GAACTGAACG CCTATGCCAG CCAGATTGAA
CCCGCATTGC TGATTGCCGA TCGCCAACAT GCGCTGTTTA GCGGGGATGA TTTCCTCAAT
ACTTTCGTCA CAGAACATTC CTCCATTCGC GTGGTGCAAC TGCTCAACGA CAGCGGTGAG
CATAACTTGC AGGATGCGAT TAACCATCCG GCTGAGGATT TTACTGCCAC GCCATCACCT
GCTGATGAAG TGGCCTATTT CCAGCTTTCC GGTGGCACCA CCGGCACACC GAAACTGATC
CCGCGCACTC ATAACGACTA CTACTACAGC GTGCGTCGTA GCGTCGAGAT TTGTCAGTTC
ACACAACAGA CACGCTACCT GTGCGCGATC CCGGCGGCTC ATAACTACGC CATGAGTTCG
CCAGGATCGC TGGGCGTCTT TCTTGCCGGA GGAACGGTTG TTCTGGCGGC CGATCCCAGC
GCCACGCTCT GTTTCCCATT GATTGAAAAA CATCAGGTTA ACGTTACCGC GCTGGTGCCA
CCCGCAGTCA GCCTGTGGTT GCAGGCGCTG ATCGAAGGCG AAAGCCGGGC GCAGCTTGCC
TCGCTGAAAC TGTTACAGGT CGGCGGCGCA CGTCTTTCTG CCACCCTTGC GGCGCGTATT
CCCGCTGAGA TTGGCTGTCA GTTGCAGCAG GTGTTTGGCA TGGCGGAAGG GCTGGTGAAC
TACACCCGAC TTGATGATAG CGCGGAGAAA ATTATCCATA CCCAGGGTTA CCCAATGTGT
CCGGATGACG AAGTATGGGT TGCCGATGCC GAAGGAAATC CACTGCCGCA AGGGGAAGTC
GGACGCCTGA TGACGCGCGG GCCGTACACC TTCCGCGGCT ATTACAAAAG TCCACAGCAC
AATGCCAGCG CCTTTGATGC CAACGGTTTT TACTGTTCCG GCGATCTGAT CTCTATTGAT
CCAGAGGGTT ACATCACCGT GCAGGGGCGC GAGAAAGATC AGATTAACCG TGGCGGCGAG
AAGATCGCTG CCGAAGAGAT CGAAAACCTG CTGCTGCGCC ACCCGGCGGT GATCTACGCC
GCACTGGTGA GCATGGAAGA TGAGCTGATG GGCGAAAAAA GCTGCGCTTA TCTGGTGGTA
AAAGAGCCGC TGCGCGCGGT GCAGGTGCGT CGTTTCCTGC GTGAACAGGG TATTGCCGAA
TTTAAATTAC CGGATCGCGT GGAGTGTGTG GATTCACTTC CGCTGACGGC GGTCGGGAAA
GTCGATAAAA AACAATTACG TCAGTGGCTG GCGTCACGCG CATCAGCCTG A
 
Protein sequence
MSIPFTRWPE EFARRYREKG YWQDLPLTDI LTRHAASDSI AVIDGERQLS YRELNQAADN 
LACSLRRQGI KPGETALVQL GNVAELYITF FALLKLGVAP VLALFSHQRS ELNAYASQIE
PALLIADRQH ALFSGDDFLN TFVTEHSSIR VVQLLNDSGE HNLQDAINHP AEDFTATPSP
ADEVAYFQLS GGTTGTPKLI PRTHNDYYYS VRRSVEICQF TQQTRYLCAI PAAHNYAMSS
PGSLGVFLAG GTVVLAADPS ATLCFPLIEK HQVNVTALVP PAVSLWLQAL IEGESRAQLA
SLKLLQVGGA RLSATLAARI PAEIGCQLQQ VFGMAEGLVN YTRLDDSAEK IIHTQGYPMC
PDDEVWVADA EGNPLPQGEV GRLMTRGPYT FRGYYKSPQH NASAFDANGF YCSGDLISID
PEGYITVQGR EKDQINRGGE KIAAEEIENL LLRHPAVIYA ALVSMEDELM GEKSCAYLVV
KEPLRAVQVR RFLREQGIAE FKLPDRVECV DSLPLTAVGK VDKKQLRQWL ASRASA