Gene EcDH1_3656 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3656 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3941455 
End bp3942867 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content55% 
IMG OID 
Producttranscriptional regulator, GntR family with aminotransferase domain 
Protein accessionACX41268 
Protein GI260450846 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGCGTT ATCAACATCT GGCGACTCTG CTTGCCGAAC GGATTGAGCA AGGGCTGTAT 
CGTCACGGGG AGAAATTGCC GTCGGTGCGC AGCTTAAGTC AGGAGCACGG CGTCAGTATC
AGCACCGTGC AGCAGGCGTA TCAGACGCTG GAGACGATGA AGCTCATCAC TCCGCAGCCG
CGTTCGGGTT ATTTTGTCGC ACAACGTAAA GCCCAGCCGC CAGTACCGCC GATGACGCGT
CCGGTGCAGC GCCCGGTGGA AATTACCCAG TGGGATCAGG TGCTGGATAT GCTGAAGGCG
CATAGCGACA GTTCCATTGT TCCGTTAAGC AAAAGCACAC CGGATGTCGA AGCGCCCAGC
CTGAAACCGC TGTGGCGGGA GCTAAGCCGG GTGGTGCAGC ATAATCTGCA AACCGTTCTC
GGTTATGACT TGTTAGCCGG TCAGCGAGTA TTGCGAGAGC AGATTGCCCG CCTGATGCTC
GACAGCGGCT CGGTGGTCAC CGCCGATGAC ATCATCATCA CCAGCGGCTG CCATAACTCG
ATGTCGCTGG CGTTAATGGC AGTGTGTAAA CCGGGCGATA TTGTCGCGGT CGAATCCCCC
TGTTATTACG GTTCGATGCA GATGCTGCGC GGCATGGGCG TGAAAGTGAT TGAAATCCCA
ACCGATCCAG AAACTGGCAT CAGCGTTGAA GCGCTGGAAC TGGCGCTGGA ACAGTGGCCG
ATTAAAGGCA TCATTCTGGT GCCAAACTGT AATAATCCGC TGGGATTTAT TATGCCGGAC
GCGCGCAAAC GGGCCGTTCT CTCTCTCGCT CAGCGTCATG ATATTGTGAT TTTTGAAGAT
GATGTCTATG GCGAACTGGC GACGGAGTAT CCGCGCCCGC GGACCATTCA TTCCTGGGAT
ATCGACGGGC GAGTGCTGTT GTGCAGCTCG TTCAGTAAAA GTATTGCACC AGGCCTGCGC
GTGGGTTGGG TCGCACCGGG GCGTTATCAC GATAAACTGA TGCATATGAA ATACGCCATC
AGCAACTTTA ATGTGCCGTC CACGCAAATG GCGGCGGCAA CGTTTGTGCT GGAAGGTCAC
TATCATCGCC ATATCCGGCG GATGCGGCAG ATCTATCAGC GCAATTTGGC GCTTTATACC
TGCTGGATAC GGGAATATTT TCCCTGCGAA ATCTGTATTA CGCGCCCGAA AGGCGGATTT
TTACTGTGGA TAGAATTGCC TGAACAGGTC GATATGGTCT GCGTCGCGCG GCAGCTGTGC
CGCATGAAAA TCCAGGTGGC GGCAGGCTCG ATTTTCTCAG CTTCCGGCAA ATACCGTAAT
TGTCTGCGCA TCAACTGCGC TTTGCCGCTC AGCGAAACCT ATCGCGAAGC ACTGAAGCAA
ATTGGCGAGG CCGTGTATCG GGCAATGGAA TAA
 
Protein sequence
MTRYQHLATL LAERIEQGLY RHGEKLPSVR SLSQEHGVSI STVQQAYQTL ETMKLITPQP 
RSGYFVAQRK AQPPVPPMTR PVQRPVEITQ WDQVLDMLKA HSDSSIVPLS KSTPDVEAPS
LKPLWRELSR VVQHNLQTVL GYDLLAGQRV LREQIARLML DSGSVVTADD IIITSGCHNS
MSLALMAVCK PGDIVAVESP CYYGSMQMLR GMGVKVIEIP TDPETGISVE ALELALEQWP
IKGIILVPNC NNPLGFIMPD ARKRAVLSLA QRHDIVIFED DVYGELATEY PRPRTIHSWD
IDGRVLLCSS FSKSIAPGLR VGWVAPGRYH DKLMHMKYAI SNFNVPSTQM AAATFVLEGH
YHRHIRRMRQ IYQRNLALYT CWIREYFPCE ICITRPKGGF LLWIELPEQV DMVCVARQLC
RMKIQVAAGS IFSASGKYRN CLRINCALPL SETYREALKQ IGEAVYRAME