Gene EcDH1_3370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3370 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3615662 
End bp3617119 
Gene Length1458 bp 
Protein Length485 aa 
Translation table11 
GC content51% 
IMG OID 
Productaminoacyl-histidine dipeptidase 
Protein accessionACX40990 
Protein GI260450568 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones37 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCTGAAC TGTCTCAATT ATCTCCACAG CCGCTGTGGG ATATTTTTGC CAAAATCTGT 
TCTATTCCTC ACCCGTCCTA TCATGAAGAG CAACTCGCTG AATACATTGT TGGTTGGGCA
AAAGAGAAAG GTTTCCATGT CGAACGCGAT CAGGTAGGTA ATATCCTGAT TCGTAAACCT
GCTACCGCAG GTATGGAAAA TCGTAAACCG GTCGTCTTAC AGGCCCACCT CGATATGGTG
CCGCAGAAAA ATAACGACAC CGTGCATGAC TTCACGAAAG ATCCTATCCA GCCTTATATT
GATGGCGAAT GGGTTAAAGC GCGCGGCACC ACGCTGGGTG CGGATAACGG CATTGGTATG
GCCTCTGCGC TGGCGGTTCT GGCTGACGAA AACGTGGTTC ACGGCCCGCT GGAAGTGCTG
CTGACCATGA CCGAAGAAGC CGGTATGGAC GGTGCGTTCG GCTTACAGGG CAACTGGTTG
CAGGCTGATA TTCTGATTAA CACCGACTCC GAAGAAGAAG GTGAAATCTA CATGGGTTGT
GCGGGGGGTA TCGACTTCAC CTCCAACCTG CATTTAGATC GTGAAGCGGT TCCAGCTGGT
TTTGAAACCT TCAAGTTAAC CTTAAAAGGT CTGAAAGGCG GTCACTCCGG CGGGGAAATC
CACGTTGGGC TGGGTAATGC CAACAAACTG CTGGTGCGCT TCCTGGCGGG TCATGCGGAA
GAACTGGATC TGCGCCTTAT CGATTTCAAC GGCGGCACAC TGCGTAACGC CATCCCGCGT
GAAGCCTTTG CGACCATTGC TGTCGCAGCT GATAAAGTCG ACGTCCTGAA ATCTCTGGTG
AATACCTATC AGGAGATCCT GAAAAACGAG CTGGCAGAAA AAGAGAAAAA TCTGGCCTTG
TTGCTGGACT CTGTAGCGAA CGATAAAGCT GCCCTGATTG CGAAATCTCG CGATACCTTT
ATTCGTCTGC TGAACGCCAC CCCGAACGGT GTGATTCGTA ACTCCGATGT AGCCAAAGGT
GTGGTTGAAA CCTCCCTGAA CGTCGGTGTG GTGACCATGA CTGACAATAA CGTAGAAATT
CACTGCCTGA TCCGTTCACT GATCGACAGC GGTAAAGACT ACGTGGTGAG CATGCTGGAT
TCGCTGGGTA AACTGGCTGG CGCGAAAACC GAAGCGAAAG GCGCATATCC TGGCTGGCAG
CCGGACGCTA ATTCTCCGGT GATGCATCTG GTACGTGAAA CCTATCAGCG CCTGTTCAAC
AAGACGCCGA ACATCCAGAT TATCCACGCG GGCCTGGAAT GTGGTCTGTT CAAAAAACCG
TATCCGGAAA TGGACATGGT TTCTATCGGG CCAACTATCA CCGGTCCACA CTCTCCGGAT
GAGCAAGTTC ACATCGAAAG CGTAGGTCAT TACTGGACAC TGCTGACTGA ACTGCTGAAA
GAAATTCCGG CGAAGTAA
 
Protein sequence
MSELSQLSPQ PLWDIFAKIC SIPHPSYHEE QLAEYIVGWA KEKGFHVERD QVGNILIRKP 
ATAGMENRKP VVLQAHLDMV PQKNNDTVHD FTKDPIQPYI DGEWVKARGT TLGADNGIGM
ASALAVLADE NVVHGPLEVL LTMTEEAGMD GAFGLQGNWL QADILINTDS EEEGEIYMGC
AGGIDFTSNL HLDREAVPAG FETFKLTLKG LKGGHSGGEI HVGLGNANKL LVRFLAGHAE
ELDLRLIDFN GGTLRNAIPR EAFATIAVAA DKVDVLKSLV NTYQEILKNE LAEKEKNLAL
LLDSVANDKA ALIAKSRDTF IRLLNATPNG VIRNSDVAKG VVETSLNVGV VTMTDNNVEI
HCLIRSLIDS GKDYVVSMLD SLGKLAGAKT EAKGAYPGWQ PDANSPVMHL VRETYQRLFN
KTPNIQIIHA GLECGLFKKP YPEMDMVSIG PTITGPHSPD EQVHIESVGH YWTLLTELLK
EIPAK