Gene EcDH1_3153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3153 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3392338 
End bp3393888 
Gene Length1551 bp 
Protein Length516 aa 
Translation table11 
GC content48% 
IMG OID 
ProductEAL domain protein 
Protein accessionACX40779 
Protein GI260450357 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.516748 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAACAC GACATCTGGT CGGCCTTATT TCGGGAGTAC TGATTCTTTC AGTATTGCTG 
CCTGTCGGCT TAAGCATCTG GCTGGCCCAT CAGCAGGTAG AAACATCGTT TATTGAAGAG
CTGGATACCT ATTCCTCCCG CGTCGCTATT CGAGCCAATA AGGTGGCGAC ACAAGGGAAA
GATGCGCTGC AGGAGCTGGA AAGATGGCAA GGCGCTGCCT GTAGCGAAGC CCATCTCATG
GAAATGCGTC GGGTATCTTA CAGTTATCGC TATATTCAGG AAGTGGCTTA TATCGATAAC
AACGTTCCCC AGTGTTCGTC TCTGGAGCAT GAAAGTCCGC CCGATACCTT CCCCGAGCCA
GGTAAAATTT CGAAAGATGG TTATCGTGTC TGGTTAACAT CGCATAACGA TTTAGGCATT
ATCCGTTACA TGGTCGCCAT GGGAACGGCA CATTATGTCG TCATGATCGA CCCCGCTTCC
TTTATTGATG TCATTCCCTA TAGCTCATGG CAAATTGATG CCGCCATTAT TGGCAATGCC
CATAACGTTG TCATAACCAG CAGCGATGAA ATTGCTCAGG GAATTATTAC CAGGCTACAA
AAAACACCCG GTGAGCATAT CGAAAATAAT GGAATCATTT ACGATATCCT GCCCTTACCG
GAGATGAATA TTTCGATCAT CACATGGGCT TCAACGAAAA TGTTGCAGAA AGGCTGGCAT
CGGCAAGTCT TTATTTGGTT ACCGCTCGGG TTGGTGATTG GCCTGCTGGC AGCGATGTTT
GTGCTGCGTA TTTTGCGCCG TATTCAGTCA CCGCATCATC GGCTGCAGGA TGCTATCGAA
AATCGTGATA TTTGCGTGCA CTATCAGCCG ATTGTCTCCT TAGCCAATGG CAAAATTGTC
GGTGCTGAGG CACTGGCGCG CTGGCCGCAG ACAGACGGTA GTTGGTTGTC ACCAGATAGT
TTTATTCCGC TGGCACAGCA AACGGGCCTT TCTGAGCCAT TGACGCTACT GATTATAAGA
AGCGTCTTTG AAGATATGGG CGACTGGCTG CGTCAGCATC CACAGCAGCA TATTTCGATC
AATCTTGAAT CCCCCGTGCT CACCTCGGAA AAAATCCCGC AATTGCTGCG TGACATGATC
AATCACTATC AGGTTAATCC CAGACAGATC GCGCTTGAAC TCACTGAACG CGAGTTTGCC
GATCCGAAAA CCAGCGCCCC GATAATTTCT CGCTACCGGG AGGCGGGCCA TGAAATTTAT
CTTGATGATT TTGGTACGGG GTATTCAAGT TTAAGTTATT TACAGGATCT GGATGTCGAC
ATTCTGAAGA TCGATAAATC TTTCGTTGAT GCGCTGGAAT ATAAAAATGT CACGCCGCAT
ATCATCGAAA TGGCAAAAAC ACTGAAACTG AAAATGGTAG CGGAGGGAAT CGAAACCAGT
AAACAAGAAG AGTGGTTACG CCAGCATGGC GTGCACTACG GTCAGGGCTG GCTCTACAGC
AAGGCATTAC CGAAAGAAGA TTTCTTACGC TGGGCCGAGC AACATTTGTG A
 
Protein sequence
MRTRHLVGLI SGVLILSVLL PVGLSIWLAH QQVETSFIEE LDTYSSRVAI RANKVATQGK 
DALQELERWQ GAACSEAHLM EMRRVSYSYR YIQEVAYIDN NVPQCSSLEH ESPPDTFPEP
GKISKDGYRV WLTSHNDLGI IRYMVAMGTA HYVVMIDPAS FIDVIPYSSW QIDAAIIGNA
HNVVITSSDE IAQGIITRLQ KTPGEHIENN GIIYDILPLP EMNISIITWA STKMLQKGWH
RQVFIWLPLG LVIGLLAAMF VLRILRRIQS PHHRLQDAIE NRDICVHYQP IVSLANGKIV
GAEALARWPQ TDGSWLSPDS FIPLAQQTGL SEPLTLLIIR SVFEDMGDWL RQHPQQHISI
NLESPVLTSE KIPQLLRDMI NHYQVNPRQI ALELTEREFA DPKTSAPIIS RYREAGHEIY
LDDFGTGYSS LSYLQDLDVD ILKIDKSFVD ALEYKNVTPH IIEMAKTLKL KMVAEGIETS
KQEEWLRQHG VHYGQGWLYS KALPKEDFLR WAEQHL