Gene EcDH1_1814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_1814 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp1965630 
End bp1967003 
Gene Length1374 bp 
Protein Length457 aa 
Translation table11 
GC content52% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX39473 
Protein GI260449051 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.0974065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCAAAAG TTCAGGCCGA CGGCCTGCCA TTGCCCCAGC GATACGGTGC GATATTAACC 
ATTGTGATTG GTATTTCGAT GGCCGTCCTT GACGGCGCAA TCGCCAACGT CGCCCTGCCA
ACAATCGCCA CGGACCTTCA TGCCACGCCA GCCAGTTCCA TCTGGGTAGT GAACGCCTAT
CAAATCGCCA TTGTCATCTC CCTGCTCTCG TTTTCGTTTC TGGGCGATAT GTTTGGCTAT
CGACGTATTT ATAAATGCGG TCTGGTCGTT TTTCTGTTGT CTTCACTGTT CTGCGCCCTT
TCTGATTCGC TGCAAATGCT CACCCTTGCG CGTGTCATAC AAGGTTTCGG CGGTGCAGCG
TTGATGAGCG TTAATACCGC ACTTATCCGC CTGATCTATC CACAACGTTT TCTGGGTAGA
GGGATGGGCA TAAACTCGTT TATTGTTGCC GTCTCTTCTG CTGCCGGGCC GACAATTGCT
GCAGCAATCC TCTCCATCGC ATCCTGGAAA TGGTTATTTT TAATCAACGT ACCGTTAGGT
ATTATCGCCC TGCTTCTGGC GATGCGTTTT CTGCCACCCA ATGGTTCTCG CGCCAGTAAA
CCCCGTTTCG ACCTGCCCAG CGCCGTGATG AACGCGTTAA CCTTCGGCCT GCTTATCACT
GCGTTGAGTG GTTTCGCTCA GGGGCAATCG CTGACGTTAA TTGCTGCGGA ACTGGTGGTA
ATGGTTGTTG TTGGTATTTT CTTTATTCGC CGCCAGCTTT CTCTTCCCGT ACCGCTGCTA
CCGGTGGATT TACTGCGTAT CCCGCTGTTT TCACTTTCTA TTTGCACATC TGTTTGCTCT
TTCTGCGCAC AAATGCTGGC AATGGTTTCC CTGCCCTTTT ACCTGCAAAC CGTGCTCGGG
CGTAGTGAAG TCGAAACAGG TTTACTTCTG ACACCGTGGC CGTTAGCAAC GATGGTGATG
GCTCCGCTGG CAGGCTATTT GATTGAACGC GTACATGCAG GATTGCTGGG GGCTTTAGGG
TTGTTCATCA TGGCTGCGGG GCTTTTTTCC CTGGTTCTGC TGCCCGCGTC ACCTGCGGAT
ATCAATATTA TCTGGCCGAT GATCTTATGT GGTGCTGGAT TTGGCTTATT CCAGTCACCC
AATAACCACA CCATTATTAC CTCCGCGCCT CGCGAACGTA GCGGTGGAGC CAGTGGCATG
TTAGGAACGG CTCGTCTACT GGGTCAGAGT AGCGGCGCGG CGCTGGTGGC GCTGATGCTA
AATCAGTTTG GAGATAATGG TACACACGTC TCGCTGATGG CTGCGGCTAT TCTGGCAGTG
ATTGCTGCCT GTGTCAGTGG TTTACGTATC ACTCAGCCAC GATCCAGGGC ATAA
 
Protein sequence
MPKVQADGLP LPQRYGAILT IVIGISMAVL DGAIANVALP TIATDLHATP ASSIWVVNAY 
QIAIVISLLS FSFLGDMFGY RRIYKCGLVV FLLSSLFCAL SDSLQMLTLA RVIQGFGGAA
LMSVNTALIR LIYPQRFLGR GMGINSFIVA VSSAAGPTIA AAILSIASWK WLFLINVPLG
IIALLLAMRF LPPNGSRASK PRFDLPSAVM NALTFGLLIT ALSGFAQGQS LTLIAAELVV
MVVVGIFFIR RQLSLPVPLL PVDLLRIPLF SLSICTSVCS FCAQMLAMVS LPFYLQTVLG
RSEVETGLLL TPWPLATMVM APLAGYLIER VHAGLLGALG LFIMAAGLFS LVLLPASPAD
INIIWPMILC GAGFGLFQSP NNHTIITSAP RERSGGASGM LGTARLLGQS SGAALVALML
NQFGDNGTHV SLMAAAILAV IAACVSGLRI TQPRSRA