Gene EcDH1_3641 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3641 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3918307 
End bp3919668 
Gene Length1362 bp 
Protein Length453 aa 
Translation table11 
GC content52% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX41254 
Protein GI260450832 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones44 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGAAAAAG AAAATATCAC CATCGATCCG CGTTCTTCAT TTACTCCATC TTCGTCGGCA 
GATATTCCCG TGCCACCAGA TGGATTAGTT CAACGCAGTA CCCGAATTAA ACGCATTCAA
ACCACCGCCA TGTTGTTATT ATTTTTTGCG GCGGTAATCA ATTATCTCGA CCGCAGTTCG
CTGTCGGTAG CAAATTTAAC GATTCGTGAA GAATTGGGAT TAAGTGCCAC CGAAATCGGC
GCTTTGCTCT CCGTGTTTTC ACTCGCTTAC GGGATTGCGC AACTTCCTTG CGGCCCACTA
TTGGATCGTA AAGGCCCACG CCTGATGCTG GGACTGGGGA TGTTCTTCTG GTCACTGTTC
CAGGCAATGT CTGGCATGGT GCACAACTTT ACGCAGTTCG TGTTGGTGCG TATCGGTATG
GGGATTGGTG AAGCGCCGAT GAACCCATGC GGTGTAAAAG TCATTAACGA CTGGTTCAAC
ATCAAAGAGC GCGGACGCCC GATGGGCTTC TTCAACGCAG CTTCTACCAT TGGCGTTGCC
GTAAGCCCAC CGATTCTGGC GGCGATGATG CTGGTGATGG GCTGGCGCGG GATGTTTATT
ACCATTGGTG TACTGGGGAT TTTTCTCGCC ATCGGCTGGT ATATGCTCTA TCGCAACCGC
GAGCACGTAG AACTGACTGC CGTTGAGCAA GCTTATCTCA ATGCAGGTAG CGTCAATGCC
CGCCGAGATC CGCTCAGTTT TGCCGAATGG CGCAGCCTGT TCCGTAACCG TACAATGTGG
GGAATGATGC TCGGATTCAG TGGCATCAAC TACACTGCGT GGCTGTATCT GGCCTGGCTT
CCTGGTTACC TGCAAACAGC CTATAACCTG GATTTAAAAA GCACAGGGTT GATGGCGGCT
ATCCCTTTCC TGTTTGGGGC TGCCGGGATG CTGGTCAACG GTTACGTTAC CGACTGGCTG
GTCAAAGGGG GAATGGCTCC GATTAAAAGC CGTAAGATCT GCATTATTGC CGGGATGTTC
TGTTCTGCCG CCTTTACGCT GATAGTACCA CAAGCGACAA CATCCATGAC GGCGGTTCTG
CTGATTGGCA TGGCACTGTT CTGTATTCAC TTTGCCGGAA CATCCTGCTG GGGCTTGATC
CACGTCGCAG TTGCTTCTCG CATGACTGCG TCGGTGGGCA GTATCCAGAA CTTTGCCAGC
TTCATCTGCG CCTCTTTTGC GCCGATCATT ACTGGTTTTA TTGTTGATAC CACCCACTCA
TTCCGTCTGG CACTAATCAT CTGCGGTTGC GTCACCGCAG CGGGGGCACT GGCGTACATC
TTCCTGGTTC GTCAGCCGAT CAACGACCCA CGTAAAGATT AA
 
Protein sequence
MEKENITIDP RSSFTPSSSA DIPVPPDGLV QRSTRIKRIQ TTAMLLLFFA AVINYLDRSS 
LSVANLTIRE ELGLSATEIG ALLSVFSLAY GIAQLPCGPL LDRKGPRLML GLGMFFWSLF
QAMSGMVHNF TQFVLVRIGM GIGEAPMNPC GVKVINDWFN IKERGRPMGF FNAASTIGVA
VSPPILAAMM LVMGWRGMFI TIGVLGIFLA IGWYMLYRNR EHVELTAVEQ AYLNAGSVNA
RRDPLSFAEW RSLFRNRTMW GMMLGFSGIN YTAWLYLAWL PGYLQTAYNL DLKSTGLMAA
IPFLFGAAGM LVNGYVTDWL VKGGMAPIKS RKICIIAGMF CSAAFTLIVP QATTSMTAVL
LIGMALFCIH FAGTSCWGLI HVAVASRMTA SVGSIQNFAS FICASFAPII TGFIVDTTHS
FRLALIICGC VTAAGALAYI FLVRQPINDP RKD