Gene EcDH1_0917 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0917 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp985821 
End bp987230 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content53% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX38600 
Protein GI260448178 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGGGGC GTTGCCTTTT CGGCTTCTCA GGCGAGAAGC CGTTCTTATT ACCGGACAAT 
GAAGGGGTAA AGATGAACAC TTCACCGGTG CGAATGGATG ATTTACCGCT TAACCGTTTT
CACTGCCGCA TTGCTGCGCT CACTTTCGGC GCACACCTGA CCGACGGTTA TGTTCTCGGC
GTCATTGGTT ACGCCATTAT TCAGCTTACG CCCGCCATGC AACTGACGCC GTTTATGGCG
GGAATGATCG GCGGCTCGGC GCTCCTTGGT TTGTTCCTTG GCAGCCTGGT TCTTGGGTGG
ATCTCCGACC ATATTGGTCG GCAAAAAATC TTCACCTTCA GCTTTTTGCT GATTACGCTT
GCTTCGTTTT TACAATTTTT TGCCACCACG CCAGAGCATC TTATTGGACT GCGCATTTTG
ATTGGCATTG GTCTGGGAGG CGATTATTCA GTAGGTCACA CCTTGCTGGC TGAATTTTCC
CCGCGCCGCC ATCGCGGTAT TTTGCTGGGC GCATTCAGCG TGGTGTGGAC CGTAGGCTAT
GTGCTGGCAA GTATTGCCGG ACATCACTTT ATTTCCGAAA ACCCGGAGGC CTGGCGCTGG
CTACTGGCAT CGGCAGCTCT GCCCGCGTTG TTGATTACGT TATTACGCTG GGGAACGCCA
GAATCACCAC GCTGGCTACT GCGCCAGGGG CGTTTTGCAG AAGCTCACGC TATCGTGCAT
CGCTATTTTG GTCCCCATGT TTTACTGGGC GATGAAGTGG TAACGGCGAC CCATAAACAC
ATCAAAACCT TGTTCTCTTC GCGTTACTGG CGGCGCACGG CGTTTAACAG CGTCTTCTTT
GTCTGCCTCG TAATCCCATG GTTTGTGATT TATACCTGGC TGCCAACTAT CGCCCAGACT
ATTGGTCTGG AAGATGCGCT GACTGCCAGC CTGATGCTTA ATGCGTTGTT AATTGTGGGC
GCGCTGCTGG GATTAGTTCT GACGCACCTG CTGGCACATC GCAAATTTTT GCTGGGAAGT
TTTTTGCTGC TGGCGGCAAC GCTGGTAGTC ATGGCCTGTT TGCCTTCCGG CAGTTCATTA
ACGCTGCTGC TTTTTGTTCT CTTCAGCACC ACCATTTCGG CAGTCAGTAA TCTGGTGGGC
ATTTTGCCTG CGGAAAGTTT TCCTACTGAC ATTCGCTCGC TGGGCGTCGG TTTTGCCACT
GCCATGAGTC GACTTGGCGC GGCGGTAAGT ACTGGCCTGC TGCCGTGGGT GCTGGCGCAG
TGGGGAATGC AAGTCACCTT ATTGCTCCTG GCGACAGTGT TGTTGGTTGG TTTTGTTGTG
ACCTGGCTAT GGGCACCAGA AACTAAAGCC CTCCCGCTGG TGGCGGCGGG AAATGTAGGA
GGTGCGAATG AACATTCTGT TAGCGTTTAA
 
Protein sequence
MTGRCLFGFS GEKPFLLPDN EGVKMNTSPV RMDDLPLNRF HCRIAALTFG AHLTDGYVLG 
VIGYAIIQLT PAMQLTPFMA GMIGGSALLG LFLGSLVLGW ISDHIGRQKI FTFSFLLITL
ASFLQFFATT PEHLIGLRIL IGIGLGGDYS VGHTLLAEFS PRRHRGILLG AFSVVWTVGY
VLASIAGHHF ISENPEAWRW LLASAALPAL LITLLRWGTP ESPRWLLRQG RFAEAHAIVH
RYFGPHVLLG DEVVTATHKH IKTLFSSRYW RRTAFNSVFF VCLVIPWFVI YTWLPTIAQT
IGLEDALTAS LMLNALLIVG ALLGLVLTHL LAHRKFLLGS FLLLAATLVV MACLPSGSSL
TLLLFVLFST TISAVSNLVG ILPAESFPTD IRSLGVGFAT AMSRLGAAVS TGLLPWVLAQ
WGMQVTLLLL ATVLLVGFVV TWLWAPETKA LPLVAAGNVG GANEHSVSV