Gene EcDH1_1579 details


Gene Information

Locus tag: EcDH1_1579
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 1715177
End bp: 1716592
Gene Length: 1416 bp
Protein Length: 471 aa
Translation table: 11
GC content: 56%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: ACX39244
Protein GI: 260448822
COG category:
COG ID:
TIGRFAM ID:
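
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive (an assumption, although it does reproduce the listed gene length) and that the CDS ends in a single stop codon. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 1715177, 1716592
length = gene_length_bp(start_bp, end_bp)
print(length, protein_length_aa(length))  # prints "1416 471"
```

Both results agree with the Gene Length and Protein Length fields of the record.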


Plasmid Coverage information

Num covering plasmid clones: 26
Plasmid unclonability p-value: 0.383326
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACAGATC TTCCCGACAG CACCCGTTGG CAATTGTGGA TTGTGGCTTT CGGCTTCTTT 
ATGCAGTCGC TGGACACCAC CATCGTAAAC ACCGCCCTTC CCTCAATGGC GCAAAGCCTC
GGGGAAAGTC CGTTGCATAT GCACATGGTC ATTGTCTCTT ATGTGCTGAC CGTGGCGGTG
ATGCTGCCCG CCAGCGGCTG GCTGGCGGAC AAAGTCGGCG TGCGCAATAT TTTCTTTACC
GCCATCGTGC TGTTTACTCT CGGTTCACTG TTTTGCGCGC TTTCCGGCAC GCTGAACGAA
CTGTTGCTGG CACGCGCGTT ACAGGGCGTT GGCGGCGCGA TGATGGTGCC GGTCGGCAGA
TTGACGGTGA TGAAAATCGT ACCGCGCGAG CAATATATGG CGGCGATGAC CTTTGTCACG
TTACCCGGTC AGGTCGGTCC GCTGCTCGGT CCGGCGCTCG GCGGTCTGCT GGTGGAGTAC
GCATCGTGGC ACTGGATCTT TTTGATCAAC ATTCCGGTGG GGATTATCGG TGCGATCGCC
ACATTGCTGT TAATGCCGAA CTACACCATG CAGACGCGGC GCTTTGATCT CTCCGGATTT
TTATTGCTGG CGGTTGGCAT GGCGGTATTA ACCCTGGCGC TGGACGGCAG TAAAGGTACA
GGTTTATCGC CGCTGACGAT TGCAGGCCTG GTCGCAGTTG GCGTGGTGGC ACTGGTGCTT
TATCTGCTGC ACGCCAGAAA TAACAACCGT GCCCTGTTCA GTCTGAAACT GTTCCGTACT
CGTACCTTTT CGCTGGGCCT GGCGGGGAGC TTTGCCGGAC GTATTGGCAG TGGCATGTTG
CCCTTTATGA CACCGGTTTT CCTGCAAATT GGCCTCGGTT TCTCGCCGTT TCATGCCGGA
CTGATGATGA TCCCGATGGT GCTTGGCAGC ATGGGAATGA AGCGAATTGT GGTACAGGTG
GTGAATCGCT TTGGTTATCG TCGGGTACTG GTAGCGACCA CGCTGGGTCT GTCGCTGGTC
ACCCTGTTGT TTATGACTAC CGCCCTGCTG GGCTGGTACT ACGTTTTGCC GTTCGTCCTG
TTTTTACAAG GGATGGTCAA CTCGACGCGT TTCTCCTCCA TGAACACCCT GACGCTGAAA
GATCTCCCGG ACAATCTGGC GAGCAGCGGC AACAGCCTGC TGTCGATGAT TATGCAATTG
TCGATGAGTA TCGGCGTCAC TATCGCCGGG CTGTTGCTGG GACTTTTTGG TTCACAGCAT
GTCAGCGTCG ACAGCGGCAC CACACAAACC GTCTTTATGT ACACCTGGCT TAGCATGGCG
TTGATCATCG CCCTTCCGGC GTTCATCTTT GCCAGAGTGC CGAACGATAC GCATCAAAAT
GTAGCTATTT CGCGGCGAAA AAGGAGCGCG CAATGA
 
Protein sequence
MTDLPDSTRW QLWIVAFGFF MQSLDTTIVN TALPSMAQSL GESPLHMHMV IVSYVLTVAV 
MLPASGWLAD KVGVRNIFFT AIVLFTLGSL FCALSGTLNE LLLARALQGV GGAMMVPVGR
LTVMKIVPRE QYMAAMTFVT LPGQVGPLLG PALGGLLVEY ASWHWIFLIN IPVGIIGAIA
TLLLMPNYTM QTRRFDLSGF LLLAVGMAVL TLALDGSKGT GLSPLTIAGL VAVGVVALVL
YLLHARNNNR ALFSLKLFRT RTFSLGLAGS FAGRIGSGML PFMTPVFLQI GLGFSPFHAG
LMMIPMVLGS MGMKRIVVQV VNRFGYRRVL VATTLGLSLV TLLFMTTALL GWYYVLPFVL
FLQGMVNSTR FSSMNTLTLK DLPDNLASSG NSLLSMIMQL SMSIGVTIAG LLLGLFGSQH
VSVDSGTTQT VFMYTWLSMA LIIALPAFIF ARVPNDTHQN VAISRRKRSA Q
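
The gene and protein sequences above are printed in blocks of ten characters, so they need the whitespace stripped before any analysis. The sketch below shows one way to do that, compute a GC percentage, and translate the nucleotides. It is an illustration only, not the method the source database used. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted). All function names are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
fragment = clean_sequence("ATGACAGATC TTCCCGACAG CACCCGTTGG")
print(translate(fragment))  # prints "MTDLPDSTRW"
```

The translated fragment matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `gc_percent` and `translate` gives values that can be compared against the GC content and protein sequence fields.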