Gene EcDH1_3528 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3528 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3792578 
End bp3793756 
Gene Length1179 bp 
Protein Length392 aa 
Translation table11 
GC content49% 
IMG OID 
Productsugar efflux transporter 
Protein accessionACX41143 
Protein GI260450721 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones38 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTGGA TAATGACGAT GGCTCGCCGT ATGAACGGTG TTTACGCGGC ATTTATGCTG 
GTCGCTTTTA TGATGGGGGT GGCCGGGGCG CTACAGGCTC CTACATTGAG CTTATTTCTG
AGTCGTGAGG TTGGCGCGCA ACCTTTCTGG ATCGGCCTCT TTTATACGGT GAATGCTATT
GCTGGGATCG GCGTAAGCCT CTGGTTGGCA AAACGTTCTG ACAGTCAGGG CGATCGGCGA
AAACTGATTA TATTTTGCTG TTTGATGGCT ATCGGCAATG CGCTATTGTT TGCATTTAAT
CGTCATTATC TGACGCTTAT CACCTGTGGT GTGCTTCTGG CATCTCTGGC CAATACGGCA
ATGCCACAGT TATTTGCTCT GGCGCGGGAA TATGCGGATA ACTCGGCGCG AGAAGTGGTG
ATGTTTAGCT CGGTGATGCG TGCGCAGCTT TCTCTGGCAT GGGTTATCGG TCCACCGTTG
GCCTTTATGC TGGCGTTGAA TTACGGCTTT ACGGTGATGT TTTCGATTGC CGCCGGGATA
TTCACACTCA GTCTGGTATT GATTGCATTT ATGCTTCCGT CTGTGGCGCG GGTAGAACTG
CCGTCGGAAA ATGCTTTATC AATGCAAGGT GGCTGGCAGG ATAGTAACGT ACGGATGTTA
TTTGTCGCCT CGACGTTAAT GTGGACCTGC AACACCATGT ACATTATTGA TATGCCGTTG
TGGATCAGTA GCGAGTTAGG ATTGCCAGAC AAACTGGCGG GTTTCCTGAT GGGGACGGCA
GCTGGACTGG AAATACCAGC AATGATTCTG GCTGGCTACT ATGTCAAACG TTATGGTAAG
CGGCGAATGA TGGTCATAGC AGTGGCGGCA GGAGTACTGT TTTACACCGG ATTGATTTTC
TTTAATAGCC GTATGGCGTT GATGACGCTG CAACTTTTTA ACGCTGTATT TATCGGCATT
GTTGCGGGTA TTGGGATGCT ATGGTTTCAG GATTTAATGC CTGGAAGAGC GGGGGCAGCT
ACCACCTTAT TTACTAACAG TATTTCTACC GGGGTAATTC TGGCTGGCGT TATTCAGGGA
GCAATTGCAC AAAGTTGGGG GCACTTTGCT GTCTACTGGG TAATTGCGGT TATTTCTGTT
GTCGCATTAT TTTTAACCGC AAAGGTTAAA GACGTTTGA
 
Protein sequence
MIWIMTMARR MNGVYAAFML VAFMMGVAGA LQAPTLSLFL SREVGAQPFW IGLFYTVNAI 
AGIGVSLWLA KRSDSQGDRR KLIIFCCLMA IGNALLFAFN RHYLTLITCG VLLASLANTA
MPQLFALARE YADNSAREVV MFSSVMRAQL SLAWVIGPPL AFMLALNYGF TVMFSIAAGI
FTLSLVLIAF MLPSVARVEL PSENALSMQG GWQDSNVRML FVASTLMWTC NTMYIIDMPL
WISSELGLPD KLAGFLMGTA AGLEIPAMIL AGYYVKRYGK RRMMVIAVAA GVLFYTGLIF
FNSRMALMTL QLFNAVFIGI VAGIGMLWFQ DLMPGRAGAA TTLFTNSIST GVILAGVIQG
AIAQSWGHFA VYWVIAVISV VALFLTAKVK DV