Gene EcDH1_3949 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3949 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4255349 
End bp4256674 
Gene Length1326 bp 
Protein Length441 aa 
Translation table11 
GC content56% 
IMG OID 
ProductMATE efflux family protein 
Protein accessionACX41549 
Protein GI260451127 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones52 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCATTCC TCACTTCATC TGATAAAGCA CTCTGGCATC TCGCCTTACC CATGATTTTC 
TCCAATATCA CCGTTCCGTT GCTGGGACTG GTCGATACGG CGGTAATTGG TCATCTTGAT
AGCCCGGTTT ATTTGGGCGG CGTGGCGGTT GGTGCAACGG CGACCAGCTT TCTCTTTATG
CTGTTGCTGT TTTTACGCAT GAGCACCACC GGGCTGACTG CGCAGGCTTA TGGTGCCAAA
AATCCTCAGG CATTAGCCCG TACGCTGGTG CAACCGTTGC TGTTGGCGTT GGGGGCTGGG
GCGTTAATTG CGCTGCTGCG TACGCCGATT ATCGATCTGG CGCTGCATAT TGTTGGCGGT
AGTGAGGCAG TCCTGGAACA GGCGCGGCGC TTTCTTGAAA TCCGCTGGTT AAGCGCACCG
GCGTCGCTGG CGAATCTGGT ATTACTCGGT TGGTTACTCG GCGTGCAATA TGCCCGTGCG
CCAGTAATTT TGTTAGTGGT CGGCAATATC CTCAACATTG TGCTGGATGT CTGGCTGGTG
ATGGGGCTGC ATATGAACGT GCAGGGCGCG GCGCTGGCGA CGGTTATTGC GGAATATGCA
ACATTGCTGA TTGGTCTGCT AATGGTGCGT AAAATCCTCA AACTACGCGG AATTTCCGGC
GAAATGCTGA AAACTGCCTG GCGAGGAAAC TTCCGTCGCT TGCTGGCGCT TAACCGCGAT
ATCATGCTGC GTTCGCTGTT GTTGCAACTC TGTTTCGGCG CGATCACCGT ACTTGGCGCG
CGACTGGGGA GTGACATTAT CGCTGTTAAC GCGGTTCTGA TGACGCTACT CACCTTTACC
GCCTATGCGC TGGATGGTTT TGCCTACGCG GTTGAAGCGC ACTCCGGTCA GGCATACGGT
GCGCGCGACG GTAGCCAGTT GCTGGATGTC TGGCGGGCAG CGTGCCGCCA GTCGGGGATC
GTAGCGTTAC TGTTTTCGGT GGTTTATTTG CTGGCTGGGG AACACATCAT TGCGTTACTG
ACGTCGTTAA CCCAGATTCA GCAGCTGGCT GACCGCTATC TTATCTGGCA GGTGATTTTG
CCGGTGGTTG GCGTCTGGTG TTATCTGCTG GACGGCATGT TTATAGGCGC AACGCGTGCC
ACCGAAATGC GTAACAGTAT GGCGGTGGCC GCCGCAGGTT TTGCGCTGAC GCTCCTTACG
CTGCCGTGGC TGGGTAATCA TGCTTTGTGG CTGGCATTAA CCGTCTTTCT GGCGTTGCGC
GGGCTTTCTC TGGCGGCTAT CTGGCGGCGT CACTGGCGCA ATGGTACCTG GTTTGCCGCA
ACGTGA
 
Protein sequence
MAFLTSSDKA LWHLALPMIF SNITVPLLGL VDTAVIGHLD SPVYLGGVAV GATATSFLFM 
LLLFLRMSTT GLTAQAYGAK NPQALARTLV QPLLLALGAG ALIALLRTPI IDLALHIVGG
SEAVLEQARR FLEIRWLSAP ASLANLVLLG WLLGVQYARA PVILLVVGNI LNIVLDVWLV
MGLHMNVQGA ALATVIAEYA TLLIGLLMVR KILKLRGISG EMLKTAWRGN FRRLLALNRD
IMLRSLLLQL CFGAITVLGA RLGSDIIAVN AVLMTLLTFT AYALDGFAYA VEAHSGQAYG
ARDGSQLLDV WRAACRQSGI VALLFSVVYL LAGEHIIALL TSLTQIQQLA DRYLIWQVIL
PVVGVWCYLL DGMFIGATRA TEMRNSMAVA AAGFALTLLT LPWLGNHALW LALTVFLALR
GLSLAAIWRR HWRNGTWFAA T