Gene EcDH1_2118 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2118 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2260260 
End bp2261450 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content51% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX39773 
Protein GI260449351 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.00060856 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACAA ACACTGTTTC CCGCAAAGTG GCGTGGCTAC GGGTCGTTAC GCTGGCAGTC 
GCCGCCTTCA TCTTCAACAC CACCGAATTT GTCCCTGTTG GCCTGCTCTC TGACATTGCG
CAAAGTTTTC ACATGCAAAC CGCTCAGGTC GGCATCATGT TGACCATTTA CGCATGGGTA
GTAGCGCTAA TGTCATTGCC TTTTATGTTA ATGACCAGTC AGGTTGAACG GCGCAAATTA
CTGATCTGCC TGTTTGTGGT GTTTATTGCC AGCCACGTAC TGTCGTTTTT GTCGTGGAGC
TTTACCGTTC TGGTGATCAG TCGCATTGGT GTGGCTTTTG CACATGCGAT TTTCTGGTCG
ATTACGGCGT CTCTGGCGAT CCGTATGGCT CCGGCCGGGA AGCGAGCACA GGCATTGAGT
TTAATTGCCA CCGGTACAGC ACTGGCGATG GTCTTAGGTT TACCTCTCGG GCGCATTGTG
GGCCAGTATT TCGGTTGGCG AATGACCTTC TTCGCGATTG GTATTGGGGC GCTTATCACC
CTTTTGTGCC TGATTAAGTT ACTTCCCTTA CTGCCCAGTG AGCATTCCGG TTCACTGAAA
AGCCTCCCGC TATTGTTCCG CCGCCCGGCA TTGATGAGCA TTTATTTGTT AACTGTGGTG
GTTGTCACCG CCCATTACAC GGCATACAGC TATATCGAGC CTTTTGTACA AAACATTGCG
GGATTCAGCG CCAACTTTGC CACGGCATTA CTGTTATTAC TCGGTGGTGC GGGCATTATT
GGCAGCGTGA TTTTCGGTAA ACTGGGTAAT CAGTATGCGT CTGCGTTGGT GAGTACGGCG
ATTGCGCTGT TGCTGGTGTG CCTGGCATTG CTGTTACCTG CGGCGAACAG TGAAATACAC
CTCGGGGTGC TGAGTATTTT CTGGGGGATC GCGATGATGA TCATCGGGCT TGGTATGCAG
GTTAAAGTGC TGGCGCTGGC ACCAGATGCT ACCGACGTCG CGATGGCGCT ATTCTCCGGC
ATATTTAATA TTGGAATCGG GGCGGGTGCG TTGGTAGGTA ATCAGGTGAG TTTGCACTGG
TCAATGTCGA TGATTGGTTA TGTGGGCGCG GTGCCTGCTT TTGCCGCGTT AATTTGGTCA
ATCATTATAT TTCGCCGCTG GCCAGTGACA CTCGAAGAAC AGACGCAATA G
 
Protein sequence
MTTNTVSRKV AWLRVVTLAV AAFIFNTTEF VPVGLLSDIA QSFHMQTAQV GIMLTIYAWV 
VALMSLPFML MTSQVERRKL LICLFVVFIA SHVLSFLSWS FTVLVISRIG VAFAHAIFWS
ITASLAIRMA PAGKRAQALS LIATGTALAM VLGLPLGRIV GQYFGWRMTF FAIGIGALIT
LLCLIKLLPL LPSEHSGSLK SLPLLFRRPA LMSIYLLTVV VVTAHYTAYS YIEPFVQNIA
GFSANFATAL LLLLGGAGII GSVIFGKLGN QYASALVSTA IALLLVCLAL LLPAANSEIH
LGVLSIFWGI AMMIIGLGMQ VKVLALAPDA TDVAMALFSG IFNIGIGAGA LVGNQVSLHW
SMSMIGYVGA VPAFAALIWS IIIFRRWPVT LEEQTQ