Gene EcDH1_3131 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3131 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3366259 
End bp3367479 
Gene Length1221 bp 
Protein Length406 aa 
Translation table11 
GC content52% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX40757 
Protein GI260450335 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCAATGA GTGAACAACC CCAGCCTGTG GCGGGCGCGG CTGCGTCAAC GACCAAGGCC 
CGAACATCGT TTGGTATTTT AGGTGCTATC AGCCTCTCAC ATCTGCTGAA CGACATGATC
CAATCGCTGA TTCTGGCGAT TTATCCGCTG CTTCAGTCAG AATTTTCTCT GACATTTATG
CAGATTGGCA TGATAACCCT CACCTTCCAG CTCGCCTCTT CGCTACTGCA ACCAGTGGTC
GGCTACTGGA CCGATAAATA TCCGATGCCA TGGTCGTTGC CAATTGGCAT GTGCTTTACC
TTAAGCGGTC TGGTGCTGCT TGCGCTGGCG GGCAGTTTTG GCGCAGTTCT GCTGGCGGCG
GCGCTGGTCG GTACCGGTTC ATCGGTCTTT CATCCGGAAT CTTCTCGCGT GGCCCGTATG
GCTTCCGGCG GGCGGCATGG CCTGGCGCAA TCTATCTTTC AGGTCGGCGG CAACTTTGGC
AGTTCCCTGG GACCCTTGCT GGCGGCGGTG ATTATCGCGC CTTATGGCAA AGGCAACGTT
GCCTGGTTTG TGCTTGCGGC ACTGCTGGCG ATCGTGGTGT TGGCGCAAAT CAGCCGTTGG
TACTCGGCAC AGCACCGAAT GAATAAAGGA AAACCCAAAG CGACGATTAT CAATCCACTG
CCGCGCAATA AAGTTGTACT GGCGGTCAGC ATTCTGTTAA TCCTCATTTT CTCGAAATAT
TTCTATATGG CGAGCATCAG CAGCTATTAC ACCTTTTATC TGATGCAAAA ATTCGGATTA
TCTATCCAGA ATGCTCAGCT TCATCTGTTT GCCTTCCTGT TTGCCGTTGC GGCAGGTACG
GTGATCGGCG GGCCTGTAGG GGATAAAATT GGGCGGAAAT ATGTGATTTG GGGCTCTATC
CTCGGCGTTG CGCCGTTTAC GCTGATTTTA CCCTACGCCA GCCTGCACTG GACGGGGGTT
TTAACGGTGA TTATTGGATT TATCCTCGCT TCGGCATTCT CTGCCATTCT GGTCTACGCT
CAGGAGCTGC TTCCAGGACG TATCGGTATG GTTTCTGGAC TCTTTTTCGG TTTTGCTTTT
GGCATGGGAG GTCTGGGAGC GGCAGTTCTG GGGCTTATCG CCGATCACAC CAGCATCGAG
TTAGTCTATA AAATCTGTGC TTTCCTGCCA CTATTGGGGA TGTTGACCAT ATTCCTGCCT
GATAACCGGC ATAAAGACTG A
 
Protein sequence
MAMSEQPQPV AGAAASTTKA RTSFGILGAI SLSHLLNDMI QSLILAIYPL LQSEFSLTFM 
QIGMITLTFQ LASSLLQPVV GYWTDKYPMP WSLPIGMCFT LSGLVLLALA GSFGAVLLAA
ALVGTGSSVF HPESSRVARM ASGGRHGLAQ SIFQVGGNFG SSLGPLLAAV IIAPYGKGNV
AWFVLAALLA IVVLAQISRW YSAQHRMNKG KPKATIINPL PRNKVVLAVS ILLILIFSKY
FYMASISSYY TFYLMQKFGL SIQNAQLHLF AFLFAVAAGT VIGGPVGDKI GRKYVIWGSI
LGVAPFTLIL PYASLHWTGV LTVIIGFILA SAFSAILVYA QELLPGRIGM VSGLFFGFAF
GMGGLGAAVL GLIADHTSIE LVYKICAFLP LLGMLTIFLP DNRHKD