Gene EcDH1_1952 details


Gene Information

Locus tag: EcDH1_1952
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 2106193
End bp: 2107407
Gene length: 1215 bp
Protein length: 404 aa
Translation table: 11
GC content: 45%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: ACX39609
Protein GI: 260449187
COG category:
COG ID:
TIGRFAM ID:
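
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a single terminal stop codon; the function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2106193, 2107407)
print(length, protein_length(length))  # prints "1215 404"
```

Both values agree with the gene length and protein length fields of this record.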


Plasmid Coverage information

Number of covering plasmid clones: 25
Plasmid unclonability p-value: 0.17424
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Number of covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAATC CCTATTTCCC TACCGCACTT GGGTTGTATT TTAATTACCT GGTGCATGGT 
ATGGGCGTCC TTTTGATGAG CCTGAATATG GCCTCGCTGG AGACACTTTG GCAGACTAAT
GCCGCGGGTG TCTCGATAGT TATCTCATCG CTGGGCATTG GTCGATTAAG TGTCTTGCTT
TTTGCAGGAT TATTATCCGA TCGCTTTGGT CGCCGCCCTT TTATCATGCT CGGGATGTGC
TGCTATATGG CCTTCTTTTT TGGCATCCTG CAGACCAATA ACATCATTAT CGCTTATGTT
TTTGGCTTTC TGGCGGGAAT GGCAAACAGT TTTCTCGATG CAGGCACTTA TCCCAGTTTG
ATGGAAGCTT TTCCACGCTC ACCTGGGACA GCCAATATTT TAATTAAAGC ATTTGTTTCC
AGCGGACAAT TTTTATTACC GCTAATCATT AGCCTGTTAG TGTGGGCTGA ACTGTGGTTC
GGTTGGTCCT TTATGATTGC TGCAGGCATT ATGTTTATTA ACGCTCTGTT TTTATACCGT
TGTACGTTCC CACCCCATCC GGGTCGTCGC TTACCTGTCA TAAAGAAAAC CACCAGCTCT
ACGGAACATC GCTGTTCAAT TATCGATTTA GCCAGTTATA CCTTATATGG CTATATCTCA
ATGGCAACGT TTTATCTGGT TAGCCAGTGG CTGGCACAGT ACGGACAATT TGTTGCAGGC
ATGTCATACA CTATGTCGAT CAAACTACTC AGTATCTACA CCGTGGGTTC GCTGCTTTGT
GTATTTATTA CCGCTCCACT CATTCGTAAT ACCGTTCGCC CAACAACATT ACTGATGCTG
TACACCTTTA TCTCATTTAT TGCTCTGTTT ACCGTCTGCC TGCATCCCAC ATTTTATGTG
GTGATAATAT TTGCTTTTGT CATTGGTTTT ACCTCTGCTG GAGGTGTTGT GCAAATTGGC
CTGACGTTAA TGGCTGAACG TTTCCCTTAC GCTAAAGGTA AAGCTACAGG GATCTATTAC
AGTGCGGGCA GTATTGCGAC CTTTACTATT CCGTTGATTA CGGCTCATCT GTCCCAAAGA
AGTATTGCCG ATATTATGTG GTTCGATACC GCCATCGCTG CCATCGGTTT TTTACTGGCA
CTGTTTATCG GCTTACGCAG CCGCAAAAAA ACGCGGCATC ACTCGCTAAA GGAAAATGTC
GCTCCGGGTG GGTAA
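
The gene sequence is displayed in groups of 10 bases, 60 per line, so the grouping has to be stripped before the sequence is used. The following is a minimal sketch of that cleanup, plus a GC fraction calculation that could be compared with the GC content field; the helper names are hypothetical.

```python
def clean_sequence(block: str) -> str:
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First and last lines of the gene sequence in this record, for illustration.
first_line = "ATGAAAAATC CCTATTTCCC TACCGCACTT GGGTTGTATT TTAATTACCT GGTGCATGGT"
last_line = "GCTCCGGGTG GGTAA"
print(len(clean_sequence(first_line)), len(clean_sequence(last_line)))  # prints "60 15"
```

With 20 full lines of 60 bases and a final line of 15, the cleaned sequence should come to 1215 bases, matching the gene length field.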
 
Protein sequence
MKNPYFPTAL GLYFNYLVHG MGVLLMSLNM ASLETLWQTN AAGVSIVISS LGIGRLSVLL 
FAGLLSDRFG RRPFIMLGMC CYMAFFFGIL QTNNIIIAYV FGFLAGMANS FLDAGTYPSL
MEAFPRSPGT ANILIKAFVS SGQFLLPLII SLLVWAELWF GWSFMIAAGI MFINALFLYR
CTFPPHPGRR LPVIKKTTSS TEHRCSIIDL ASYTLYGYIS MATFYLVSQW LAQYGQFVAG
MSYTMSIKLL SIYTVGSLLC VFITAPLIRN TVRPTTLLML YTFISFIALF TVCLHPTFYV
VIIFAFVIGF TSAGGVVQIG LTLMAERFPY AKGKATGIYY SAGSIATFTI PLITAHLSQR
SIADIMWFDT AIAAIGFLLA LFIGLRSRKK TRHHSLKENV APGG
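
The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, assuming the input is already stripped of spaces and read in frame from its first base. It uses the standard codon assignments, which translation table 11 shares (table 11 differs in the set of permitted start codons, which does not matter here because this gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in TCAG order for each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(seq: str) -> str:
    """Translate a cleaned coding sequence, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGAAAAATCCCTATTTCCCTACCGCACTT"))  # first 30 bases: MKNPYFPTAL
print(translate("GCTCCGGGTGGGTAA"))  # last 15 bases: APGG
```

The two outputs match the first ten and last four residues of the protein sequence in this record.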