Gene EcDH1_4109 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_4109 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp4447276 
End bp4448661 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content52% 
IMG OID 
Productsugar (Glycoside-Pentoside-Hexuronide) transporter 
Protein accessionACX41709 
Protein GI260451287 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones66 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTCACA TCACAACGGA AGATCCAGCA ACTTTACGCC TGCCCTTTAA AGAGAAACTC 
TCTTACGGTA TTGGCGACCT GGCCTCTAAC ATCCTGCTGG ATATCGGTAC GCTTTATCTT
TTGAAGTTTT ATACCGACGT TCTGGGGCTG CCAGGCACCT ATGGCGGCAT TATCTTTTTG
ATTTCAAAAT TCTTTACTGC GTTTACCGAT ATGGGTACCG GCATTATGTT GGATTCCCGA
CGCAAGATCG GTCCAAAAGG TAAGTTCCGT CCTTTTATTC TGTATGCGTC ATTCCCGGTC
ACCTTACTGG CGATCGCCAA CTTTGTCGGC ACACCGTTTG ATGTCACCGG TAAAACGGTG
ATGGCCACTA TTCTGTTTAT GCTTTACGGA CTGTTTTTCA GCATGATGAA CTGCTCCTAC
GGCGCAATGG TTCCCGCTAT CACCAAAAAC CCCAACGAAC GCGCCTCACT GGCGGCATGG
CGTCAGGGCG GCGCTACGCT GGGCCTGCTG CTGTGCACGG TGGGATTCGT GCCAGTTATG
AATCTTATCG AAGGTAATCA GCAACTTGGC TATATCTTCG CCGCCACGCT GTTTTCACTG
TTTGGCCTGC TGTTTATGTG GATCTGCTAC TCGGGCGTGA AAGAGCGTTA TGTCGAAACC
CAGCCTGCTA ATCCGGCGCA AAAGCCGGGC CTGCTGCAAT CTTTCCGCGC AATTGCGGGT
AACCGCCCGC TGTTCATTCT GTGCATTGCC AACCTCTGCA CTTTAGGGGC GTTTAACGTC
AAGCTCGCCA TCCAGGTCTA TTACACCCAG TACGTGCTTA ACGATCCCAT CCTGTTGTCG
TATATGGGAT TTTTCAGCAT GGGCTGTATT TTCATCGGCG TATTCCTGAT GCCTGCCTCA
GTCAGACGTT TTGGCAAGAA GAAAGTTTAT ATCGGCGGCC TGCTGATTTG GGTGCTGGGC
GATCTGCTCA ACTATTTCTT CGGCGGCGGT TCGGTCAGCT TCGTGGCGTT CTCCTGCCTG
GCGTTCTTTG GCTCAGCGTT TGTTAACAGC CTGAACTGGG CGCTGGTTTC CGACACCGTC
GAGTACGGCG AGTGGCGCAC CGGCGTGCGT TCGGAAGGAA CGGTCTACAC CGGTTTTACC
TTCTTTCGCA AAGTGTCTCA GGCGCTGGCT GGTTTCTTCC CCGGCTGGAT GCTGACGCAA
ATTGGCTATG TGCCGAACGT CGCACAGGCT GACCACACCA TTGAAGGGTT ACGCCAGTTG
ATCTTCATCT ACCCAAGCGC ACTGGCGGTA GTCACCATTG TGGCGATGGG TTGCTTCTAC
AGCCTGAACG AGAAGATGTA TGTCCGCATT GTGGAAGAGA TAGAAGCCCG TAAACGCACG
GCGTAA
 
Protein sequence
MSHITTEDPA TLRLPFKEKL SYGIGDLASN ILLDIGTLYL LKFYTDVLGL PGTYGGIIFL 
ISKFFTAFTD MGTGIMLDSR RKIGPKGKFR PFILYASFPV TLLAIANFVG TPFDVTGKTV
MATILFMLYG LFFSMMNCSY GAMVPAITKN PNERASLAAW RQGGATLGLL LCTVGFVPVM
NLIEGNQQLG YIFAATLFSL FGLLFMWICY SGVKERYVET QPANPAQKPG LLQSFRAIAG
NRPLFILCIA NLCTLGAFNV KLAIQVYYTQ YVLNDPILLS YMGFFSMGCI FIGVFLMPAS
VRRFGKKKVY IGGLLIWVLG DLLNYFFGGG SVSFVAFSCL AFFGSAFVNS LNWALVSDTV
EYGEWRTGVR SEGTVYTGFT FFRKVSQALA GFFPGWMLTQ IGYVPNVAQA DHTIEGLRQL
IFIYPSALAV VTIVAMGCFY SLNEKMYVRI VEEIEARKRT A