Gene EcDH1_0752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_0752 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp794085 
End bp795479 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content53% 
IMG OID 
Productsugar transporter 
Protein accessionACX38436 
Protein GI260448014 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.718529 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGACG CTAAAAAACA GGGGCGGTCA AACAAGGCAA TGACGTTTTT CGTCTGCTTC 
CTTGCCGCTC TGGCGGGATT ACTCTTTGGC CTGGATATCG GTGTAATTGC TGGCGCACTG
CCGTTTATTG CAGATGAATT CCAGATTACT TCGCACACGC AAGAATGGGT CGTAAGCTCC
ATGATGTTCG GTGCGGCAGT CGGTGCGGTG GGCAGCGGCT GGCTCTCCTT TAAACTCGGG
CGCAAAAAGA GCCTGATGAT CGGCGCAATT TTGTTTGTTG CCGGTTCGCT GTTCTCTGCG
GCTGCGCCAA ACGTTGAAGT ACTGATTCTT TCCCGCGTTC TACTGGGGCT GGCGGTGGGT
GTGGCCTCTT ATACCGCACC GCTGTACCTC TCTGAAATTG CGCCGGAAAA AATTCGTGGC
AGTATGATCT CGATGTATCA GTTGATGATC ACTATCGGGA TCCTCGGTGC TTATCTTTCT
GATACCGCCT TCAGCTACAC CGGTGCATGG CGCTGGATGC TGGGTGTGAT TATCATCCCG
GCAATTTTGC TGCTGATTGG TGTCTTCTTC CTGCCAGACA GCCCACGTTG GTTTGCCGCC
AAACGCCGTT TTGTTGATGC CGAACGCGTG CTGCTACGCC TGCGTGACAC CAGCGCGGAA
GCGAAACGCG AACTGGATGA AATCCGTGAA AGTTTGCAGG TTAAACAGAG TGGCTGGGCG
CTGTTTAAAG AGAACAGCAA CTTCCGCCGC GCGGTGTTCC TTGGCGTACT GTTGCAGGTA
ATGCAGCAAT TCACCGGGAT GAACGTCATC ATGTATTACG CGCCGAAAAT CTTCGAACTG
GCGGGTTATA CCAACACTAC CGAGCAAATG TGGGGGACCG TGATTGTCGG CCTGACCAAC
GTACTTGCCA CCTTTATCGC AATCGGCCTT GTTGACCGCT GGGGACGTAA ACCAACGCTA
ACGCTGGGCT TCCTGGCGAT GGCTGCTGGC ATGGGCGTAC TCGGTACAAT GATGCATATC
GGTATTCACT CTCCGTCGGC GCAGTATTTC GCCATCGCCA TGCTGCTGAT GTTTATTGTC
GGTTTTGCCA TGAGTGCCGG TCCGCTGATT TGGGTACTGT GCTCCGAAAT TCAGCCGCTG
AAAGGCCGCG ATTTTGGCAT CACCTGCTCC ACTGCCACCA ACTGGATTGC CAACATGATC
GTTGGCGCAA CGTTCCTGAC CATGCTCAAC ACGCTGGGTA ACGCCAACAC CTTCTGGGTG
TATGCGGCTC TGAACGTACT GTTTATCCTG CTGACATTGT GGCTGGTACC GGAAACCAAA
CACGTTTCGC TGGAACATAT TGAACGTAAT CTGATGAAAG GTCGTAAACT GCGCGAAATA
GGCGCTCACG ATTAA
 
Protein sequence
MPDAKKQGRS NKAMTFFVCF LAALAGLLFG LDIGVIAGAL PFIADEFQIT SHTQEWVVSS 
MMFGAAVGAV GSGWLSFKLG RKKSLMIGAI LFVAGSLFSA AAPNVEVLIL SRVLLGLAVG
VASYTAPLYL SEIAPEKIRG SMISMYQLMI TIGILGAYLS DTAFSYTGAW RWMLGVIIIP
AILLLIGVFF LPDSPRWFAA KRRFVDAERV LLRLRDTSAE AKRELDEIRE SLQVKQSGWA
LFKENSNFRR AVFLGVLLQV MQQFTGMNVI MYYAPKIFEL AGYTNTTEQM WGTVIVGLTN
VLATFIAIGL VDRWGRKPTL TLGFLAMAAG MGVLGTMMHI GIHSPSAQYF AIAMLLMFIV
GFAMSAGPLI WVLCSEIQPL KGRDFGITCS TATNWIANMI VGATFLTMLN TLGNANTFWV
YAALNVLFIL LTLWLVPETK HVSLEHIERN LMKGRKLREI GAHD