Gene EcDH1_2800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_2800 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp2996214 
End bp2997446 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content52% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX40433 
Protein GI260450011 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.245605 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAAAATA AATTAGCTTC CGGTGCCAGG CTTGGACGTC AGGCGTTACT TTTCCCTCTC 
TGTCTGGTGC TTTACGAATT TTCAACCTAT ATCGGCAACG ATATGATTCA ACCCGGTATG
TTGGCCGTGG TGGAACAATA TCAGGCGGGC ATTGATTGGG TTCCTACTTC GATGACCGCG
TATCTGGCGG GCGGGATGTT TTTACAATGG CTGCTGGGGC CGCTGTCGGA TCGTATTGGT
CGCCGTCCGG TGATGCTGGC GGGAGTGGTG TGGTTTATCG TCACCTGTCT GGCAATATTG
CTGGCGCAAA ACATTGAACA ATTCACCCTG TTGCGCTTCT TGCAGGGCAT AAGCCTCTGT
TTCATTGGCG CTGTGGGATA CGCCGCAATT CAGGAATCCT TCGAAGAGGC GGTTTGTATC
AAGATCACCG CGCTGATGGC GAACGTGGCG CTGATTGCTC CGCTACTTGG TCCGCTGGTG
GGCGCGGCGT GGATCCATGT GCTGCCCTGG GAGGGGATGT TTGTTTTGTT TGCCGCATTG
GCAGCGATCT CCTTTTTCGG TCTGCAACGA GCCATGCCTG AAACCGCCAC GCGTATAGGC
GAGAAACTGT CACTGAAAGA ACTCGGTCGT GACTATAAGC TGGTGCTGAA GAACGGCCGC
TTTGTGGCGG GGGCGCTGGC GCTGGGATTC GTTAGTCTGC CGTTGCTGGC GTGGATCGCC
CAGTCGCCGA TTATCATCAT TACCGGCGAG CAGTTGAGCA GCTATGAATA TGGCTTGCTG
CAAGTGCCTA TTTTCGGGGC GTTAATTGCG GGTAACTTGC TGTTAGCGCG TCTGACCTCG
CGCCGCACCG TACGTTCGCT GATTATTATG GGCGGCTGGC CGATTATGAT TGGTCTATTG
GTCGCTGCTG CGGCAACGGT TATCTCATCG CACGCGTATT TATGGATGAC TGCCGGGTTA
AGTATTTATG CTTTCGGTAT TGGTCTGGCG AATGCGGGAC TGGTGCGATT AACCCTGTTT
GCCAGCGATA TGAGTAAAGG TACGGTTTCT GCCGCGATGG GAATGCTGCA AATGCTGATC
TTTACCGTTG GTATTGAAAT CAGCAAACAT GCCTGGCTGA ACGGGGGCAA CGGACTGTTT
AATCTCTTCA ACCTTGTCAA CGGAATTTTG TGGCTGTCGC TGATGGTTAT CTTTTTAAAA
GATAAACAGA TGGGAAATTC TCACGAAGGG TAA
 
Protein sequence
MQNKLASGAR LGRQALLFPL CLVLYEFSTY IGNDMIQPGM LAVVEQYQAG IDWVPTSMTA 
YLAGGMFLQW LLGPLSDRIG RRPVMLAGVV WFIVTCLAIL LAQNIEQFTL LRFLQGISLC
FIGAVGYAAI QESFEEAVCI KITALMANVA LIAPLLGPLV GAAWIHVLPW EGMFVLFAAL
AAISFFGLQR AMPETATRIG EKLSLKELGR DYKLVLKNGR FVAGALALGF VSLPLLAWIA
QSPIIIITGE QLSSYEYGLL QVPIFGALIA GNLLLARLTS RRTVRSLIIM GGWPIMIGLL
VAAAATVISS HAYLWMTAGL SIYAFGIGLA NAGLVRLTLF ASDMSKGTVS AAMGMLQMLI
FTVGIEISKH AWLNGGNGLF NLFNLVNGIL WLSLMVIFLK DKQMGNSHEG