Gene EcDH1_4257 details


Gene Information

Locus tag: EcDH1_4257
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 4621679
End bp: 4622854
Gene Length: 1176 bp
Protein Length: 391 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: major facilitator superfamily MFS_1
Protein accession: ACX41855
Protein GI: 260451433
COG category:
COG ID:
TIGRFAM ID:
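
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end positions are 1-based and inclusive and that the reported protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 4621679
end_bp = 4622854
gene_length_bp = 1176
protein_length_aa = 391

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not counted as a residue.
codons = span // 3

print(span)        # 1176
print(codons - 1)  # 391

assert span == gene_length_bp
assert span % 3 == 0
assert codons - 1 == protein_length_aa
```

Under these assumptions the three fields agree with each other: 1176 bp is 392 codons, which is 391 residues plus one stop codon.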


Plasmid Coverage information

Num covering plasmid clones: 73
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCCCGCT TTTTGATTTG TAGTTTTGCC CTGGTTTTAC TTTATCCCGC CGGGATTGAT 
ATGTACCTCG TTGGTTTACC GCGCATCGCC GCCGATCTCA ATGCCAGCGA AGCGCAGTTG
CATATTGCGT TCTCCGTATA TCTGGCGGGG ATGGCAGCTG CGATGTTATT TGCCGGTAAA
GTGGCCGATC GTTCAGGGAG AAAGCCGGTC GCCATACCCG GCGCGGCGCT ATTTATTATT
GCCTCGGTGT TCTGTTCACT GGCTGAAACC AGCACGTTAT TTCTTGCAGG CCGATTTCTA
CAGGGGTTGG GCGCAGGCTG TTGTTACGTA GTGGCGTTCG CTATTTTGCG CGACACGCTG
GATGATCGAC GTCGGGCTAA AGTGCTGTCA TTACTCAACG GTATTACCTG CATCATTCCG
GTGTTAGCGC CAGTGCTCGG ACATCTGATT ATGCTTAAAT TCCCGTGGCA GAGTCTGTTC
TGGGCGATGG CAATGATGGG CATCGCGGTA CTGATGTTGT CTTTGTTTAT TTTAAAAGAA
ACGCGCCCAG CGGCCCCCGC AGCTTCGGAT AAACCACGAG AAAATAGCGA GTCGCTGCTT
AACCGTTTTT TCCTCAGCCG TGTTGTTATC ACCACCCTCA GCGTTTCGGT GATCCTCACT
TTCGTCAACA CGTCACCGGT ATTGCTGATG GAAATCATGG GGTTTGAGCG CGGTGAATAC
GCCACCATTA TGGCGCTGAC CGCTGGCGTC AGCATGACCG TTTCATTCTC CACGCCATTT
GCGCTGGGAA TTTTTAAGCC ACGTACGTTG ATGATCACCT CGCAGGTGTT ATTCCTGGCG
GCGGGGATCA CTCTTGCCGT TTCACCTTCC CATGCGGTTT CTCTGTTTGG TATCACGCTG
ATTTGCGCCG GTTTCTCGGT AGGTTTTGGT GTGGCGATGA GTCAGGCGTT AGGGCCGTTT
TCATTACGCG CGGGCGTAGC CAGCTCGACC TTAGGTATTG CGCAGGTTTG CGGTTCGTCA
CTGTGGATTT GGCTGGCAGC GGTGGTTGGT ATCGGCGCAT GGAATATGCT GATCGGGATT
CTGATTGCCT GTAGCATAGT GAGCCTGTTG CTGATTATGT TCGTCGCGCC TGGACGCCCC
GTTGCCGCTC ATGAAGAAAT CCATCACCAC GCTTGA
 
Protein sequence
MSRFLICSFA LVLLYPAGID MYLVGLPRIA ADLNASEAQL HIAFSVYLAG MAAAMLFAGK 
VADRSGRKPV AIPGAALFII ASVFCSLAET STLFLAGRFL QGLGAGCCYV VAFAILRDTL
DDRRRAKVLS LLNGITCIIP VLAPVLGHLI MLKFPWQSLF WAMAMMGIAV LMLSLFILKE
TRPAAPAASD KPRENSESLL NRFFLSRVVI TTLSVSVILT FVNTSPVLLM EIMGFERGEY
ATIMALTAGV SMTVSFSTPF ALGIFKPRTL MITSQVLFLA AGITLAVSPS HAVSLFGITL
ICAGFSVGFG VAMSQALGPF SLRAGVASST LGIAQVCGSS LWIWLAAVVG IGAWNMLIGI
LIACSIVSLL LIMFVAPGRP VAAHEEIHHH A
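
The gene and protein sequences above can be related with a short script. This is a minimal sketch rather than the pipeline that produced the record: the helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and the codon table uses the standard amino-acid assignments, which translation table 11 shares for internal codons. Alternative start codons are not handled, which does not matter here because this gene begins with ATG.

```python
# Standard amino-acid assignments, with codons ordered T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Drop the spaces and line breaks used to group the sequence for display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence, grouped as in the record.
first_30 = "ATGTCCCGCT TTTTGATTTG TAGTTTTGCC"
print(translate(first_30))    # MSRFLICSFA
print(gc_fraction(first_30))  # 0.4
```

The ten residues printed for the first 30 bases match the start of the protein sequence listed above. Passing the full gene sequence to the same functions would give the complete translation and the overall GC fraction to compare against the reported values.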