Gene EcDH1_3658 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcDH1_3658 
Symbol 
ID
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli DH1 
KingdomBacteria 
Replicon accessionCP001637 
Strand
Start bp3944510 
End bp3945742 
Gene Length1233 bp 
Protein Length410 aa 
Translation table11 
GC content54% 
IMG OID 
Productmajor facilitator superfamily MFS_1 
Protein accessionACX41269 
Protein GI260450847 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones46 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCACGTT TTTTTACCCG CCATGCCGCC ACGCTGTTTT TCCCGATGGC GTTGATTTTG 
TATGACTTTG CTGCGTATCT GTCGACGGAT CTGATCCAGC CTGGGATCAT TAATGTGGTA
CGTGATTTTA ATGCCGATGT CAGTCTGGCC CCTGCTGCCG TCAGTCTCTA TCTTGCTGGC
GGTATGGCGT TACAGTGGCT GCTGGGGCCG CTTTCCGACA GAATTGGCCG CAGGCCGGTG
CTGATTACCG GGGCGCTAAT TTTTACCCTT GCCTGCGCCG CGACAATGTT CACAACGTCT
ATGACACAGT TTCTTATCGC GCGTGCAATT CAGGGCACCA GTATCTGTTT TATTGCCACC
GTTGGTTATG TCACGGTGCA GGAGGCGTTC GGACAGACAA AAGGGATCAA GTTGATGGCG
ATTATCACCT CCATCGTACT GATTGCGCCG ATTATCGGCC CGCTTTCCGG CGCAGCTCTG
ATGCACTTTA TGCACTGGAA AGTCCTTTTT GCCATCATTG CGGTTATGGG TTTTATCTCA
TTTGTTGGCT TACTGTTGGC GATGCCAGAG ACGGTGAAGC GCGGCGCGGT TCCGTTTAGC
GCCAAAAGCG TCTTGCGCGA TTTTCGTAAT GTCTTTTGCA ATCGGCTGTT CCTCTTTGGC
GCAGCAACCA TCTCTTTAAG CTATATCCCG ATGATGAGCT GGGTGGCTGT CTCGCCGGTG
ATCCTTATCG ATGCAGGCAG CTTAACAACT TCGCAGTTCG CCTGGACACA GGTTCCGGTG
TTCGGCGCGG TGATTGTTGC GAATGCCATC GTGGCGCGTT TTGTTAAAGA TCCGACCGAA
CCGCGGTTTA TCTGGCGTGC CGTACCCATT CAACTGGTCG GCCTCTCGCT GTTGATTGTC
GGCAATCTGC TGTCGCCGCA CGTCTGGCTG TGGTCGGTGC TGGGCACCAG TCTGTATGCT
TTCGGGATTG GTTTGATTTT CCCGACCTTA TTCCGCTTTA CGCTGTTTTC CAATAAGTTA
CCGAAAGGGA CCGTCTCCGC ATCGCTAAAT ATGGTGATCC TGATGGTGAT GTCGGTCTCG
GTCGAAATCG GCCGCTGGCT ATGGTTTAAC GGCGGTCGCT TGCCGTTTCA TCTGTTAGCC
GTTGTGGCGG GCGTTATCGT CGTTTTCACC CTGGCGGGAT TGCTCAATCG CGTGCGCCAG
CATCAGGCAG CCGAGCTAGT GGAGGAGCAG TGA
 
Protein sequence
MPRFFTRHAA TLFFPMALIL YDFAAYLSTD LIQPGIINVV RDFNADVSLA PAAVSLYLAG 
GMALQWLLGP LSDRIGRRPV LITGALIFTL ACAATMFTTS MTQFLIARAI QGTSICFIAT
VGYVTVQEAF GQTKGIKLMA IITSIVLIAP IIGPLSGAAL MHFMHWKVLF AIIAVMGFIS
FVGLLLAMPE TVKRGAVPFS AKSVLRDFRN VFCNRLFLFG AATISLSYIP MMSWVAVSPV
ILIDAGSLTT SQFAWTQVPV FGAVIVANAI VARFVKDPTE PRFIWRAVPI QLVGLSLLIV
GNLLSPHVWL WSVLGTSLYA FGIGLIFPTL FRFTLFSNKL PKGTVSASLN MVILMVMSVS
VEIGRWLWFN GGRLPFHLLA VVAGVIVVFT LAGLLNRVRQ HQAAELVEEQ