Gene EcDH1_0024 details


Gene Information

Locus tag: EcDH1_0024
Symbol:
ID:
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli DH1
Kingdom: Bacteria
Replicon accession: CP001637
Strand:
Start bp: 23646
End bp: 25361
Gene Length: 1716 bp
Protein Length: 571 aa
Translation table: 11
GC content: 53%
IMG OID:
Product: SSS sodium solute transporter superfamily
Protein accession: ACX37722
Protein GI: 260447300
COG category:
COG ID:
TIGRFAM ID:
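
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming it ends with one stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return length_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(23646, 25361)
print(length)                     # 1716
print(protein_length_aa(length))  # 571
```

Under these assumptions, both results agree with the Gene Length and Protein Length fields of the record.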


Plasmid Coverage information

Num covering plasmid clones: 65
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTCGT TACAAATCTT GAGTTTTGTC GGTTTTACGC TGCTGGTGGC GGTGATCACC 
TGGTGGAAGG TCCGCAAAAC AGATACCGGA TCGCAACAAG GCTATTTTCT TGCCGGACGT
TCACTAAAAG CGCCGGTTAT TGCCGCTTCG TTAATGCTAA CCAACCTTTC CACGGAACAA
CTGGTCGGCC TTTCCGGGCA GGCCTACAAA AGCGGCATGT CGGTGATGGG CTGGGAAGTG
ACTTCAGCGG TGACGCTGAT CTTCCTCGCG CTAATCTTTT TACCGCGCTA TCTGAAGCGC
GGCATTGCCA CCATCCCCGA TTTTCTGGAG GAACGTTATG ATAAAACGAC GCGTATTATC
ATCGACTTCT GCTTCCTAAT TGCCACCGGC GTCTGCTTTC TGCCGATTGT TCTCTACTCC
GGCGCGTTGG CGCTCAACAG CCTGTTTCAC GTCGGGGAAT CGCTACAGAT TTCTCACGGT
GCGGCTATCT GGCTACTGGT AATTTTGCTT GGTCTGGCGG GAATTTTGTA TGCGGTGATC
GGCGGACTGC GCGCAATGGC AGTGGCGGAC TCCATCAACG GTATTGGGCT GGTTATCGGC
GGGTTGATGG TGCCGGTATT TGGCCTAATC GCGATGGGCA AGGGCAGCTT TATGCAGGGC
ATTGAGCAAC TCACCACCGT TCACGCCGAG AAATTAAACT CAATCGGTGG CCCGACCGAT
CCCTTGCCGA TTGGCGCGGC ATTTACCGGT TTGATTCTGG TGAACACCTT TTACTGGTGT
ACAAATCAGG GCATCGTGCA ACGCACGCTG GCGTCAAAAA GCCTGGCGGA AGGGCAAAAG
GGGGCGCTGT TAACGGCGGT GCTGAAAATG CTCGACCCGC TGGTACTGGT GCTGCCAGGG
TTGATTGCGT TTCATCTGTA TCAGGATTTA CCGAAAGCCG ACATGGCCTA CCCGACGCTG
GTCAATAACG TTCTGCCAGT GCCAATGGTG GGTTTCTTCG GCGCGGTGTT ATTTGGTGCG
GTGATCAGTA CCTTCAACGG CTTTCTGAAT AGCGCCAGTA CGTTATTCAG TATGGGTATT
TACCGTCGCA TCATTAACCA GAATGCCGAG CCGCAGCAGC TGGTCACCGT CGGGCGCAAA
TTTGGTTTCT TTATCGCTAT CGTTTCGGTG CTGGTCGCGC CGTGGATCGC CAACGCGCCG
CAGGGGCTGT ATAGCTGGAT GAAACAGCTC AACGGCATTT ACAACGTGCC GCTGGTTACC
ATCATCATTA TGGGCTTTTT CTTCCCGCGC ATCCCGGCGC TGGCGGCAAA AGTAGCGATG
GGGATTGGCA TAATCAGCTA CATCACCATC AACTATCTGG TGAAGTTCGA CTTCCATTTC
CTCTATGTGC TGGCCTGTAC GTTCTGCATC AACGTGGTCG TGATGCTGGT GATCGGTTTT
ATCAAACCGC GCGCCACGCC GTTCACCTTC AAAGATGCGT TTGCGGTGGA CATGAAACCG
TGGAAAAACG TCAAGATCGC GTCAATTGGC ATCCTGTTCG CGATGATTGG CGTCTATGCC
GGGCTGGCTG AATTCGGCGG CTACGGTACG CGCTGGTTAG CGATGATCAG TTATTTCATT
GCCGCCGTAG TGATTGTCTA CCTGATTTTT GACAGCTGGC GGCATCGTCA CGACCCAGCC
GTAACCTTTA CTCCCGACGG GAAGGATAGC CTATGA
 
Protein sequence
MNSLQILSFV GFTLLVAVIT WWKVRKTDTG SQQGYFLAGR SLKAPVIAAS LMLTNLSTEQ 
LVGLSGQAYK SGMSVMGWEV TSAVTLIFLA LIFLPRYLKR GIATIPDFLE ERYDKTTRII
IDFCFLIATG VCFLPIVLYS GALALNSLFH VGESLQISHG AAIWLLVILL GLAGILYAVI
GGLRAMAVAD SINGIGLVIG GLMVPVFGLI AMGKGSFMQG IEQLTTVHAE KLNSIGGPTD
PLPIGAAFTG LILVNTFYWC TNQGIVQRTL ASKSLAEGQK GALLTAVLKM LDPLVLVLPG
LIAFHLYQDL PKADMAYPTL VNNVLPVPMV GFFGAVLFGA VISTFNGFLN SASTLFSMGI
YRRIINQNAE PQQLVTVGRK FGFFIAIVSV LVAPWIANAP QGLYSWMKQL NGIYNVPLVT
IIIMGFFFPR IPALAAKVAM GIGIISYITI NYLVKFDFHF LYVLACTFCI NVVVMLVIGF
IKPRATPFTF KDAFAVDMKP WKNVKIASIG ILFAMIGVYA GLAEFGGYGT RWLAMISYFI
AAVVIVYLIF DSWRHRHDPA VTFTPDGKDS L
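
The relationship between the gene sequence and the protein sequence can be sketched as a codon-by-codon translation. The example below is a minimal, dependency-free illustration and not the tool used to generate this record. It assumes the amino acid assignments of translation table 11 are the same as those of the standard genetic code (the two tables differ in which codons may act as start codons), and the helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last blocks of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
# Assumption: table 11 shares these amino acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in the record."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence above.
first_line = "ATGAATTCGT TACAAATCTT GAGTTTTGTC GGTTTTACGC TGCTGGTGGC GGTGATCACC"
last_line = "GTAACCTTTA CTCCCGACGG GAAGGATAGC CTATGA"

print(translate(first_line))  # MNSLQILSFVGFTLLVAVIT
print(translate(last_line))   # VTFTPDGKDSL
```

The two translated fragments match the beginning and the end of the protein sequence listed above. Passing the full gene sequence to `gc_percent` would give a value to compare against the GC content field.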