Gene EcolC_1304 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1304 
Symbol 
ID6068551 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1431899 
End bp1433236 
Gene Length1338 bp 
Protein Length445 aa 
Translation table11 
GC content49% 
IMG OID641600725 
Productpermease DsdX 
Protein accessionYP_001724297 
Protein GI170019343 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG2610] H+/gluconate symporter and related permeases 
TIGRFAM ID[TIGR00791] gluconate transporter 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.478606 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCACTCTC AAATCTGGGT TGTGAGCACG CTGCTTATCA GCATCGTGTT AATTGTACTG 
ACCATCGTGA AGTTCAAATT CCACCCGTTT CTGGCGCTGT TGCTGGCCAG CTTCTTCGTG
GGAACGATGA TGGGCATGGG GCCACTGGAT ATGGTAAATG CTATTGAAAG TGGAATTGGC
GGAACGCTGG GGTTCCTCGC AGCGGTTATC GGCCTTGGCA CGATACTGGG AAAAATGATG
GAAGTATCCG GGGCCGCAGA AAGAATTGGT CTGACACTTC AACGCTGCCG CTGGCTTTCA
GTTGATGTCA TTATGGTGCT GGTTGGCCTG ATTTGTGGCA TCACGCTGTT TGTTGAAGTG
GGCGTCGTGC TATTGATTCC TCTGGCTTTT TCAATTGCCA AAAAAACCAA TACCTCATTA
TTAAAGCTTG CCATTCCGCT ATGTACCGCA TTGATGGCAG TGCACTGCGT GGTTCCTCCA
CATCCGGCTG CTTTATATGT TGCCAATAAG CTGGGCGCAG ATATCGGTTC GGTGATCGTC
TACGGTTTGC TGGTTGGGCT GATGGCATCA CTGATCGGTG GCCCACTTTT CCTTAAATTT
CTGGGTCAAC GACTGCCCTT TAAACCTGTA CCCACAGAGT TTGCAGATCT CAAAGTTCGC
GATGAAAAAA CACTACCGTC ATTAGGCGCA ACGTTATTCA CCATACTGCT ACCCATTGCG
CTGATGTTGG TTAAAACGAT TGCCGAATTG AATATGGCGC GTGAGAGTGG TTTGTATATC
TTGGTTGAGT TTATTGGCAA CCCTATCACT GCCATGTTTA TCGCCGTGTT TGTCGCCTAT
TATGTGTTGG GTATACGCCA GCATATGAGC ATGGGGACGA TGCTCACACA TACGGAAAAT
GGCTTCGGTT CTATTGCTAA TATTTTGCTG ATTATCGGGG CCGGAGGCGC ATTCAACGCC
ATTTTAAAAA GCAGCAGTCT CGCTGATACG CTGGCAGTTA TTCTCTCCAA TATGCATATG
CACCCGATTC TTCTGGCCTG GTTAGTGGCT CTTATTCTGC ATGCGGCAGT GGGCTCCGCT
ACCGTGGCAA TGATGGGGGC AACGGCAATT GTTGCACCCA TGCTGCCGCT GTATCCCGAC
ATCAGCCCGG AAATTATTGC GATTGCTATC GGTTCAGGTG CAATTGGCTG CACTATCGTT
ACGGACTCGC TTTTCTGGCT AGTGAAGCAA TATTGCGGCG CTACGCTCAA TGAAACATTT
AAATACTATA CGACAGCGAC ATTTATCGCT TCAGTCGTCG CTCTGGCGGG CACATTCCTG
CTGTCATTTA TCATCTAA
 
Protein sequence
MHSQIWVVST LLISIVLIVL TIVKFKFHPF LALLLASFFV GTMMGMGPLD MVNAIESGIG 
GTLGFLAAVI GLGTILGKMM EVSGAAERIG LTLQRCRWLS VDVIMVLVGL ICGITLFVEV
GVVLLIPLAF SIAKKTNTSL LKLAIPLCTA LMAVHCVVPP HPAALYVANK LGADIGSVIV
YGLLVGLMAS LIGGPLFLKF LGQRLPFKPV PTEFADLKVR DEKTLPSLGA TLFTILLPIA
LMLVKTIAEL NMARESGLYI LVEFIGNPIT AMFIAVFVAY YVLGIRQHMS MGTMLTHTEN
GFGSIANILL IIGAGGAFNA ILKSSSLADT LAVILSNMHM HPILLAWLVA LILHAAVGSA
TVAMMGATAI VAPMLPLYPD ISPEIIAIAI GSGAIGCTIV TDSLFWLVKQ YCGATLNETF
KYYTTATFIA SVVALAGTFL LSFII