Gene ECH74115_4752 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4752 
SymbolgntU 
ID6968304 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4398055 
End bp4399395 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content56% 
IMG OID643388449 
Productlow affinity gluconate transporter 
Protein accessionYP_002272877 
Protein GI209397154 
COG category[E] Amino acid transport and metabolism
[G] Carbohydrate transport and metabolism 
COG ID[COG2610] H+/gluconate symporter and related permeases 
TIGRFAM ID[TIGR00791] gluconate transporter 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.920357 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones54 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTACAT TAACGCTTGT TTTAACAGCA GTAGGGTCTG TTTTACTGCT GCTGTTTTTA 
GTCATGAAGG CGCGTATGCA CGCTTTCCTG GCTTTAATGG TGGTGTCCAT GGGGGCTGGC
CTTTTTTCCG GTATGCCGCT CGATAAAATC GCAGCGACGA TGGAAAAAGG GATGGGAGGC
ACCCTCGGCT TCCTGGCGGT GGTTGTCGCC CTGGGAGCTA TGTTTGGCAA GATCTTACAT
GAAACCGGCG CAGTCGATCA GATTGCCGTC AAAATGCTTA AATCCTTCGG TCACAGCCGC
GCGCATTATG CCATCGGCCT TGCGGGGCTG GTCTGTGCGC TACCGCTGTT CTTTGAAGTG
GCGATTGTTC TGCTGATTAG CGTTGCTTTC TCAATGGCGC GCCACACCGG TACGAACCTG
GTGAAGCTGG TAATCCCATT ATTCGCAGGC GTGGCGGCCG CGGCGGCGTT CCTGGTGCCA
GGGCCAGCGC CAATGCTGCT GGCATCGCAG ATGAACGCCG ATTTTGGCTG GATGATCCTG
ATTGGCCTGT GTGCGGCAAT TCCGGGAATG ATTATTGCCG GGCCGCTGTG GGGTAACTTC
ATCAGCCGTT ACGTTGAGCT GCATATTCCT GACGACATCA GCGAACCGCA TCTCGGCGAA
GGCAAAATGC CATCCTTCGG ATTCAGCCTG TCGCTGATCC TGCTGCCGCT GGTGCTGGTA
GGGCTGAAAA CCATTGCCGC GCGTTTTGTG CCGGAAGGCT CTACCGCTTA CGAATGGTTC
GAGTTTATTG GTCATCCGTT TACCGCGATT CTGGTTGCTT GTCTGGTAGC GATTTATGGC
CTGGCAATGC GTCAGGGCAT GCCAAAAGAC AAAGTGATGG AGATTTGCGG TCACGCGCTG
CAACCGGCGG GGATCATTCT GCTGGTGATT GGTGCGGGCG GCGTGTTCAA ACAGGTGCTG
GTTGACTCTG GCGTAGGTCC GGCACTGGGC GAAGCGTTAA CCGGCATGGG CCTGCCGATT
GCCATCACCT GCTTCGTGCT GGCCGCTGCA GTGCGCATCA TTCAGGGTTC TGCAACCGTT
GCCTGTTTAA CGGCGGTGGG ACTGGTGATG CCGGTTATTG AACAACTGAA CTACTCCGGT
GCGCAAATGG CGGCGCTGTC GATTTGTATC GCCGGTGGTT CGATTGTTGT CAGCCACGTT
AACGACGCTG GTTTCTGGTT GTTCGGTAAA TTTACCGGCG CGACCGAAGC CGAAACGCTG
AAAACCTGGA CCATGATGGA AACCATCCTC GGCACTGTCG GTGCCATCGT TGGGATGATT
GCGTTCCAGC TGTTGAGTTA A
 
Protein sequence
MTTLTLVLTA VGSVLLLLFL VMKARMHAFL ALMVVSMGAG LFSGMPLDKI AATMEKGMGG 
TLGFLAVVVA LGAMFGKILH ETGAVDQIAV KMLKSFGHSR AHYAIGLAGL VCALPLFFEV
AIVLLISVAF SMARHTGTNL VKLVIPLFAG VAAAAAFLVP GPAPMLLASQ MNADFGWMIL
IGLCAAIPGM IIAGPLWGNF ISRYVELHIP DDISEPHLGE GKMPSFGFSL SLILLPLVLV
GLKTIAARFV PEGSTAYEWF EFIGHPFTAI LVACLVAIYG LAMRQGMPKD KVMEICGHAL
QPAGIILLVI GAGGVFKQVL VDSGVGPALG EALTGMGLPI AITCFVLAAA VRIIQGSATV
ACLTAVGLVM PVIEQLNYSG AQMAALSICI AGGSIVVSHV NDAGFWLFGK FTGATEAETL
KTWTMMETIL GTVGAIVGMI AFQLLS