Gene EcHS_A3635 details

Gene Information

Locus tag: EcHS_A3635
Symbol: gntU
ID: 5594795
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 3622757
End bp: 3624097
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 55%
IMG OID: 640922751
Product: low affinity gluconate transporter
Protein accession: YP_001460232
Protein GI: 157162914
COG category: [E] Amino acid transport and metabolism
              [G] Carbohydrate transport and metabolism
COG ID: [COG2610] H+/gluconate symporter and related permeases
TIGRFAM ID: [TIGR00791] gluconate transporter
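
The coordinate and length fields above are consistent with each other, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, since the record does not state its coordinate convention) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3622757
end_bp = 3624097
reported_gene_length = 1341      # bp
reported_protein_length = 446    # aa

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)  # prints: 1341 0 446
```

Both derived values agree with the reported Gene Length and Protein Length fields.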


Plasmid Coverage information

Num covering plasmid clones: 49
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGACTACAT TAACGCTTGT TTTAACAGCA GTAGGGTCTG TTTTACTGCT GCTGTTTTTA 
GTCATGAAGG CGCGTATGCA CGCTTTCCTG GCTTTAATGG TGGTGTCCAT GGGGGCTGGC
CTTTTTTCCG GTATGCCGCT CGATAAAATC GCAGCGACGA TGGAAAAAGG GATGGGAGGC
ACCCTCGGCT TCCTGGCGGT GGTTGTCGCC CTGGGAGCTA TGTTTGGCAA GATCTTACAT
GAAACCGGCG CAGTCGATCA GATTGCCGTC AAAATGCTCA AATCCTTCGG TCACAGCCGC
GCGCATTATG CCATCGGCCT TGCGGGGCTG GTCTGTGCGC TACCGCTGTT CTTTGAAGTG
GCGATTGTTC TGCTGATTAG CGTTGCTTTC TCAATGGCGC GCCACACCGG TACGAACCTG
GTGAAGCTGG TAATCCCATT ATTCGCAGGC GTGGCGGCAG CGGCGGCGTT CCTGGTGCCA
GGGCCAGCGC CAATGCTGCT GGCATCGCAG ATGAACGCCG ATTTTGGCTG GATGATCCTG
ATTGGCCTGT GTGCGGCAAT TCCGGGAATG ATTATTGCCG GGCCGCTGTG GGGTAATTTC
ATCAGCCGTT ACGTGGAGTT GCATATTCCT GACGACATCA GCGAACCGCA TCTCGGCGAA
GGCAAAATGC CGTCCTTCGG ATTCAGCCTG TCGCTGATCC TGTTGCCGCT GGTGCTGGTG
GGGCTGAAAA CCATTGCCGC GCGTTTTGTG CCAGAAGGCT CTACCGCTTA CGAATGGTTC
GAGTTTATTG GTCATCCGTT TACCGCGATT CTGGTTGCTT GTCTGGTGGC GATTTATGGC
CTGGCGATGC GTCAGGGCAT GCCAAAAGAC AAAGTGATGG AGATTTGCGG TCACGCGCTG
CAACCGGCGG GGATCATTCT GCTGGTGATT GGTGCGGGCG GCGTGTTCAA ACAGGTGCTG
GTTGACTCTG GCGTAGGTCC GGCACTGGGC GAAGCGTTAA CCGGCATGGG CCTGCCGATT
GCTATCACCT GCTTCGTGCT GGCCGCTGCA GTGCGCATCA TTCAGGGGTC TGCCACCGTA
GCCTGTTTAA CGGCGGTGGG ACTGGTGATG CCAGTCATTG AACAACTGAA CTACTCTGGT
GCGCAAATGG CGGCACTGTC GATTTGTATC GCTGGTGGTT CGATTGTTGT CAGCCACGTT
AACGACGCTG GTTTCTGGTT GTTCGGTAAA TTTACCGGCG CGACCGAAGC CGAAACGCTG
AAAACCTGGA CCATGATGGA AACCATTCTC GGCACTGTCG GTGCCATCGT TGGGATGATT
GCGTTCCAGC TGTTGAGTTA A
 
Protein sequence
MTTLTLVLTA VGSVLLLLFL VMKARMHAFL ALMVVSMGAG LFSGMPLDKI AATMEKGMGG 
TLGFLAVVVA LGAMFGKILH ETGAVDQIAV KMLKSFGHSR AHYAIGLAGL VCALPLFFEV
AIVLLISVAF SMARHTGTNL VKLVIPLFAG VAAAAAFLVP GPAPMLLASQ MNADFGWMIL
IGLCAAIPGM IIAGPLWGNF ISRYVELHIP DDISEPHLGE GKMPSFGFSL SLILLPLVLV
GLKTIAARFV PEGSTAYEWF EFIGHPFTAI LVACLVAIYG LAMRQGMPKD KVMEICGHAL
QPAGIILLVI GAGGVFKQVL VDSGVGPALG EALTGMGLPI AITCFVLAAA VRIIQGSATV
ACLTAVGLVM PVIEQLNYSG AQMAALSICI AGGSIVVSHV NDAGFWLFGK FTGATEAETL
KTWTMMETIL GTVGAIVGMI AFQLLS
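
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It uses only the first and last rows of the gene sequence shown above, not the whole gene. It assumes that translation table 11 shares the standard codon-to-amino-acid assignments and that an alternative initiation codon such as GTG is read as methionine at the first position, which is how NCBI describes the table. The `translate` function and its `at_start` parameter are hypothetical helpers written for this example, not part of any database tool.

```python
from itertools import product

# Standard codon assignments in TCAG order; table 11 is assumed to share them.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

# Assumption: initiation codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(dna, at_start=True):
    """Translate an in-frame DNA fragment, stopping at the first stop codon.

    When at_start is True, a recognized initiation codon in the first
    position is read as methionine (M).
    """
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for i in range(0, len(dna) - 2, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and at_start and codon in START_CODONS:
            aa = "M"
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence (bases 1 to 60).
head = "GTGACTACAT TAACGCTTGT TTTAACAGCA GTAGGGTCTG TTTTACTGCT GCTGTTTTTA"
# Last row of the gene sequence (bases 1321 to 1341), which is in frame.
tail = "GCGTTCCAGC TGTTGAGTTA A"

print(translate(head))                  # MTTLTLVLTAVGSVLLLLFL
print(translate(tail, at_start=False))  # AFQLLS
```

The two results match the first 20 and last 6 residues of the protein sequence listed above: the gene opens with the alternative start codon GTG and ends with the stop codon TAA.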