Gene EcSMS35_3719 details


Gene Information

Locus tag: EcSMS35_3719
Symbol: gntU
ID: 6143476
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3789533
End bp: 3790873
Gene Length: 1341 bp
Protein Length: 446 aa
Translation table: 11
GC content: 55%
IMG OID: 641618545
Product: low affinity gluconate transporter
Protein accession: YP_001745685
Protein GI: 170680129
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG2610] H+/gluconate symporter and related permeases
TIGRFAM ID: [TIGR00791] gluconate transporter
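
The length fields in the table above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive and that the CDS ends with a stop codon; the function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 3789533
END_BP = 3790873
REPORTED_GENE_LENGTH = 1341      # bp
REPORTED_PROTEIN_LENGTH = 446    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int, has_stop_codon: bool = True) -> int:
    """Number of residues encoded by a CDS of the given length."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length_bp // 3
    # The stop codon is counted in the gene length but encodes no residue.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length, protein_length(length))
```

Under these assumptions, 3790873 - 3789533 + 1 gives 1341 bp, which is 447 codons, or 446 residues once the stop codon is excluded.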


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 54
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACTACAT TAACGCTTGT TTTAACAGCA GTAGGGTCTG TTTTACTGCT GCTGTTTTTA 
GTCATGAAGG CGCGTATGCA CGCTTTCCTG GCTTTAATGG TGGTGTCTAT GGGGGCTGGC
CTTTTTTCCG GTATGCCGCT CGATAAAATC GCAGCGACGA TGGAAAAAGG GATGGGAGGC
ACCCTCGGCT TCCTGGCGGT GGTTGTCGCC CTGGGAGCCA TGTTTGGCAA GATCTTACAT
GAAACCGGCG CAGTCGATCA GATTGCCGTC AAAATGCTCA AATCCTTCGG TCACAGCCGC
GCGCATTATG CCATCGGCCT TGCGGGGCTG GTCTGTGCGC TGCCGTTGTT CTTTGAAGTG
GCGATTGTTC TGCTGATTAG CGTTGCTTTC TCAATGGCGC GCCACACCGG TACGAACCTG
GTGAAGCTGG TAATCCCATT ATTTGCAGGC GTGGCGGCCG CTGCTGCGTT CCTGGTGCCT
GGACCAGCGC CAATGCTGCT GGCATCGCAG ATGAACGCCG ACTTTGGCTG GATGATCCTG
ATTGGCCTGT GTGCGGCAAT TCCGGGAATG ATTATTGCCG GGCCGCTGTG GGGTAATTTC
ATCAGCCGCT ACGTGGAGCT GCATATTCCT GACGACATCA GCGAACCGCA TCTCGGCGAA
GGCAAAATGC CATCTTTCGG ATTCAGCCTG TCGCTGATCC TGTTGCCGCT GGTGCTGGTA
GGGCTGAAAA CCATTGCCGC GCGTTTTGTG CCAGAAGGCT CTACCGCTTA CGAATGGTTC
GAGTTTATTG GTCATCCGTT TACCGCGATT CTGGTTGCTT GTCTGGTAGC GATTTATGGC
CTGGCAATGC GTCAGGGCAT GCCGAAAGAT AAAGTGATGG AAATTTGCGG TCACGCGCTG
CAACCGGCGG GGATCATTCT GCTGGTGATT GGTGCGGGCG GCGTATTCAA ACAGGTGCTG
GTTGACTCTG GCGTAGGTCC GGCACTGGGC GAAGCGTTAA CCGGCATGGG CCTGCCGATT
GCCATCACCT GCTTCGTGCT GGCAGCTGCA GTGCGCATCA TTCAGGGTTC TGCCACCGTT
GCCTGTTTAA CGGCGGTGGG ACTGGTGATG CCGGTTATTG AACAACTGAA CTACTCCGGT
GCGCAAATGG CGGCGCTGTC GATTTGTATC GCCGGTGGTT CGATTGTTGT CAGCCACGTT
AACGACGCGG GTTTCTGGTT GTTCGGTAAA TTTACCGGCG CGACCGAAGC CGAAACGCTG
AAAACCTGGA CCATGATGGA AACCATACTC GGCACTGTTG GTGCCATCGT TGGGATGATT
GCGTTCCAGC TGTTGAGTTA A
 
Protein sequence
MTTLTLVLTA VGSVLLLLFL VMKARMHAFL ALMVVSMGAG LFSGMPLDKI AATMEKGMGG 
TLGFLAVVVA LGAMFGKILH ETGAVDQIAV KMLKSFGHSR AHYAIGLAGL VCALPLFFEV
AIVLLISVAF SMARHTGTNL VKLVIPLFAG VAAAAAFLVP GPAPMLLASQ MNADFGWMIL
IGLCAAIPGM IIAGPLWGNF ISRYVELHIP DDISEPHLGE GKMPSFGFSL SLILLPLVLV
GLKTIAARFV PEGSTAYEWF EFIGHPFTAI LVACLVAIYG LAMRQGMPKD KVMEICGHAL
QPAGIILLVI GAGGVFKQVL VDSGVGPALG EALTGMGLPI AITCFVLAAA VRIIQGSATV
ACLTAVGLVM PVIEQLNYSG AQMAALSICI AGGSIVVSHV NDAGFWLFGK FTGATEAETL
KTWTMMETIL GTVGAIVGMI AFQLLS
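
The protein sequence follows from the gene sequence through translation table 11, the bacterial code named in the gene record. The sketch below is a minimal illustration and not the pipeline that produced this record. It builds the codon table from the standard NCBI amino-acid string, which table 11 shares with the standard code, and treats a GTG, ATG, or TTG initiator as methionine. That start-codon set is a simplifying assumption covering only the common bacterial starts. Only the first 60 and last 21 nucleotides of the gene sequence are reproduced here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 uses the
# same amino-acid assignments as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial initiator codons are handled.
COMMON_START_CODONS = {"ATG", "GTG", "TTG"}

# Fragments copied from the gene sequence above (first 60 nt, last 21 nt).
FIRST_60_NT = "GTGACTACAT TAACGCTTGT TTTAACAGCA GTAGGGTCTG TTTTACTGCT GCTGTTTTTA"
LAST_21_NT = "GCGTTCCAGC TGTTGAGTTA A"


def clean(seq: str) -> str:
    """Remove the spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str, is_cds_start: bool = True) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        # An initiator codon is read as methionine whatever it normally encodes.
        if i == 0 and is_cds_start and codon in COMMON_START_CODONS:
            aa = "M"
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    print(translate(FIRST_60_NT))                      # start of the protein
    print(translate(LAST_21_NT, is_cds_start=False))   # end of the protein
```

The two fragments translate to MTTLTLVLTAVGSVLLLLFL and AFQLLS, matching the first 20 and last 6 residues of the protein sequence above. Note that the leading GTG codon yields M rather than V because it is the initiator.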