Gene EcSMS35_4746 details


Gene Information

Locus tag: EcSMS35_4746
Symbol: idnT
ID: 6144098
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4846113
End bp: 4847432
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 51%
IMG OID: 641619561
Product: Gnt-II system L-idonate transporter
Protein accession: YP_001746669
Protein GI: 170684304
COG category: [E] Amino acid transport and metabolism; [G] Carbohydrate transport and metabolism
COG ID: [COG2610] H+/gluconate symporter and related permeases
TIGRFAM ID: [TIGR00791] gluconate transporter
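
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the start and end coordinates are 1-based and inclusive and that the coding sequence ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
start_bp = 4846113
end_bp = 4847432

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")      # 1320 bp
print(protein_length, "aa")   # 439 aa
```

Under these assumptions the result agrees with the listed gene length of 1320 bp and protein length of 439 aa.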


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.618535
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 49
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATTAA TCATTATTGC GGCAGGCGTC GCGCTGCTTC TTATCCTGAT GATCGGCTTT 
AAAGTTAACG GCTTTATTGC CCTCGTTCTG GTCGCTGCCG TCGTCGGATT TGCCGAAGGG
ATGGATGCAC AGGCCGTCCT GCACTCTATA CAAAATGGTA TCGGCAGCAC GCTCGGCGGG
CTGGCAATGA TCCTCGGTTT CGGGGCCATG TTAGGCAAGC TGATTTCTGA TACGGGCGCG
GCACAACGTA TCGCCACTAC GCTGATTGCT ACTTTTGGTA AAAAACGCGT GCAATGGGCG
CTAGTGATCA CCGGTCTGGT TGTCGGCCTC GCCATGTTTT TTGAAGTGGG TTTTGTCCTG
CTGCTGCCGT TGGTATTTAC CATCGTGGCA TCATCTGGAT TACCCCTGTT GTATGTTGGC
GTACCGATGG TAGCAGCGCT CTCTGTAACC CACTGTTTTC TGCCGCCACA TCCAGGGCCT
ACTGCCATCG CGACTATCTT TGAGGCTAAT CTCGGAACGA CTCTACTGTA TGGATTTATC
ATTACCATTC CGACAGTTAT TGTCGCAGGA CCGCTGTTTT CTAAACTGCT AACTCGCTTT
GAGAAAGCAC CACCGGAAGG CTTATTTAAT CCTCATCTGT TTAGCGAAGA GGAGATGCCC
TCCTTCTGGA ACAGTATTTT CGCTGCAGTG ATCCCGGTCA TCCTGATGGC TATCGCCGCC
GTTTGTGAAA TTACGTTACC GAAAACTAAC ACCGTGCGCC TCTTCTTTGA ATTTGTCGGT
AACCCTGCCG TTGCGCTGTT TATTGCCATT GTTATTGCGA TTTTCACACT GGGCCGACGT
AATGGACGCA CCATCGAGCA AATCATGGAT ATCATTGGGG ATTCTATAGG CGCTATCGCG
ATGATTGTGT TTATTATCGC TGGCGGCGGC GCGTTTAAGC AGGTATTAGT AGATAGCGGT
GTCGGGCAGT ATATTTCACA CTTAATGACC GGAACTACAC TTTCGCCGTT ATTGATGTGC
TGGACTGTTG CGGCGCTGTT GCGTATCGCT CTGGGCTCTG CCACCGTCGC GGCCATTACC
ACCGCGGGTG TGGTGTTGCC GATTATCAAC GTTACCCATG CCGATCCCGC TTTAATGGTA
CTGGCAACTG GTGCGGGCAG CGTGATCGCG TCACACGTAA ACGACCCTGG CTTCTGGCTA
TTTAAAGGGT ATTTTAATCT GACGGTTGGT GAAACGTTGC GTACCTGGAC GGTGATGGAA
ACCCTTATTT CTATTATGGG TTTGCTGGGC GTGTTAGCCA TTAACGCCGT ATTGCACTGA
 
Protein sequence
MPLIIIAAGV ALLLILMIGF KVNGFIALVL VAAVVGFAEG MDAQAVLHSI QNGIGSTLGG 
LAMILGFGAM LGKLISDTGA AQRIATTLIA TFGKKRVQWA LVITGLVVGL AMFFEVGFVL
LLPLVFTIVA SSGLPLLYVG VPMVAALSVT HCFLPPHPGP TAIATIFEAN LGTTLLYGFI
ITIPTVIVAG PLFSKLLTRF EKAPPEGLFN PHLFSEEEMP SFWNSIFAAV IPVILMAIAA
VCEITLPKTN TVRLFFEFVG NPAVALFIAI VIAIFTLGRR NGRTIEQIMD IIGDSIGAIA
MIVFIIAGGG AFKQVLVDSG VGQYISHLMT GTTLSPLLMC WTVAALLRIA LGSATVAAIT
TAGVVLPIIN VTHADPALMV LATGAGSVIA SHVNDPGFWL FKGYFNLTVG ETLRTWTVME
TLISIMGLLG VLAINAVLH
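
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence codon by codon. The sketch below is one way to do that in plain Python, not the database's own method. It builds the codon-to-amino-acid mapping used by translation table 11 (its amino acid assignments are the same as the standard code) and translates only the first and last 60-base lines of the gene sequence. The helper name `translate` and the variable names are hypothetical.

```python
from itertools import product

# Amino acid assignments for translation table 11, with codons
# enumerated in T, C, A, G order at each position. '*' marks a stop.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
# Both are in frame because every full line holds 60 bases (20 codons).
first_line = "ATGCCATTAA TCATTATTGC GGCAGGCGTC GCGCTGCTTC TTATCCTGAT GATCGGCTTT"
last_line = "ACCCTTATTT CTATTATGGG TTTGCTGGGC GTGTTAGCCA TTAACGCCGT ATTGCACTGA"

print(translate(first_line))  # MPLIIIAAGVALLLILMIGF
print(translate(last_line))   # TLISIMGLLGVLAINAVLH
```

The two outputs match the first 20 and last 19 residues of the listed protein sequence. The final TGA codon is a stop, which is why the last line yields 19 residues rather than 20.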