Gene EcSMS35_4014 details


Gene Information

Locus tag: EcSMS35_4014
Symbol: setC
ID: 6142948
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 4093311
End bp: 4094495
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 46%
IMG OID: 641618839
Product: sugar efflux transporter C
Protein accession: YP_001745977
Protein GI: 170683995
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID: [TIGR00899] sugar efflux transporter
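
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a single untranslated stop codon. The variable names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information fields.
start_bp = 4093311
end_bp = 4094495

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon, which is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under these assumptions the computed values agree with the listed gene length of 1185 bp and protein length of 394 aa.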


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCAAAAAA CGGCTACTAC TCCATCAAAA ATACTTGATC TCACTGCCGC GGCATTTTTA 
CTTGTCGCCT TTCTGACGGG TATTGCGGGC GCTCTCCAGA CTCCCACCCT AAGTATATTC
CTCGCAGATG AACTGAAAGC CCGTCCTATA ATGGTAGGTT TTTTCTTCAC CGGTAGCGCT
ATTATGGGGA TTCTGGTCAG TCAATTTCTG GCAAGGCACT CCGATAAACA AGGCGACCGT
AAATTACTGA TCCTGCTATG TTGCTTATTT GGAGTGTTGG CCTGCACGCT TTTTGCGTGG
AATCGCAACT ACTTCATTCT CCTCTCTACG GGCGTACTTC TGAGTAGTTT TGCTTCCACT
GCAAACCCGC AAATGTTCGC CCTCGCCCGT GAACACGCCG ACAGAACAGG CCGTGAGACG
GTCATGTTCA GTACATTTTT ACGTGCTCAG ATCTCGCTTG CCTGGGTTAT CGGGCCACCG
CTCGCTTATG AACTGGCAAT GGGATTTAGT TTTAAAGTGA TGTATCTCAC CGCTGCCATC
GCATTTGTTG TTTGCGGACT GATAGTCTGG TTGTTTTTGC CATCAATACA AAGAAATATT
CCTGTCGTTA CCCAACCCGT AGAAATTTTA CCCTCCATCC ATAGAAAGCG GGATACACGG
CTACTTTTTG TAGTCTGTTC AATGATGTGG GCGGCGAATA ATCTCTACAT GATAAATATG
CCGCTATTTA TTATTGATGA ACTGCATCTA ACCGATAAAC TGGCTGGAGA AATGATTGGT
ATCGCTGCCG GTCTGGAAAT TCCGATGATG TTAATCGCAG GCTATTACAT GAAACGTATT
GGCAAGCGGC TATTAATGCT CATTGCTATC GTGAGTGGTA TGTGTTTTTA CGCCAGCGTA
CTCATGGCGA CGACTCCGGC AATTGAGCTG GAATTGCAAA TTCTAAATGC CATTTTCCTT
GGTATTCTCT GTGGTATCGG CATGCTTTAT TTTCAGGATC TGATGCCTGA AAAAATAGGC
TCTGCGACAA CGTTATATGC AAATACCTCA CGCGTCGGCT GGATTATCGC CGGCTCTGTT
GACGGAATTA TGGTTGAAAT CTGGAGCTAC CACGCGTTGT TCTGGCTGGC GATAGGGATG
TTGGGTATTG CGATGATTTG CCTGCTGTTT ATTAAAGATA TTTAG
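
The GC content field in the gene information can in principle be recomputed from the gene sequence. The sketch below shows one way to do that in Python, assuming GC content is defined as the fraction of G and C bases over all bases and that the spaces in the listing are only display grouping. The function name `gc_content` is hypothetical, and only the first line of the sequence is used as sample input.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First 60 bases of the gene sequence, copied from the listing above.
first_line = "ATGCAAAAAA CGGCTACTAC TCCATCAAAA ATACTTGATC TCACTGCCGC GGCATTTTTA"
print(f"{gc_content(first_line):.0%}")
```

The first 60 bases alone give 40%, so a short window can differ noticeably from the whole-gene figure. Reproducing the listed 46% would require passing the full 1185 bp sequence.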
 
Protein sequence
MQKTATTPSK ILDLTAAAFL LVAFLTGIAG ALQTPTLSIF LADELKARPI MVGFFFTGSA 
IMGILVSQFL ARHSDKQGDR KLLILLCCLF GVLACTLFAW NRNYFILLST GVLLSSFAST
ANPQMFALAR EHADRTGRET VMFSTFLRAQ ISLAWVIGPP LAYELAMGFS FKVMYLTAAI
AFVVCGLIVW LFLPSIQRNI PVVTQPVEIL PSIHRKRDTR LLFVVCSMMW AANNLYMINM
PLFIIDELHL TDKLAGEMIG IAAGLEIPMM LIAGYYMKRI GKRLLMLIAI VSGMCFYASV
LMATTPAIEL ELQILNAIFL GILCGIGMLY FQDLMPEKIG SATTLYANTS RVGWIIAGSV
DGIMVEIWSY HALFWLAIGM LGIAMICLLF IKDI
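
The protein sequence corresponds to a translation of the gene sequence in reading frame 1. The Python sketch below illustrates this on the first and last lines of the gene sequence. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the two differ in the permitted start codons, which this sketch does not model). The names `CODON_TABLE` and `translate` are illustrative only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        amino_acid = CODON_TABLE[bases[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the listing above.
gene_start = "ATGCAAAAAA CGGCTACTAC TCCATCAAAA ATACTTGATC TCACTGCCGC GGCATTTTTA"
gene_end = "TTGGGTATTG CGATGATTTG CCTGCTGTTT ATTAAAGATA TTTAG"

print(translate(gene_start))  # MQKTATTPSKILDLTAAAFL
print(translate(gene_end))    # LGIAMICLLFIKDI
```

Both results match the corresponding ends of the protein sequence above. The last line of the gene sequence begins at base 1141, so it is already in frame and can be translated on its own.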