Gene EcSMS35_2319 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2319 
SymbolsetB 
ID6143288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2349521 
End bp2350702 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content53% 
IMG OID641617192 
Productsugar efflux transporter B 
Protein accessionYP_001744365 
Protein GI170680537 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00899] sugar efflux transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000297031 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value0.00120871 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCATAACT CCCCCGCAGT CTCCAGCGCG AAATCGTTTG ATCTGACCTC GACGGCGTTT 
TTAATCGTTG CCTTTCTCAC CGGTATTGCG GGCGCTTTGC AAACCCCGAC ACTCAGTATT
TTCCTTACCG ATGAAGTACA TGCCCGTCCG GCGATGGTGG GATTCTTCTT TACCGGCAGC
GCTGTCATTG GGATTCTGGT CAGTCAGTTT CTCGCCGGGC GCTCTGATAA GCGCGGCGAT
CGCAAATCGC TGATTGTCTT TTGCTGCCTG TTAGGCGTGC TGGCCTGCAC CCTTTTTGCC
TGGAATCGCA ACTACTTTGT TTTGTTATTC GTTGGCGTCT TTCTTAGCAG CTTTGGCTCG
ACCGCTAACC CGCAAATGTT TGCCCTTGCA CGTGAACATG CCGACAAAAC CGGACGTGAG
GCGGTGATGT TCAGCTCTTT TTTACGCGCT CAGGTTTCAC TGGCATGGGT CATTGGCCCG
CCGCTGGCTT ATGCCTTAGC GATGGGTTTC AGCTTTACGG TAATGTATCT GAGCGCAGCG
GTAGCGTTTA TTGTTTGCGG CGTGATGGTG TGGCTGTTTT TACCGTCGAT GCAAAAAGAG
CTTCCGCTGG CGACCGGCAC GGTTGAAGCG CCGCGCCGTA ACCGTCGCGA TACGCTGCTG
CTGTTTGTCA TTTGTACATT GATGTGGGGC TCGAACAGCC TGTACATCAT CAACATGCCG
CTATTTATTA TCAACGAACT GCATCTTCCC GAGAAACTGG CAGGCGTGAT GATGGGGACC
GCCGCCGGGC TGGAAATCCC GACCATGTTG ATTGCCGGAT ATTTTGCCAA ACGTCTGGGT
AAGCGTTTCT TAATGCGCGT TGCTGCCGTG GGTGGCGTCT GTTTTTACGC AGGAATGCTG
ATGGCGCATT CACCTGCCAT TCTGTTGGGC TTGCAGCTGC TAAATGCTAT TTTTATTGGC
ATTCTGGGCG GCATCGGGAT GCTTTATTTT CAGGATTTGA TGCCCGGTCA GGCAGGTTCA
GCCACCACGC TCTATACCAA CACGTCGCGC GTGGGCTGGA TCATCGCAGG ATCTGTGGCG
GGCATCGTCG CCGAGATCTG GAATTATCAC GCTGTGTTCT GGTTTGCGAT GGTGATGATT
ATCGCCACTC TGTTTTGCTT ACTGCGGATT AAAGATGTTT AA
 
Protein sequence
MHNSPAVSSA KSFDLTSTAF LIVAFLTGIA GALQTPTLSI FLTDEVHARP AMVGFFFTGS 
AVIGILVSQF LAGRSDKRGD RKSLIVFCCL LGVLACTLFA WNRNYFVLLF VGVFLSSFGS
TANPQMFALA REHADKTGRE AVMFSSFLRA QVSLAWVIGP PLAYALAMGF SFTVMYLSAA
VAFIVCGVMV WLFLPSMQKE LPLATGTVEA PRRNRRDTLL LFVICTLMWG SNSLYIINMP
LFIINELHLP EKLAGVMMGT AAGLEIPTML IAGYFAKRLG KRFLMRVAAV GGVCFYAGML
MAHSPAILLG LQLLNAIFIG ILGGIGMLYF QDLMPGQAGS ATTLYTNTSR VGWIIAGSVA
GIVAEIWNYH AVFWFAMVMI IATLFCLLRI KDV