Gene ECH74115_3308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3308 
SymbolsetB 
ID6968389 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3040154 
End bp3041335 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content53% 
IMG OID643387119 
Productsugar efflux transporter B 
Protein accessionYP_002271583 
Protein GI209398972 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00899] sugar efflux transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000181486 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones38 
Fosmid unclonability p-value0.0175789 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCATAACT CCCCCGCAGT CTCCAGCGCG AAATCGTTTG ACCTGACCTC GACGGCGTTT 
TTAATCGTTG CCTTTCTCAC CGGTATTGCG GGCGCTCTGC AAACCCCGAC ACTCAGTATT
TTTCTTACCG ATGAAGTACA TGCCCGTCCG GCGATGGTGG GATTCTTCTT TACCGGCAGC
GCTGTCATTG GGGTTCTGGT GAGTCAGTTT CTCGCCGGGC GCTCTGATAA GCGCGGCGAT
CGCAAATCGC TGATTGTCTT TTGCTGCCTG TTAGGCGTGC TGGCCTGCAC CCTTTTTGCC
TGGAATCGCA ACTACTTTGT TTTGCTATTC GTTGGCGTCT TTCTTAGCAG CTTTGGCTCG
ACCGCTAACC CGCAAATGTT TGCCCTTGCC CGTGAACATG CCGACAAAAC CGGACGTGAG
GCGGTGATGT TCAGCTCTTT TTTACGCGCT CAGGTTTCAC TGGCATGGGT CATTGGCCCA
CCGCTGGCTT ATGCCTTAGC GATGGGTTTC AGCTTTACGG TAATGTATCT GAGCGCAGCG
GTAGCATTTA TTGTTTGCGG CGTGATGGTG TGGCTGTTTT TACCGTCGAT GCAAAAAGAG
CTTCCGCTGG CGACCGGCAC GGTTGAAGCG CCGCGCCGTA ACCGTCGCGA TACGCTGCTG
CTGTTTGTCA TTTGTACATT GATGTGGGGC TCGAACAGCC TGTACATCAT CAACATGCCG
CTATTTATTA TCAACGAACT CCATCTTCCC GAGAAACTGG CCGGTGTGAT GATGGGGACC
GCCGCCGGGC TGGAAATCCC GACCATGTTG ATTGCCGGAT ATTTCGCCAA ACGTCTGGGT
AAGCGTTTCT TAATGCGCGT TGCTGCCGTG GGTGGCGTCT GTTTTTACGC AGGAATGCTG
ATGGCGCATT CTCCTGTCAT TCTGTTGGGC TTGCAGCTGC TAAATGCTAT TTTTATTGGC
ATTCTGGGCG GTATCGGGAT GCTCTATTTT CAGGATCTGA TGCCCGGTCA GGCAGGTTCA
GCCACCACGC TCTATACCAA CACGTCGCGC GTGGGCTGGA TCATCGCGGG ATCAGTGGCG
GGCATCGTCG CCGAGATCTG GAATTATCAC GCTGTGTTCT GGTTTGCGAT GGTGATGATT
ATCGCCACTC TGTTTTGCTT ACTGCGGATT AAAGATGTTT AA
 
Protein sequence
MHNSPAVSSA KSFDLTSTAF LIVAFLTGIA GALQTPTLSI FLTDEVHARP AMVGFFFTGS 
AVIGVLVSQF LAGRSDKRGD RKSLIVFCCL LGVLACTLFA WNRNYFVLLF VGVFLSSFGS
TANPQMFALA REHADKTGRE AVMFSSFLRA QVSLAWVIGP PLAYALAMGF SFTVMYLSAA
VAFIVCGVMV WLFLPSMQKE LPLATGTVEA PRRNRRDTLL LFVICTLMWG SNSLYIINMP
LFIINELHLP EKLAGVMMGT AAGLEIPTML IAGYFAKRLG KRFLMRVAAV GGVCFYAGML
MAHSPVILLG LQLLNAIFIG ILGGIGMLYF QDLMPGQAGS ATTLYTNTSR VGWIIAGSVA
GIVAEIWNYH AVFWFAMVMI IATLFCLLRI KDV