Gene EcHS_A2308 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2308 
SymbolsetB 
ID5594337 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2306041 
End bp2307222 
Gene Length1182 bp 
Protein Length393 aa 
Translation table11 
GC content53% 
IMG OID640921434 
Productsugar efflux transporter B 
Protein accessionYP_001458970 
Protein GI157161652 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00899] sugar efflux transporter 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000000000102092 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCATAACT CCCCCGCAGT CTCCAGCGCG AAATCGTTTG ACCTGACCTC GACGGCGTTT 
TTAATCGTTG CCTTTCTCAC CGGTATTGCG GGCGCTCTGC AAACCCCGAC ACTCAGTATT
TTTCTTACCG ATGAAGTACA TGCCCGTCCG GCGATGGTGG GATTCTTCTT TACCGGCAGC
GCTGTCATTG GGATTCTGGT AAGTCAGTTT CTCGCCGGGC GCTCTGATAA GCGCGGCGAT
CGCAAATCGC TGATTGTCTT TTGCTGCCTG TTAGGCGTGC TGGCCTGCAC CCTTTTTGCC
TGGAATCGCA ACTACTTTGT TTTGCTATTC GTTGGCGTCT TTCTTAGCAG CTTTGGCTCG
ACCGCTAACC CGCAAATGTT TGCCCTTGCC CGTGAACATG CCGACAAAAC CGGACGTGAG
GCGGTGATGT TCAGTTCTTT TTTACGCGCT CAGGTTTCAC TGGCATGGGT CATTGGCCCA
CCGCTGGCTT ATGCCTTAGC GATGGGTTTC AGCTTTACGG TAATGTATCT GAGCGCAGCG
GTAGCGTTTA TTGTTTGCGG CGTGATGGTG TGGCTGTTTT TACCGTCGAT GCAAAAAGAG
CTTCCGCTGG CGACCGGCAC GATTGAAGCG CCGCGCCGTA ACCGTCGCGA TACGCTGCTG
CTGTTTGTCA TTTGTACATT GATGTGGGGC TCGAACAGCC TGTACATCAT CAACATGCCG
CTATTTATTA TCAACGAACT GCATCTTCCC GAGAAACTGG CCGGTGTGAT GATGGGGACC
GCCGCCGGGC TGGAAATCCC GACCATGTTG ATTGCCGGAT ATTTCGCCAA ACGTCTGGGT
AAGCGTTTCT TAATGCGCGT TGCTGCCGTG GGTGGCGTCT GTTTTTACGC AGGAATGCTG
ATGGCGCATT CACCTGTCAT TCTGTTGGGC TTGCAGCTGC TAAATGCTAT TTTTATTGGC
ATTCTGGGTG GCATCGGGAT GCTCTATTTT CAGGATTTGA TGCCCGGTCA GGCAGGTTCA
GCCACCACGC TCTATACCAA CACGTCGCGC GTGGGCTGGA TCATCGCAGG ATCAGTGGCG
GGCATCGTCG CCGAGATCTG GAATTATCAC GCTGTGTTCT GGTTTGCGAT GGTGATGATT
ATCGCCACTC TGTTTTGCTT ACTGCGGATT AAAGATGTTT AA
 
Protein sequence
MHNSPAVSSA KSFDLTSTAF LIVAFLTGIA GALQTPTLSI FLTDEVHARP AMVGFFFTGS 
AVIGILVSQF LAGRSDKRGD RKSLIVFCCL LGVLACTLFA WNRNYFVLLF VGVFLSSFGS
TANPQMFALA REHADKTGRE AVMFSSFLRA QVSLAWVIGP PLAYALAMGF SFTVMYLSAA
VAFIVCGVMV WLFLPSMQKE LPLATGTIEA PRRNRRDTLL LFVICTLMWG SNSLYIINMP
LFIINELHLP EKLAGVMMGT AAGLEIPTML IAGYFAKRLG KRFLMRVAAV GGVCFYAGML
MAHSPVILLG LQLLNAIFIG ILGGIGMLYF QDLMPGQAGS ATTLYTNTSR VGWIIAGSVA
GIVAEIWNYH AVFWFAMVMI IATLFCLLRI KDV