Gene Information |
Locus tag | EcSMS35_2319 |
Symbol | setB |
ID | 6143288 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2349521 |
End bp | 2350702 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641617192 |
Product | sugar efflux transporter B |
Protein accession | YP_001744365 |
Protein GI | 170680537 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00899] sugar efflux transporter |
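The length fields above can be cross-checked against the coordinates. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it agrees with the listed 1182 bp) and that the final codon of the CDS is a stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2349521, 2350702)
print(length)                  # 1182
print(protein_length(length))  # 393
```

Both values match the Gene Length and Protein Length rows of the record.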
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000297031 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.00120871 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCATAACT CCCCCGCAGT CTCCAGCGCG AAATCGTTTG ATCTGACCTC GACGGCGTTT TTAATCGTTG CCTTTCTCAC CGGTATTGCG GGCGCTTTGC AAACCCCGAC ACTCAGTATT TTCCTTACCG ATGAAGTACA TGCCCGTCCG GCGATGGTGG GATTCTTCTT TACCGGCAGC GCTGTCATTG GGATTCTGGT CAGTCAGTTT CTCGCCGGGC GCTCTGATAA GCGCGGCGAT CGCAAATCGC TGATTGTCTT TTGCTGCCTG TTAGGCGTGC TGGCCTGCAC CCTTTTTGCC TGGAATCGCA ACTACTTTGT TTTGTTATTC GTTGGCGTCT TTCTTAGCAG CTTTGGCTCG ACCGCTAACC CGCAAATGTT TGCCCTTGCA CGTGAACATG CCGACAAAAC CGGACGTGAG GCGGTGATGT TCAGCTCTTT TTTACGCGCT CAGGTTTCAC TGGCATGGGT CATTGGCCCG CCGCTGGCTT ATGCCTTAGC GATGGGTTTC AGCTTTACGG TAATGTATCT GAGCGCAGCG GTAGCGTTTA TTGTTTGCGG CGTGATGGTG TGGCTGTTTT TACCGTCGAT GCAAAAAGAG CTTCCGCTGG CGACCGGCAC GGTTGAAGCG CCGCGCCGTA ACCGTCGCGA TACGCTGCTG CTGTTTGTCA TTTGTACATT GATGTGGGGC TCGAACAGCC TGTACATCAT CAACATGCCG CTATTTATTA TCAACGAACT GCATCTTCCC GAGAAACTGG CAGGCGTGAT GATGGGGACC GCCGCCGGGC TGGAAATCCC GACCATGTTG ATTGCCGGAT ATTTTGCCAA ACGTCTGGGT AAGCGTTTCT TAATGCGCGT TGCTGCCGTG GGTGGCGTCT GTTTTTACGC AGGAATGCTG ATGGCGCATT CACCTGCCAT TCTGTTGGGC TTGCAGCTGC TAAATGCTAT TTTTATTGGC ATTCTGGGCG GCATCGGGAT GCTTTATTTT CAGGATTTGA TGCCCGGTCA GGCAGGTTCA GCCACCACGC TCTATACCAA CACGTCGCGC GTGGGCTGGA TCATCGCAGG ATCTGTGGCG GGCATCGTCG CCGAGATCTG GAATTATCAC GCTGTGTTCT GGTTTGCGAT GGTGATGATT ATCGCCACTC TGTTTTGCTT ACTGCGGATT AAAGATGTTT AA
|
Protein sequence | MHNSPAVSSA KSFDLTSTAF LIVAFLTGIA GALQTPTLSI FLTDEVHARP AMVGFFFTGS AVIGILVSQF LAGRSDKRGD RKSLIVFCCL LGVLACTLFA WNRNYFVLLF VGVFLSSFGS TANPQMFALA REHADKTGRE AVMFSSFLRA QVSLAWVIGP PLAYALAMGF SFTVMYLSAA VAFIVCGVMV WLFLPSMQKE LPLATGTVEA PRRNRRDTLL LFVICTLMWG SNSLYIINMP LFIINELHLP EKLAGVMMGT AAGLEIPTML IAGYFAKRLG KRFLMRVAAV GGVCFYAGML MAHSPAILLG LQLLNAIFIG ILGGIGMLYF QDLMPGQAGS ATTLYTNTSR VGWIIAGSVA GIVAEIWNYH AVFWFAMVMI IATLFCLLRI KDV
|
| |
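The protein sequence follows from the gene sequence under translation table 11. A minimal sketch of that translation is given below. It builds the codon table from the standard NCBI ordering (bases in the order T, C, A, G), whose amino-acid assignments are the same for tables 1 and 11. The `translate` helper is hypothetical. It ignores the alternative start codons that table 11 permits, which does not matter here because this gene starts with ATG.

```python
from itertools import product

# Amino-acid assignments for NCBI translation table 11, listed in the
# conventional codon order TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGCATAACT CCCCCGCAGT CTCCAGCGCG"))  # MHNSPAVSSA
print(translate("AAAGATGTTT AA"))                     # KDV
```

The two fragments reproduce the first ten and last three residues of the listed protein sequence.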