Gene Information |
Locus tag | EcE24377A_2468 |
Symbol | setB |
ID | 5588835 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 2452152 |
End bp | 2453333 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640926128 |
Product | sugar efflux transporter B |
Protein accession | YP_001463523 |
Protein GI | 157155722 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00899] sugar efflux transporter |
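The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
length_bp = gene_length(2452152, 2453333)
print(length_bp)                           # 1182
print(expected_protein_length(length_bp))  # 393
```

Both results agree with the Gene Length (1182 bp) and Protein Length (393 aa) fields.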
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000203316 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATAACT CCCCCGCAGT CTCCAGCGCG AAATCGTTTG ACCTGACCTC GACGGCGTTT TTAATCGTTG CCTTTCTCAC CGGTATTGCG GGCGCTCTGC AAACCCCGAC ACTCAGTATT TTTCTTACTG ATGAAGTACA TGCTCGTCCG GCGATGGTGG GATTCTTCTT TACCGGCAGC GCTGTCATTG GGATTCTGGT AAGTCAGTTT CTCGCCGGGC GCTCTGATAA GCGCGGCGAT CGCAAATCGC TGATTGTCTT TTGCTGCCTG TTAGGCGTGC TGGCCTGCAC CCTTTTTGCC TGGAATCGCA ACTACTTTGT TTTGCTATTC GTTGGCGTCT TTCTTAGCAG CTTTGGCTCG ACCGCTAACC CGCAAATGTT TGCCCTTGCC CGTGAACATG CCGACAAAAC CGGACGTGAG GCGGTGATGT TCAGCTCTTT TTTACGCGCT CAGGTTTCAC TGGCATGGGT CATTGGCCCA CCGCTGGCTT ATGCCTTAGC GATGGGTTTC AGCTTTACGG TAATGTATCT GAGCGCAGCG GTAGCGTTTA TTGTTTGCGG TGTGATGGTG TGGCTGTTTT TACCGTCGAT GCAAAAAGAG CTTCCGCTGG CGACCGGCAC GATCGAAGCG CCGCGCCGTA ACCGTCGCGA TACGCTGCTG CTGTTTGTCA TTTGTACATT GATGTGGGGC TCGAACAGCC TGTACATCAT CAACATGCCG CTATTTATTA TCAACGAACT GCATCTTCCC GAGAAACTGG CCGGTGTGAT GATGGGGACC GCCGCCGGGC TGGAAATCCC GACGATGTTG ATTGCCGGAT ATTTCGCCAA ACGTCTGGGT AAGCGTTTCT TAATGCGCGT TGCTGCCGTG GGTGGCGTCT GTTTTTACGC AGGAATGCTG ATGGCGCATT CACCTGTCAT TCTGTTGGGC TTGCAGCTGC TAAATGCTAT TTTTATTGGC ATTCTGGGCG GCATCGGGAT GCTCTATTTT CAGGATCTGA TGCCCGGTCA GGCGGGTTCA GCCACCACGC TCTATACCAA CACGTCGCGC GTGGGCTGGA TCATCGCAGG ATCAGTGGCG GGCATCGTCG CCGAGATCTG GAATTATCAC GCTGTGTTCT GGTTTGCGAT GGTGATGATT ATCGCCACTC TGTTTTGCTT ACTGCGGATT AAAGATGTTT AA
|
Protein sequence | MHNSPAVSSA KSFDLTSTAF LIVAFLTGIA GALQTPTLSI FLTDEVHARP AMVGFFFTGS AVIGILVSQF LAGRSDKRGD RKSLIVFCCL LGVLACTLFA WNRNYFVLLF VGVFLSSFGS TANPQMFALA REHADKTGRE AVMFSSFLRA QVSLAWVIGP PLAYALAMGF SFTVMYLSAA VAFIVCGVMV WLFLPSMQKE LPLATGTIEA PRRNRRDTLL LFVICTLMWG SNSLYIINMP LFIINELHLP EKLAGVMMGT AAGLEIPTML IAGYFAKRLG KRFLMRVAAV GGVCFYAGML MAHSPVILLG LQLLNAIFIG ILGGIGMLYF QDLMPGQAGS ATTLYTNTSR VGWIIAGSVA GIVAEIWNYH AVFWFAMVMI IATLFCLLRI KDV
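The relationship between the two sequences above can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of NCBI translation table 11 (identical to the standard code; the tables differ only in permitted start codons) and ignores the spaces used to group the sequence into blocks of ten. The `translate` helper is hypothetical and not a tool provided by the database.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # remove block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon ends translation
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record above.
print(translate("ATGCATAACT CCCCCGCAGT CTCCAGCGCG"))  # MHNSPAVSSA
```

Passing the full gene sequence to the same function should give the protein sequence listed in the record, with the final `TAA` codon read as the stop.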