Gene Information |
Locus tag | SbBS512_E3849 |
Symbol | |
ID | 6270304 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3575612 |
End bp | 3576871 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641727705 |
Product | major facilitator superfamily transporter |
Protein accession | YP_001882140 |
Protein GI | 187730432 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
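The coordinate and length fields above are consistent with one another, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical.

```python
# Values copied from the Gene Information table above.
start_bp = 3575612
end_bp = 3576871


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count, assuming the final codon is an untranslated stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(start_bp, end_bp)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1260 419
```

Under these assumptions the result matches the reported Gene Length (1260 bp) and Protein Length (419 aa).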
Plasmid Coverage information |
Num covering plasmid clones | 47 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGTAAAAA TGAAACACTG TTGTAAAAAT GTGGTGATCC TCATGCCCGA ACCCGTAGCC GAACCCGCGC TAAACGGATT GCGCCTGAAT TTGCGCATTG TCTCCATTGT CATGTTTAAC TTCGCCAGCT ACCTCACCAT CGGGTTGCCG CTCGCTGTAT TACCGGGCTA TGTCCATGAT GTGATGGGCT TTAGCGCTTT CTGGGCAGGA TTGGTTATCA GCCTGCAATA TTTCGCCACC TTGCTGAGCC GTCCTCATGC CGGACGTTAC GCCGATTTGC TGGGCCCCAA AAAGATTGTC GTCTTCGGTT TATGCGGCTG CTTTTTGAGC GGTCTGGGAT ATCTGACGGC AGGATTAACC GCCAGTCTGC CCGTCATCAG CCTGTTATTA CTTTGCCTGG GACGCGTGAT CCTTGGGATT GGGCAAAGTT TTGCCGGAAC GGGATCGACC CTGTGGGGCG TTGGCGTGGT TGGCTCGCTG CATATCGGGC GAGTGATTTC GTGGAACGGC ATTGTCACTT ACGGGGCGAT GGCGATGGGT GCGCCGTTAG GCGTCGTGTT TTATCACTGG GGCGGCTTGC AGGCGTTAGC GTTAATCATT ATGGGCGTGG CGCTGGTGGC CATTTTGTTG GCGATCCCGC GTCCGACGGT AAAAGCCAGT AAAGGCAAAC CGCTGCCGTT TCGCGCGGTG CTTGGGCGCG TCTGGCTGTA CGGTATGGCA CTGGCACTGG CTTCTGCCGG ATTTGGCGTC ATCGCCACCT TTATCACGCT GTTTTATGAC GCTAAAAGTT GGGACGGTGC GGCTTTCGCG CTGACGCTGT TTAGCTGTGC GTTTGTCGGT ACGCGTTTGT TATTCCCTAA CGGCATTAAC CGTATCGGCG GCTTAAACGT AGCGATGATT TGCTTTAGCG TTGAGATAAT CGGCCTGCTA CTGGTTGGCG TGGCGACTAT GCCGTGGATG GCGAAAATCG GCGTCTTACT GGCGGGGGCA GGGTTTTCGC TGGTGTTCCC GGCATTAGGT GTAGTGGCGG TAAAAGCGGT TCCGCAGCAA AATCAGGGGG CGGCGCTGGC AACTTACACC GTATTTATGG ATTTATCGCT TGGCGTGACC GGACCACTGG CTGGGCTGGT GATGAGCTGG GCGGGCGTAC CGGTGATTTA TCTGGCGGCG GCGGGACTGG TCGCAATCGC GTTATTACTG ACGTGGCGAT TAAAAAAACG GCCTCCGGAA CACGTCCCTG AGGCCGCCTC ATCATCTTAA
|
Protein sequence | MVKMKHCCKN VVILMPEPVA EPALNGLRLN LRIVSIVMFN FASYLTIGLP LAVLPGYVHD VMGFSAFWAG LVISLQYFAT LLSRPHAGRY ADLLGPKKIV VFGLCGCFLS GLGYLTAGLT ASLPVISLLL LCLGRVILGI GQSFAGTGST LWGVGVVGSL HIGRVISWNG IVTYGAMAMG APLGVVFYHW GGLQALALII MGVALVAILL AIPRPTVKAS KGKPLPFRAV LGRVWLYGMA LALASAGFGV IATFITLFYD AKSWDGAAFA LTLFSCAFVG TRLLFPNGIN RIGGLNVAMI CFSVEIIGLL LVGVATMPWM AKIGVLLAGA GFSLVFPALG VVAVKAVPQQ NQGAALATYT VFMDLSLGVT GPLAGLVMSW AGVPVIYLAA AGLVAIALLL TWRLKKRPPE HVPEAASSS
|
| |
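The sequences above are printed in space-separated groups of ten characters, so the spaces need to be stripped before the sequences are used elsewhere. The sketch below shows one way to do that and to compute a GC percentage that could be compared against the reported GC content field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the example runs only on the first two groups of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two 10-base groups of the gene sequence, used as a small example.
example = clean_sequence("GTGGTAAAAA TGAAACACTG")
print(len(example), gc_percent(example))  # 20 35.0
```

Passing the full Gene sequence value through `clean_sequence` should give a string whose length can be checked against the Gene Length field. The sequence starts with GTG, which translation table 11 allows as an alternative start codon read as methionine, which is why the protein sequence begins with M.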