Gene Information |
Locus tag | SbBS512_E4166 |
Symbol | |
ID | 6269020 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Shigella boydii CDC 3083-94 |
Kingdom | Bacteria |
Replicon accession | NC_010658 |
Strand | + |
Start bp | 3888306 |
End bp | 3889733 |
Gene Length | 1428 bp |
Protein Length | 475 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641727990 |
Product | drug resistance MFS transporter, drug:H+ antiporter-1 (DHA2) family |
Protein accession | YP_001882411 |
Protein GI | 187731926 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
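
Note: the Gene Length and Protein Length fields above are consistent with the Start bp and End bp coordinates. A minimal Python sketch of that arithmetic follows. It assumes 1-based inclusive coordinates and an unspliced CDS ending in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based, inclusive start and end coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Coordinates taken from the record above.
length = gene_length_bp(3888306, 3889733)
print(length, protein_length_aa(length))  # 1428 475
```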
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00000209718 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCGATA AAAAGAAGCG TAGTATGGCG GGTTTGCCGT GGATCGCGGC GATGGCCTTC TTCATGCAGG CACTTGATGC CACTATTCTG AATACCGCCT TACCCGCAAT CGCTCATAGC CTTAATCGTT CTCCTCTCGC AATGCAATCA GCCATCATCA GTTATACGCT GACGGTGGCG ATGTTTATTC CGGTAAGCGG ATGGCTAGCC GATCGCTTCG GTACGCGTCG CATTTTTACC CTTGCCGTGA GTCTGTTCAC GTTGGGTTCT CTGGCCTGCG CACTTTCTAA TTCGCTACCA CAGCTGGTTG TCTTCCGGGT TATTCAGGGG ATAGGCGGCG CAATGATGAT GCCTGTTGCT CGGCTGGCCT TACTGCGTGC TTATCCTCGT AATGAACTTC TTCCTGTATT GAATTTTGTC GCCATGCCGG GTCTGGTGGG GCCAATTTTA GGCCCCGTTC TTGGCGGCGT GCTGGTCACC TGGGCAACCT GGCACTGGAT ATTTTTAATC AATATCCCCA TAGGTATTGC GGGCCTTCTT TACGCGCGCA AACATATGCC CAATTTCACC ACCGCACGAC GCAGATTCGA TATCACTGGC TTTTTGCTGT TTGGCCTCAG CCTTGTTCTC TTCTCAAGCG GAATAGAGCT ATTCGGGGAA AAGATTGTCG CCAGCTGGAT TGCCTTGACG GTAATTGTCA CCAGCATCGG GTTACTGCTT CTCTATATTC TCCATGCGCG ACACACGCCA AACCCATTAA TTTCATTAGA TTTATTTAAA ACCCGCACTT TCTCGATCGG TATCGTAGGC AATATTGCAA CCCGTCTGGG GACCGGTAGT GTACCGTTCC TTATGCCATT GATGTTACAG GTAGGATTTG GTTATCAGGC GTTTATTGCC GGCTGTATGA TGGCACCGAC AGCGTTAGGT TCCATTATTG CAAAATCGAT GGTTACCCAA GTCTTACGTC GTCTGGGCTA TCGCCATACG TTAGTGGGGA TCACGGTGAT TATTGGGCTA ATGATCGCTC AGTTCTCTTT GCAATCACCG GCAATGGCGA TATGGATGCT GATCTTGCCG TTGTTTATAT TAGGGATGGC TATGTCGACG CAATTTACCG CGATGAATAC CATCACACTT GCCGATCTGA CCGATGACAA CGCCAGCAGC GGTAACAGTG TTCTGGCGGT CACGCAGCAA CTGTCGATTA GTTTAGGCGT TGCTGTAAGT GCGGCCGTCC TTCGCGTTTA TAAAGGGATG GAAGGCACAA CGACTGTCGA ACAATTCCAC TATACGTTTA TCACGATGGG CATTATTACT GTTGCTTCAG CAGCAATGTT CATGCTTCTG AAAACAACCG ATGGTAATAA TTTGATCAAA AGACAGCGTA AATCTAAGCC GAACCGCGTT CCATCAGAAT CGGAGTAA
|
Protein sequence | MSDKKKRSMA GLPWIAAMAF FMQALDATIL NTALPAIAHS LNRSPLAMQS AIISYTLTVA MFIPVSGWLA DRFGTRRIFT LAVSLFTLGS LACALSNSLP QLVVFRVIQG IGGAMMMPVA RLALLRAYPR NELLPVLNFV AMPGLVGPIL GPVLGGVLVT WATWHWIFLI NIPIGIAGLL YARKHMPNFT TARRRFDITG FLLFGLSLVL FSSGIELFGE KIVASWIALT VIVTSIGLLL LYILHARHTP NPLISLDLFK TRTFSIGIVG NIATRLGTGS VPFLMPLMLQ VGFGYQAFIA GCMMAPTALG SIIAKSMVTQ VLRRLGYRHT LVGITVIIGL MIAQFSLQSP AMAIWMLILP LFILGMAMST QFTAMNTITL ADLTDDNASS GNSVLAVTQQ LSISLGVAVS AAVLRVYKGM EGTTTVEQFH YTFITMGIIT VASAAMFMLL KTTDGNNLIK RQRKSKPNRV PSESE