Gene SbBS512_E4166 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSbBS512_E4166 
Symbol 
ID6269020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShigella boydii CDC 3083-94 
KingdomBacteria 
Replicon accessionNC_010658 
Strand
Start bp3888306 
End bp3889733 
Gene Length1428 bp 
Protein Length475 aa 
Translation table11 
GC content49% 
IMG OID641727990 
Productdrug resistance MFS transporter, drug:H+ antiporter-1 (DHA2) family 
Protein accessionYP_001882411 
Protein GI187731926 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000209718 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCGATA AAAAGAAGCG TAGTATGGCG GGTTTGCCGT GGATCGCGGC GATGGCCTTC 
TTCATGCAGG CACTTGATGC CACTATTCTG AATACCGCCT TACCCGCAAT CGCTCATAGC
CTTAATCGTT CTCCTCTCGC AATGCAATCA GCCATCATCA GTTATACGCT GACGGTGGCG
ATGTTTATTC CGGTAAGCGG ATGGCTAGCC GATCGCTTCG GTACGCGTCG CATTTTTACC
CTTGCCGTGA GTCTGTTCAC GTTGGGTTCT CTGGCCTGCG CACTTTCTAA TTCGCTACCA
CAGCTGGTTG TCTTCCGGGT TATTCAGGGG ATAGGCGGCG CAATGATGAT GCCTGTTGCT
CGGCTGGCCT TACTGCGTGC TTATCCTCGT AATGAACTTC TTCCTGTATT GAATTTTGTC
GCCATGCCGG GTCTGGTGGG GCCAATTTTA GGCCCCGTTC TTGGCGGCGT GCTGGTCACC
TGGGCAACCT GGCACTGGAT ATTTTTAATC AATATCCCCA TAGGTATTGC GGGCCTTCTT
TACGCGCGCA AACATATGCC CAATTTCACC ACCGCACGAC GCAGATTCGA TATCACTGGC
TTTTTGCTGT TTGGCCTCAG CCTTGTTCTC TTCTCAAGCG GAATAGAGCT ATTCGGGGAA
AAGATTGTCG CCAGCTGGAT TGCCTTGACG GTAATTGTCA CCAGCATCGG GTTACTGCTT
CTCTATATTC TCCATGCGCG ACACACGCCA AACCCATTAA TTTCATTAGA TTTATTTAAA
ACCCGCACTT TCTCGATCGG TATCGTAGGC AATATTGCAA CCCGTCTGGG GACCGGTAGT
GTACCGTTCC TTATGCCATT GATGTTACAG GTAGGATTTG GTTATCAGGC GTTTATTGCC
GGCTGTATGA TGGCACCGAC AGCGTTAGGT TCCATTATTG CAAAATCGAT GGTTACCCAA
GTCTTACGTC GTCTGGGCTA TCGCCATACG TTAGTGGGGA TCACGGTGAT TATTGGGCTA
ATGATCGCTC AGTTCTCTTT GCAATCACCG GCAATGGCGA TATGGATGCT GATCTTGCCG
TTGTTTATAT TAGGGATGGC TATGTCGACG CAATTTACCG CGATGAATAC CATCACACTT
GCCGATCTGA CCGATGACAA CGCCAGCAGC GGTAACAGTG TTCTGGCGGT CACGCAGCAA
CTGTCGATTA GTTTAGGCGT TGCTGTAAGT GCGGCCGTCC TTCGCGTTTA TAAAGGGATG
GAAGGCACAA CGACTGTCGA ACAATTCCAC TATACGTTTA TCACGATGGG CATTATTACT
GTTGCTTCAG CAGCAATGTT CATGCTTCTG AAAACAACCG ATGGTAATAA TTTGATCAAA
AGACAGCGTA AATCTAAGCC GAACCGCGTT CCATCAGAAT CGGAGTAA
 
Protein sequence
MSDKKKRSMA GLPWIAAMAF FMQALDATIL NTALPAIAHS LNRSPLAMQS AIISYTLTVA 
MFIPVSGWLA DRFGTRRIFT LAVSLFTLGS LACALSNSLP QLVVFRVIQG IGGAMMMPVA
RLALLRAYPR NELLPVLNFV AMPGLVGPIL GPVLGGVLVT WATWHWIFLI NIPIGIAGLL
YARKHMPNFT TARRRFDITG FLLFGLSLVL FSSGIELFGE KIVASWIALT VIVTSIGLLL
LYILHARHTP NPLISLDLFK TRTFSIGIVG NIATRLGTGS VPFLMPLMLQ VGFGYQAFIA
GCMMAPTALG SIIAKSMVTQ VLRRLGYRHT LVGITVIIGL MIAQFSLQSP AMAIWMLILP
LFILGMAMST QFTAMNTITL ADLTDDNASS GNSVLAVTQQ LSISLGVAVS AAVLRVYKGM
EGTTTVEQFH YTFITMGIIT VASAAMFMLL KTTDGNNLIK RQRKSKPNRV PSESE