Gene GWCH70_1833 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1833 
Symbol 
ID7976456 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1898612 
End bp1900306 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content45% 
IMG OID644798669 
Productdrug resistance transporter, EmrB/QacA subfamily 
Protein accessionYP_002949839 
Protein GI239827215 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0281974 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCAACGT TTTTAACTGG TTATATTATA TTTTCCGTTA TCGTTCTCGC CGTTATCAAT 
GTTGCGGTGA GAAGACGAAA AAGCGCACCC GTCCCGCGTG ACGCGGAAAA AGCGGAGCTT
TCGAACGGCA TCATGGTAAA TCGTAATATG GAGACCGCTT CGACAGCGGC GCCGCCGCGC
CAAGCGGAAT TGGCGAATAT TGGCGATAGT CGGGCTAAAG TAGTGGCTAC GATTATGCTT
GGAGCGTTTG TGGCGATTTT GAATCAGACG CTCATTAACG TCGCACTTCC GCATATGATG
AACGATTTTA ACGTAGAAAC ATCGACGATT CAGTGGCTTG TTACGGGATA CATGCTCGTT
AATGGTGTCT TGATTCCAAT TAGCCCATTT TTAATCGCGA AGTTTCCGAC GAAGAAGTTG
TTCTTGTCGG GAATGTTATT TTTTGCGATT GGTGCCTTTA TTTGTTCTGT TTCTCCTTCT
TTTGCCATCG TGCTAATCGG TCGTCTTATT CAAGCGATAG GTGCAGGAAT TATTATGCAA
TTGATGATGG TCATTATGTT AAATATTTTC CCGCCAGAGA AACGTGGAGT AGCAATGGGG
ACAGTTGGGA TCGCAATGAT GTTTGCGCCT GCTGTCGGTC CGACATTATC GGGATGGATT
GTTGAACATT ATTCATGGCG TCTTTTATTC TACGTCGTAT TGCCCATTGC GATTATTGAT
ATTGTGCTAG CTTTTCTTTG GCTGAAAGAT ACATCGAGAA CAGGCAATCC GCCATTGGAT
CTACGAGGAG CAATTTATTC CACCATCGGT TTCGGCGGTG TGCTATACGG ATTTAGTGAA
GCGGGAAGCA ATGGCTGGGG GCAAACAAAC GTCATTGTGT CGATTATCAT CGGTGTTATA
TTCATTATTT TATTTGCATG GCGTTCGCTA ATAGTAGAAA ATCCAATTTT AAACTTTCGA
GTGTTTAAAT ATAATGTTTT CACATTATCT ACTATTATCG GCTGTGTGAT CAATATGGCT
ATGTTTGCCG CGATGGTGCT GCTTCCGGTC TATTTGCAAA ACTTGCGCGG CTTTACACCG
CTTGACGCCG GTTTACTGCT CTTGCCTGGC GCGATCGTGA TGGCCATCAT GTCGCCGATT
TCCGGATGGA TTTTTGACCG CATTGGCGCG CGGATGCTGG CTATTGTCGG TTTAGTCATT
ACGGTTGTGA CAACATGGGA GTTCAGCAAG CTGACGATGG ATACGCCATA TAGCCATATT
TTGGCGCTTT ATATTTTCCG CATGTTCGGT ATGTCGATGT TGGGGATGCC GATTATGACG
GAAGGGCTAA ACGCATTGCC CCGCCATTTA TACAGCCACG GAACGGCGAT GGCGAATACG
TTGCGTCAAG TAGCGGCATC GTTGGGAACA GCTTTCCTCG TTACCGTGAT GTCAAACCGA
TCGAAGTTTC ATGCGGAAAA TTACCGCAAT GAAATGACCG AAAATAATCC GTTCTTCATG
GACATTGTAA CGCATCTCAA ACAAGTCATT CCAAGCGATG AAGCAATTGT ACAGATTCTA
AACGGTATGG TGCAGCAACG CGCTGCGATA GAAGGCATTA ACGACGCGTT TTTTGTCGCA
ACAGGACTCG CGTTTCTTGC GCTTATTTTA GCTTTCTTCT TAAAAGGGAA AAAGAAAAAT
ATTCCGTCCT CGTAA
 
Protein sequence
MSTFLTGYII FSVIVLAVIN VAVRRRKSAP VPRDAEKAEL SNGIMVNRNM ETASTAAPPR 
QAELANIGDS RAKVVATIML GAFVAILNQT LINVALPHMM NDFNVETSTI QWLVTGYMLV
NGVLIPISPF LIAKFPTKKL FLSGMLFFAI GAFICSVSPS FAIVLIGRLI QAIGAGIIMQ
LMMVIMLNIF PPEKRGVAMG TVGIAMMFAP AVGPTLSGWI VEHYSWRLLF YVVLPIAIID
IVLAFLWLKD TSRTGNPPLD LRGAIYSTIG FGGVLYGFSE AGSNGWGQTN VIVSIIIGVI
FIILFAWRSL IVENPILNFR VFKYNVFTLS TIIGCVINMA MFAAMVLLPV YLQNLRGFTP
LDAGLLLLPG AIVMAIMSPI SGWIFDRIGA RMLAIVGLVI TVVTTWEFSK LTMDTPYSHI
LALYIFRMFG MSMLGMPIMT EGLNALPRHL YSHGTAMANT LRQVAASLGT AFLVTVMSNR
SKFHAENYRN EMTENNPFFM DIVTHLKQVI PSDEAIVQIL NGMVQQRAAI EGINDAFFVA
TGLAFLALIL AFFLKGKKKN IPSS