Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1833 |
Symbol | |
ID | 7976456 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 1898612 |
End bp | 1900306 |
Gene Length | 1695 bp |
Protein Length | 564 aa |
Translation table | 11 |
GC content | 45% |
IMG OID | 644798669 |
Product | drug resistance transporter, EmrB/QacA subfamily |
Protein accession | YP_002949839 |
Protein GI | 239827215 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0281974 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCAACGT TTTTAACTGG TTATATTATA TTTTCCGTTA TCGTTCTCGC CGTTATCAAT GTTGCGGTGA GAAGACGAAA AAGCGCACCC GTCCCGCGTG ACGCGGAAAA AGCGGAGCTT TCGAACGGCA TCATGGTAAA TCGTAATATG GAGACCGCTT CGACAGCGGC GCCGCCGCGC CAAGCGGAAT TGGCGAATAT TGGCGATAGT CGGGCTAAAG TAGTGGCTAC GATTATGCTT GGAGCGTTTG TGGCGATTTT GAATCAGACG CTCATTAACG TCGCACTTCC GCATATGATG AACGATTTTA ACGTAGAAAC ATCGACGATT CAGTGGCTTG TTACGGGATA CATGCTCGTT AATGGTGTCT TGATTCCAAT TAGCCCATTT TTAATCGCGA AGTTTCCGAC GAAGAAGTTG TTCTTGTCGG GAATGTTATT TTTTGCGATT GGTGCCTTTA TTTGTTCTGT TTCTCCTTCT TTTGCCATCG TGCTAATCGG TCGTCTTATT CAAGCGATAG GTGCAGGAAT TATTATGCAA TTGATGATGG TCATTATGTT AAATATTTTC CCGCCAGAGA AACGTGGAGT AGCAATGGGG ACAGTTGGGA TCGCAATGAT GTTTGCGCCT GCTGTCGGTC CGACATTATC GGGATGGATT GTTGAACATT ATTCATGGCG TCTTTTATTC TACGTCGTAT TGCCCATTGC GATTATTGAT ATTGTGCTAG CTTTTCTTTG GCTGAAAGAT ACATCGAGAA CAGGCAATCC GCCATTGGAT CTACGAGGAG CAATTTATTC CACCATCGGT TTCGGCGGTG TGCTATACGG ATTTAGTGAA GCGGGAAGCA ATGGCTGGGG GCAAACAAAC GTCATTGTGT CGATTATCAT CGGTGTTATA TTCATTATTT TATTTGCATG GCGTTCGCTA ATAGTAGAAA ATCCAATTTT AAACTTTCGA GTGTTTAAAT ATAATGTTTT CACATTATCT ACTATTATCG GCTGTGTGAT CAATATGGCT ATGTTTGCCG CGATGGTGCT GCTTCCGGTC TATTTGCAAA ACTTGCGCGG CTTTACACCG CTTGACGCCG GTTTACTGCT CTTGCCTGGC GCGATCGTGA TGGCCATCAT GTCGCCGATT TCCGGATGGA TTTTTGACCG CATTGGCGCG CGGATGCTGG CTATTGTCGG TTTAGTCATT ACGGTTGTGA CAACATGGGA GTTCAGCAAG CTGACGATGG ATACGCCATA TAGCCATATT TTGGCGCTTT ATATTTTCCG CATGTTCGGT ATGTCGATGT TGGGGATGCC GATTATGACG GAAGGGCTAA ACGCATTGCC CCGCCATTTA TACAGCCACG GAACGGCGAT GGCGAATACG TTGCGTCAAG TAGCGGCATC GTTGGGAACA GCTTTCCTCG TTACCGTGAT GTCAAACCGA TCGAAGTTTC ATGCGGAAAA TTACCGCAAT GAAATGACCG AAAATAATCC GTTCTTCATG GACATTGTAA CGCATCTCAA ACAAGTCATT CCAAGCGATG AAGCAATTGT ACAGATTCTA AACGGTATGG TGCAGCAACG CGCTGCGATA GAAGGCATTA ACGACGCGTT TTTTGTCGCA ACAGGACTCG CGTTTCTTGC GCTTATTTTA GCTTTCTTCT TAAAAGGGAA AAAGAAAAAT ATTCCGTCCT CGTAA
|
Protein sequence | MSTFLTGYII FSVIVLAVIN VAVRRRKSAP VPRDAEKAEL SNGIMVNRNM ETASTAAPPR QAELANIGDS RAKVVATIML GAFVAILNQT LINVALPHMM NDFNVETSTI QWLVTGYMLV NGVLIPISPF LIAKFPTKKL FLSGMLFFAI GAFICSVSPS FAIVLIGRLI QAIGAGIIMQ LMMVIMLNIF PPEKRGVAMG TVGIAMMFAP AVGPTLSGWI VEHYSWRLLF YVVLPIAIID IVLAFLWLKD TSRTGNPPLD LRGAIYSTIG FGGVLYGFSE AGSNGWGQTN VIVSIIIGVI FIILFAWRSL IVENPILNFR VFKYNVFTLS TIIGCVINMA MFAAMVLLPV YLQNLRGFTP LDAGLLLLPG AIVMAIMSPI SGWIFDRIGA RMLAIVGLVI TVVTTWEFSK LTMDTPYSHI LALYIFRMFG MSMLGMPIMT EGLNALPRHL YSHGTAMANT LRQVAASLGT AFLVTVMSNR SKFHAENYRN EMTENNPFFM DIVTHLKQVI PSDEAIVQIL NGMVQQRAAI EGINDAFFVA TGLAFLALIL AFFLKGKKKN IPSS
|
| |