Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1220 |
Symbol | |
ID | 7977691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1271522 |
End bp | 1272874 |
Gene Length | 1353 bp |
Protein Length | 450 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644798166 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002949339 |
Protein GI | 239826715 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | [TIGR00711] drug resistance transporter, EmrB/QacA subfamily [TIGR00880] Multidrug resistance protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.434652 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGAATAACA GAATATCCGT AATGGTAAGC ATTGTGCTCG CGATGCTCGT GGCATCGATG GATACAACGA TCATGAATAC GACGATGCCG ATCATTGCCA AAGAACTTGG AGGATTTTCT CTTTACGCGT GGTCGTTTGC TTCCTACATG ATCACGACGA CCGTTCTTTC TCCAATCGCC GGGAGACTTT CCGACATATT TGGGAGAAAG AAAGTATTTA GCTTCGGTAT TATTTTATTT CTAATCGGTT CTCTTCTTTG CGGGATGTCG CAAAATATGG TCCAGCTTGT TGTATTCCGC GCCCTGCAAG GAATTGGCGC TGGTTTTATG ATGCCTTTTC CTGCCATCAT TGCTGGGGAT TTATTTCCAA TTGAAAAGCG CGGAAAAATT CAAGCGTTTT TTACGGCAAT GTGGGGGATT TCCGCGGTTC TTGCGCCGCT GCTAGGATCC TTTTTTGTCG AATACGCATC ATGGCGCTGG ATTTTTTATG TGAATATCCC GATTTGCTTG CTTTCCTTGC TCACACTATT GCCATATAAA GAAGTGTACG AACCAAAACG GGCGGTGATT GACTATATTG GAGCGGTACT ATTTGCGACC GCGATCAGCC TTTTTCTATT GACGACGGTA GCCGAAAGCA ACCATTGGAT GTATGTCGGC ATTGGCGTTG TGTTATTAGT TATTTTTTAC TTATATGAAA AAAAGCAAAC GTCTCCGCTT GTTCCATTGA CGCTCGTGCA GCATAAAACG TTAAAATGGA TGAACATGAA CGGATTTGTC AGCTGTGTCG CTTTATTTGG CACGTCTAGC TACATTCCGC TATTTTTGCA AAATATCGCC CATCAATCGG TGTTTGCAAG CGGTGTCGCC CTTTTAGGCA TGTCGATTGG TTGGATGATT GTGGCGGTGC CGGCGGGAAA ATGGATTTTG CGCTATGGGT ATCGCATGTT ATTGATTATC GGAAACGTTC TTCTTGTGCT TTCTGGACTG CTGCTAGCAC TTTTAAATGA AAGCCACGGA TTTTTGTATG TCTTTTTTGC CATGTTCATT CAAGGGCTGT CGTTTGGATT GACATCTACT GTCGGTGTCA TCGGCTCGCA GCAGCTTGCC GATGCGCATG AAAAGGGAAT CGCGACTTCG TTTTTCATGT TTTGCCGCAA CATCGGCACA GCCATTGGTG TAACGATTAT GGGCGCCTTT TTAACGAAAG CGGCTGATTT TATGACTGGC ATCCATCATC TGTTTTTATT TGGATTTATC GGCAGCATTG TGGCTTTATT CACATCGTTT TTCATTCGTG ATGAGTCGGA ACAGAAAAAA AATAATTTGC TTCGCTCGGG GGAAATGGTC TAA
|
Protein sequence | MNNRISVMVS IVLAMLVASM DTTIMNTTMP IIAKELGGFS LYAWSFASYM ITTTVLSPIA GRLSDIFGRK KVFSFGIILF LIGSLLCGMS QNMVQLVVFR ALQGIGAGFM MPFPAIIAGD LFPIEKRGKI QAFFTAMWGI SAVLAPLLGS FFVEYASWRW IFYVNIPICL LSLLTLLPYK EVYEPKRAVI DYIGAVLFAT AISLFLLTTV AESNHWMYVG IGVVLLVIFY LYEKKQTSPL VPLTLVQHKT LKWMNMNGFV SCVALFGTSS YIPLFLQNIA HQSVFASGVA LLGMSIGWMI VAVPAGKWIL RYGYRMLLII GNVLLVLSGL LLALLNESHG FLYVFFAMFI QGLSFGLTST VGVIGSQQLA DAHEKGIATS FFMFCRNIGT AIGVTIMGAF LTKAADFMTG IHHLFLFGFI GSIVALFTSF FIRDESEQKK NNLLRSGEMV
|
| |