Gene GWCH70_2638 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2638 
Symbol 
ID7978299 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2672495 
End bp2675617 
Gene Length3123 bp 
Protein Length1040 aa 
Translation table11 
GC content47% 
IMG OID644799439 
ProductMMPL domain protein 
Protein accessionYP_002950598 
Protein GI239827974 
COG category[R] General function prediction only 
COG ID[COG2409] Predicted drug exporters of the RND superfamily 
TIGRFAM ID[TIGR03057] X-X-X-Leu-X-X-Gly heptad repeats 


Plasmid Coverage information

Num covering plasmid clones34 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAGAGGAA TCGTCAAAGG AAAATGGTTT GTGCTGTTGG CGTGGATTGT CGTTGCTGTC 
ATCCTTATTG TGACGGCGCC GAACATGGCC GATCTTGTCC GCGAGAAAGG GCAGCTTAGC
GTGCCAAAAG GCTACTCGTC TTCGCTTGCG ATGGATATTT TAAACGAAGT GCAAAAGAAG
GAGAACAAAG GAAGCGAACT GTCAACAGCA CTTGTGTTTT ACCGCAAAGG CGGGCTGACG
GACAACGATT GGAAGGAAGC GGAGCGCGCC GTTAAACAGT TGGAAGCGCA AAAAGAGAAA
CTCGGAATTA CCGAGATTAT CTCCCCGTTT AAAGAAAAGG AATTGAAAGA TCAACTTGTT
TCTAAAGATG GCACGACGAT TTTAACGTCC GTGAAGCTGG AGCGGAACGG AAGAAGCGCG
AAAGAAATAA GCAAGGCGCT TTACAAAGCG ATTGATGACA TTTCTGTCGA ACATTATTAC
ACGGGTGGCT GGATGATTGA CGAAGACGTT GTCGTCAGCT CGCAGAAAGG GTTGAAAAAG
ACGGAAGGCA TTACGGTTGT GTTTATTTTA GTGGTGTTGC TTGTCGTGTT CCGTTCGTTT
GTCGCGCCAC TCATTCCGCT TATCACCGTT GGGATGACGT ATTTAGTGTC ACAGTCGATT
GTCGCTTTTC TCGTGGATCA AGTGAATTTT CCGCTCTCTA CGTTTACGCA AATCTTTTTA
GTTGCCGTGC TGTTTGGAAT CGGAACGGAC TATTGTATTT TGCTGTTAAG CCGGTTTAAG
GAAGAACTTT CGCAGCACGA AAACCGCACG GATGCGATTG TCGCCACGTA CCGCACCGCT
GGAAAAACGG TATTGTTCAG CGGAATTGCG GTGATGATCG GGTTTGCCGC AATTGGATTA
TCAACGTTTA AGTTATATCA ATCGGCTGCC GCGGTGGCGG TTGGGGTAGC GGTGCTCCTT
GTTGCGCTGA TGACGTTGGT GCCGTTTTTT ATGGCGGTGC TTGGTCCAAA TCTGTTTTGG
CCGGCGAAAG GAAATTTAGA GCATAAACAA AGCCGCTTAT GGGATGACGC GGGACGGTTT
GCGTTTGCCA GACCGCTCAT TGCGTTAGGC ATCGTTGCGG TTATTACCGT TCCGGTGTTG
GCAACGTATG ATGGGGATTT GTCATTTGAT TCCCTTGAAG AAATTGGTGA CGACTATGCG
TCTGTTAAAG CGTTCAATAT TATCGCTAAA AATTTTAATC CTGGTGAAGC GATGCCAACG
CAAATCGTCA TGAAAAACGA TGAAGCGATG AACTCGCAAG AATATTTTGC CCTTATTGAA
AAAATCAGCC GTGAAGTCGA AAAAATCGAT GGTATCGACA AAGTGCGTTC TGTTACACGC
CCGACTGGGG AGCCAATTGA ACAATTGTTT GTCACCGAGC AAGCGAAAAG CTTAAAAGAT
GGTCTTGGAC AAGGAAAAGA AGGGATTGAG AAAATCAGCT CTGGACTAAG TCAAGCGGGC
AAACAGCTTT CCGCATCCGC GCCAAAATTA AAACAAGCGA CAAACGGTAT GGAAGAGCTT
GCATCAGGAA CGAAGCGGCT CAAAATGGGT GTTAGCGACC TCCAAAAAGG ACTTGCGCAA
ATTGAACAAG GCATTCGCAG CGGTTCGATG GGAGCTGGCG AAATCAAAAA AGGATTGGCA
ACAATCAAGG ATAATGCGAA AAAGTTAAAG CGAGGCGCTG AGCAGTTGCT GCAAGGATAT
GAAAAGGCGG GCGCCGGGCT CTCTTCGCTT ATCAGCCAGT ATGAACAACT GCAAGACGGC
ATGAACGCCT TATCGGAGCA ATTATCCTCG GTGAGCGCAT CGTTGAATCA TATAGAGCAA
ACACATCCAG AGCTGCAACA GGATCCGGAA TATCAGCGAA CAAAAATGAT TGCAGCGGGA
TTAGCGCAAC GATCCCAGCA AATGACTGCA GGTTTTACGC AATTAAATAA CGCACTCAAA
CAGGCAAGCG CTGGAGTCCG TCAAGCGAAC GGCTCGTTTG TGCAGCTTAT CAGCGGCCAA
AAAGCGTTGA TTGATGGGAT GACCAAACTG ATCGCCGGGC TTGACGAGCT GCAAAAAGGG
ATGGATAAAG CAGCAAATGG ACAGCGTCAA GTAATCGGGC GTCTTCCGCA GCTGTCGAAT
GGATTAAGCG AAGTGAACGC TGGGCAAGAA CAGCTGCTGC GAGGATTTTC CCAATTAGAC
GGGCAAATGA GTCAATTAAT TACAGGGCTT GATCAAAGTG CCGACGGTCT TCGGAAAGTA
TCGAAAGGCC TAGGTTCCGC AGAAAGCTAT TTATCCGAGC TTGCGTCTTC GCCAAACAAA
GAAATGACAG GCTGGTATTT GCCGAAACAA GTGCTTGAGA GCAAAGAATT TGCCAAAGCG
CAAGATGCGT ATATGTCCAA AGACAAAAAA GTCGTGACGT TTGATGTGAT TTTCGATGAA
AACCCGTATT CCACATCGGC GTTGAACAAA ATCGATGATA TTAAACAAGC AGTTGAGCGT
GCGGTGAAAG ATACGAAACT CGAAAATGCA AAAGTTGCCG TTGGCGGAGT GACGAGCATT
TATTCCGATT TGCACACGAT TTCGTCGAAT GACTACTCCC GCACCGTTGT GTTGATGCTT
GCCGGCATTG CGCTTATTTT AATCGTTTTG CTTCGTTCGT TGATTATGCC GATGTATTTA
ATTTTATCGC TTATTTTGAC GTACTACACA TCGATGGCGG TAACCGAGCT TATTTTTGTC
AATGGGCTTG GCTACGCCGG TTTGAACTGG GCGGTATCGT TTTTCGCGTT TGTCATCTTA
ATCGCGCTTG GCATCGACTA CAGCATTTTC TTGATGGACC GCTTCAACGA ATATCGTGAC
AAACCGGTGC AGGAAGCGAT GCTGTTGGCG ATGCGCAATA TGGGAACGGT TATTATTTCG
GCCGCTGTTA TTTTAGGCGG GACGTTTGCG GCAATGTATC CGTCCGGCGT CTTGTCGCTC
TTGCAAATCG CGACGATTGT GCTGACTGGA TTAATTTTAT ACGCGCTTGT TGTACTGCCG
CTGTTCGTGC CGGTGATGGT CAAAACATTC GGCAAGGCAA ACTGGTGGCC GTTTATGAAG
TAA
 
Protein sequence
MRGIVKGKWF VLLAWIVVAV ILIVTAPNMA DLVREKGQLS VPKGYSSSLA MDILNEVQKK 
ENKGSELSTA LVFYRKGGLT DNDWKEAERA VKQLEAQKEK LGITEIISPF KEKELKDQLV
SKDGTTILTS VKLERNGRSA KEISKALYKA IDDISVEHYY TGGWMIDEDV VVSSQKGLKK
TEGITVVFIL VVLLVVFRSF VAPLIPLITV GMTYLVSQSI VAFLVDQVNF PLSTFTQIFL
VAVLFGIGTD YCILLLSRFK EELSQHENRT DAIVATYRTA GKTVLFSGIA VMIGFAAIGL
STFKLYQSAA AVAVGVAVLL VALMTLVPFF MAVLGPNLFW PAKGNLEHKQ SRLWDDAGRF
AFARPLIALG IVAVITVPVL ATYDGDLSFD SLEEIGDDYA SVKAFNIIAK NFNPGEAMPT
QIVMKNDEAM NSQEYFALIE KISREVEKID GIDKVRSVTR PTGEPIEQLF VTEQAKSLKD
GLGQGKEGIE KISSGLSQAG KQLSASAPKL KQATNGMEEL ASGTKRLKMG VSDLQKGLAQ
IEQGIRSGSM GAGEIKKGLA TIKDNAKKLK RGAEQLLQGY EKAGAGLSSL ISQYEQLQDG
MNALSEQLSS VSASLNHIEQ THPELQQDPE YQRTKMIAAG LAQRSQQMTA GFTQLNNALK
QASAGVRQAN GSFVQLISGQ KALIDGMTKL IAGLDELQKG MDKAANGQRQ VIGRLPQLSN
GLSEVNAGQE QLLRGFSQLD GQMSQLITGL DQSADGLRKV SKGLGSAESY LSELASSPNK
EMTGWYLPKQ VLESKEFAKA QDAYMSKDKK VVTFDVIFDE NPYSTSALNK IDDIKQAVER
AVKDTKLENA KVAVGGVTSI YSDLHTISSN DYSRTVVLML AGIALILIVL LRSLIMPMYL
ILSLILTYYT SMAVTELIFV NGLGYAGLNW AVSFFAFVIL IALGIDYSIF LMDRFNEYRD
KPVQEAMLLA MRNMGTVIIS AAVILGGTFA AMYPSGVLSL LQIATIVLTG LILYALVVLP
LFVPVMVKTF GKANWWPFMK