Gene GWCH70_1220 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1220 
Symbol 
ID7977691 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1271522 
End bp1272874 
Gene Length1353 bp 
Protein Length450 aa 
Translation table11 
GC content44% 
IMG OID644798166 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002949339 
Protein GI239826715 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00711] drug resistance transporter, EmrB/QacA subfamily
[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.434652 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAATAACA GAATATCCGT AATGGTAAGC ATTGTGCTCG CGATGCTCGT GGCATCGATG 
GATACAACGA TCATGAATAC GACGATGCCG ATCATTGCCA AAGAACTTGG AGGATTTTCT
CTTTACGCGT GGTCGTTTGC TTCCTACATG ATCACGACGA CCGTTCTTTC TCCAATCGCC
GGGAGACTTT CCGACATATT TGGGAGAAAG AAAGTATTTA GCTTCGGTAT TATTTTATTT
CTAATCGGTT CTCTTCTTTG CGGGATGTCG CAAAATATGG TCCAGCTTGT TGTATTCCGC
GCCCTGCAAG GAATTGGCGC TGGTTTTATG ATGCCTTTTC CTGCCATCAT TGCTGGGGAT
TTATTTCCAA TTGAAAAGCG CGGAAAAATT CAAGCGTTTT TTACGGCAAT GTGGGGGATT
TCCGCGGTTC TTGCGCCGCT GCTAGGATCC TTTTTTGTCG AATACGCATC ATGGCGCTGG
ATTTTTTATG TGAATATCCC GATTTGCTTG CTTTCCTTGC TCACACTATT GCCATATAAA
GAAGTGTACG AACCAAAACG GGCGGTGATT GACTATATTG GAGCGGTACT ATTTGCGACC
GCGATCAGCC TTTTTCTATT GACGACGGTA GCCGAAAGCA ACCATTGGAT GTATGTCGGC
ATTGGCGTTG TGTTATTAGT TATTTTTTAC TTATATGAAA AAAAGCAAAC GTCTCCGCTT
GTTCCATTGA CGCTCGTGCA GCATAAAACG TTAAAATGGA TGAACATGAA CGGATTTGTC
AGCTGTGTCG CTTTATTTGG CACGTCTAGC TACATTCCGC TATTTTTGCA AAATATCGCC
CATCAATCGG TGTTTGCAAG CGGTGTCGCC CTTTTAGGCA TGTCGATTGG TTGGATGATT
GTGGCGGTGC CGGCGGGAAA ATGGATTTTG CGCTATGGGT ATCGCATGTT ATTGATTATC
GGAAACGTTC TTCTTGTGCT TTCTGGACTG CTGCTAGCAC TTTTAAATGA AAGCCACGGA
TTTTTGTATG TCTTTTTTGC CATGTTCATT CAAGGGCTGT CGTTTGGATT GACATCTACT
GTCGGTGTCA TCGGCTCGCA GCAGCTTGCC GATGCGCATG AAAAGGGAAT CGCGACTTCG
TTTTTCATGT TTTGCCGCAA CATCGGCACA GCCATTGGTG TAACGATTAT GGGCGCCTTT
TTAACGAAAG CGGCTGATTT TATGACTGGC ATCCATCATC TGTTTTTATT TGGATTTATC
GGCAGCATTG TGGCTTTATT CACATCGTTT TTCATTCGTG ATGAGTCGGA ACAGAAAAAA
AATAATTTGC TTCGCTCGGG GGAAATGGTC TAA
 
Protein sequence
MNNRISVMVS IVLAMLVASM DTTIMNTTMP IIAKELGGFS LYAWSFASYM ITTTVLSPIA 
GRLSDIFGRK KVFSFGIILF LIGSLLCGMS QNMVQLVVFR ALQGIGAGFM MPFPAIIAGD
LFPIEKRGKI QAFFTAMWGI SAVLAPLLGS FFVEYASWRW IFYVNIPICL LSLLTLLPYK
EVYEPKRAVI DYIGAVLFAT AISLFLLTTV AESNHWMYVG IGVVLLVIFY LYEKKQTSPL
VPLTLVQHKT LKWMNMNGFV SCVALFGTSS YIPLFLQNIA HQSVFASGVA LLGMSIGWMI
VAVPAGKWIL RYGYRMLLII GNVLLVLSGL LLALLNESHG FLYVFFAMFI QGLSFGLTST
VGVIGSQQLA DAHEKGIATS FFMFCRNIGT AIGVTIMGAF LTKAADFMTG IHHLFLFGFI
GSIVALFTSF FIRDESEQKK NNLLRSGEMV