Gene GWCH70_1853 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1853 
Symbol 
ID7976474 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp1918340 
End bp1919503 
Gene Length1164 bp 
Protein Length387 aa 
Translation table11 
GC content43% 
IMG OID644798687 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002949857 
Protein GI239827233 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000000358317 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGG CTAATTTTCT TTCTCTTTTA TTTGCCGTTA TGTTTTTGGT GATGTCTGGT 
TTTGGCATTA TCATCCCGGT TCTTCCCTTT TTAGCAGAAG AAGTGGGTGC CACTCCTACG
CAATTAGGAC TGCTAATGGC AACGTACTCT TTGATGCAAC TATTATTCGC TCCATTTTGG
GGCCAGATGT CCGACCGTTA CGGGCGAAAA CCAATTTTAT TCATCGGCAT CGCCGGATTG
TCGCTCTCGT TCTTTTTGTT TGCCGTGTCA AAAACACTTA CGATGTTATT TATCGCCCGT
ATTATCGGCG GGATGCTATC AGCAGCTACG ATTCCTACCG CAATGGCTTA CGTTGCTGAC
GTCACGACAC CGCAAGAACG TGGAAAAGCA ATGGGGGCTA TCGGCGCCGC TACCGGTCTT
GGATTTATTT TTGGTCCTGC GATTGGCGGC ATTTTTTCGA AAATAAATTT GCATATTCCT
TTCTTTATCT CAGGAACACT ATCCGCTGTT ACCGCATGTC TCGTGCTTCT TTTTTTGAAA
GAATCATTGA CAAAAGAAAA ACAACCGGCA ACTTTAAAAA CAAAAGAACC GATATGGTAC
ATACTAAAAG GGCCGCTTTT ATTTCTGTAC CTTCTTCAAT GGCTTATTAC TTTTTCATTG
GCTGGGCTAG AAGCAACATT CGCTTACTAC GCAGCAAAAC GCGCAGAATT ATATTCCTCG
CAATTAGGCT ATATTTTTAT GATTATGGGG CTTGCAAGTG CGATTGTCCA AGGAGGGCTC
ATTGGAAAAC TAATTCAAAA ATTTGGTGAA GGCCGCGTCA TTCAAGGCGG AATCATTGTA
TCAGCCGTTG GATTTGCCCT CATTTTATTC GTTCACAACT TCCTGACCGC CGCTATATTT
TTATCGATTT TTGGCATCGG AAACGGTGTC ATCCGCCCTT GTGTATCCTC GCTAGTCACT
AAACATATAT CAAGCGGCCA AGGGCGTGCT ACCGGATTGC TTTCTTCGTT TGACTCCCTT
GGGCGCATCA TCGGACCTCC AATTGCCGGA CAGATGTTTA CAACAATGAT CGAACTTCCA
TACTTAGCAG GTATTGTATT ATCCTGTTTC GCGTTGATCC TTTATCATGT TTTTGCAAAA
CAGTCACAGC AAGTTTCGTC TTAA
 
Protein sequence
MKKANFLSLL FAVMFLVMSG FGIIIPVLPF LAEEVGATPT QLGLLMATYS LMQLLFAPFW 
GQMSDRYGRK PILFIGIAGL SLSFFLFAVS KTLTMLFIAR IIGGMLSAAT IPTAMAYVAD
VTTPQERGKA MGAIGAATGL GFIFGPAIGG IFSKINLHIP FFISGTLSAV TACLVLLFLK
ESLTKEKQPA TLKTKEPIWY ILKGPLLFLY LLQWLITFSL AGLEATFAYY AAKRAELYSS
QLGYIFMIMG LASAIVQGGL IGKLIQKFGE GRVIQGGIIV SAVGFALILF VHNFLTAAIF
LSIFGIGNGV IRPCVSSLVT KHISSGQGRA TGLLSSFDSL GRIIGPPIAG QMFTTMIELP
YLAGIVLSCF ALILYHVFAK QSQQVSS