Gene GWCH70_0421 details

Gene Information

Locus tag: GWCH70_0421
Symbol:
ID: 7978578
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 475759
End bp: 476904
Gene Length: 1146 bp
Protein Length: 381 aa
Translation table: 11
GC content: 42%
IMG OID: 644797407
Product: major facilitator superfamily MFS_1
Protein accession: YP_002948607
Protein GI: 239825983
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
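
The coordinate and length fields in this record are mutually consistent, which can be checked with a few lines of Python. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(475759, 476904)
print(length_bp)                  # 1146
print(protein_length(length_bp))  # 381
```

Both values match the Gene Length and Protein Length fields of the record.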


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00000714938
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGCTTTT GGATTTTAAT CAGCATTGTC GCTATTTCTG GTTTATCACA AGGGATGCTG 
CTGCCGCTTC TTTCTATATT GCTTGAAAAA CATGGATTTT CCTCTTCTGC AAATGGCATA
CATGCCACAG CGTTGTATAT CGGTGTCCTA TTGATCTCTC CATTTTTAGA AAAACCGTTG
CGTAAATATG GATACCGGCC TATGATTATC CTTGGTGGTT TTATTGTAAT ATTATCGCTC
GCCTTATTTT CAGCTTTTCA TTCATTTTTG ATTTGGTTTT TCTTGCGCCT TTGCATCGGA
ATTGGTGACC ATATGCTCCA TTTCGCAACA CAAACATGGA TTACCGATTT TTCTCCCGCA
CAACGGCGAG GACGTAACTT GTCGCTATAC GGGCTATTTT TCGGCATCGG TTTTTCAGCT
GGTCCGCTGT TAGCGTCACT CATTCAATTC CATGAATCGT TGCCGTTTTT CTTATCATCG
CTACTTAGCC TTATAGGCTG GTGTAGCGTC TTTTTCTTGC CAAACGAACG GCCGCAGGAA
AGCGAACAAT CTGGTTCGGC ACACACATTT CAACGCTTCG TTCATGCATG GAAATACGCG
TGGGTTGCTT TATTGCTTCC GTTTACTTAC GGCTTTTTGG AAGCATCGAT TCATGCCATT
TTCCCTGTTT ACGCTTTGCG GGAACATATT GGTATAGAAC ATGTAGCATT TATTTTACCA
GCCTTTTCAC TTGGAGGTAT CATTTTTCAA TTGCCTCTTG GAGTATTAAG TGATCGTTTT
CAGCGAAAAC GAGTCATTTC CGTTGCTTTA TTGATCGGAA GTGCCAGCTT TTTTAGTGCT
TATTTATTCC ATCACTCCCT TGTCGGGCTT GCCGTTTGTT TCTTTATTGC CGGTATGTTT
GTCGGTTCTT TGTTTTCACT CGGGATTACG TATATGGCAG ATTTGCTGCC AAAACAGCTC
TTCCCAGCCG GAAACTTACT ATGTGGTATG CTATACAGCA TCGGCAGCAT GATCGGTCCG
TTTATGACCG GCTTAATGAT TCAATTCGGG GCGAATCACA ACTTTTTCTT TACGATAAGC
GCACTTCTCT TCCTTGTCTT TGTGCCGCTG TTATGGAATA AAACAAAGAA GGTCACTCAT
GGATGA
 
Protein sequence
MRFWILISIV AISGLSQGML LPLLSILLEK HGFSSSANGI HATALYIGVL LISPFLEKPL 
RKYGYRPMII LGGFIVILSL ALFSAFHSFL IWFFLRLCIG IGDHMLHFAT QTWITDFSPA
QRRGRNLSLY GLFFGIGFSA GPLLASLIQF HESLPFFLSS LLSLIGWCSV FFLPNERPQE
SEQSGSAHTF QRFVHAWKYA WVALLLPFTY GFLEASIHAI FPVYALREHI GIEHVAFILP
AFSLGGIIFQ LPLGVLSDRF QRKRVISVAL LIGSASFFSA YLFHHSLVGL AVCFFIAGMF
VGSLFSLGIT YMADLLPKQL FPAGNLLCGM LYSIGSMIGP FMTGLMIQFG ANHNFFFTIS
ALLFLVFVPL LWNKTKKVTH G
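
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the database's own pipeline. It builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are permitted (this gene starts with ATG, so no special start handling is needed here). The names `CODON_TABLE` and `translate` are hypothetical helpers, and only the first 30 and last 12 bases of the gene are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence
print(translate("ATGCGCTTTT GGATTTTAAT CAGCATTGTC"))  # MRFWILISIV
# Last 12 bases of the gene sequence, including the TGA stop codon
print(translate("ACTCAT GGATGA"))  # THG
```

The two outputs correspond to the first ten and last three residues of the protein sequence listed above.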