Gene GWCH70_0603 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_0603 
Symbol 
ID7978792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp669655 
End bp670830 
Gene Length1176 bp 
Protein Length391 aa 
Translation table11 
GC content44% 
IMG OID644797592 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002948766 
Protein GI239826142 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000645746 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGGGGAAC CACAAAGTCA AAACATCATA GATGCGACGA AGGAAAAAAT TTGGACAAAA 
GATTTTATAC TTATTTGTTT AGCTAATTTT TTTGTTTTTC TTGGATTTCA AATGACATTG
CCGACGATTC CGCTGTTTGT CGAGCATCTT GGCGGCAATG ACCAGTTGAT CGGTTTAGTG
GTCGGGATTT TCACATTTGC GGCATTAATG GTGCGGCCGT TTGCCGGGCA TGCGTTAGAA
ACAAAGGGGC GGCGTTTTGT ATTTTTGCTA GGGCTTGCGA TCTTTGTCCT TTCGGTCGGA
TCGTACAGCT TTATTGCGAG CATTTTTTTG TTGTTTATGA TGAGGGTCAT TCAAGGCATT
GGGTGGGGGT TTTCGACTAC TGCCTCCGGA ACGATTGCCA CGGACATTAT CCCTGCCAGC
CGGAGAGGCG AAGGAATGGG GTATTATGGA CTATCAGGGA ATTTAGCACT TGCTTTTGGG
CCATCTATGG GGTTATTGCT CGCCGGAATG TTGTCATTCC GCCATCTTTT TTTGATTTGT
GCCGTTTTAG GTTTGGCGTC ACTATTGTTT GCTTCGAATA TTACATATAA AAAAATCGGA
CAGCCGCAAG CGCAAGCGCG TAATAAGTGG GATATTTATG AAAAAAGCGC GCTAGAGCCT
TCAATTTTGC TATTTTTCCT TACGGTGACA TTTGGAGGAA TTGCCTCATT TCTGCCACTA
TATACTGCAC AAAAAGGAAT TTCAGGAATT CAATGGTATT TCTTGCTATA TGCGCTTGCG
TTAATGGTAA CAAGAACGTT TGCCGGCCGG TTGTATGATA GAAAAGGCCA TCAGGCGGTG
TTTATACCAG GTGCTGCACT GATTTTCATT GCAATGTTAT TGTTGGCTTG GCTGCCAAGC
AATGCGATTT TGTTTATCGC AGCGATTCTT TACGGGCTAG GGTTTGGAAC GGTGCAGCCA
GCACTGCAAG CATGGTCTGT GGAAAAAGCG GCAAAAAACC GAAAAGGAAT GGCAAATGCG
ACCTTTTTCG CCTTTTTCGA TTTAGGGGTC GGAGTAGGAG CGATGGCTTT TGGGCAAATT
GGCCATTGGT TCGGTTACTC TAGCATTTAT ATAACTGCCG CACTATCCGT GTTGATTTCT
ATTAGCTTTT ACTTATATAT TTTGCATAAA AAATAA
 
Protein sequence
MGEPQSQNII DATKEKIWTK DFILICLANF FVFLGFQMTL PTIPLFVEHL GGNDQLIGLV 
VGIFTFAALM VRPFAGHALE TKGRRFVFLL GLAIFVLSVG SYSFIASIFL LFMMRVIQGI
GWGFSTTASG TIATDIIPAS RRGEGMGYYG LSGNLALAFG PSMGLLLAGM LSFRHLFLIC
AVLGLASLLF ASNITYKKIG QPQAQARNKW DIYEKSALEP SILLFFLTVT FGGIASFLPL
YTAQKGISGI QWYFLLYALA LMVTRTFAGR LYDRKGHQAV FIPGAALIFI AMLLLAWLPS
NAILFIAAIL YGLGFGTVQP ALQAWSVEKA AKNRKGMANA TFFAFFDLGV GVGAMAFGQI
GHWFGYSSIY ITAALSVLIS ISFYLYILHK K