Gene GWCH70_1998 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_1998 
Symbol 
ID7978954 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2054657 
End bp2055928 
Gene Length1272 bp 
Protein Length423 aa 
Translation table11 
GC content46% 
IMG OID644798825 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002949995 
Protein GI239827371 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00000565006 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGAATAC GCGACTGGGA TTGGAATTTA AAGGTGCGCT TATTTGGCGA GGCGCTCGTC 
CATATCGCAT TTTGGATGTT TTTTCCGTTT ATGGCGATAT ATTTTACTGA GTCATTTGGA
AAGGATAAAG CAGGACTATT ATTGATTGTT TCGCAAATTT TTTCCGTGAT CGCTAGTTTG
ATGGGCGGAT ATTGCGCGGA TGTATTCGGA CGCAAGCGGA TGATGGTGTT GTCTGCCTAT
GGTCAAGGGG CAGCGTTTTT CTTTTTTGCG CTTGCTAGTT CTCCGTGGTT TACTTCGCCG
TTCGTGGGGT ATCTTTGCTT TACGATCGCC GGAATTTGCG GGGCGTTTTA CTGGCCCGCA
AGCCAGGCGA TGGTTGCCGA TGTGGTGCCG GAAGAACATC GCAGCAGCGT CTTTGCCGTG
TTTTATATGT CCATCAATGT TGCTGTTGTG GTCGGACCGA TTATTGGCGG GGTGTTTTAT
GAACATTATC TTTTTGAGTT ATTGCTTGCT ACGGCATTTT TGTTTTTGTT TCTCGCTGCG
GTATTGACAA AATGGCTTCG TGAGACTGTG CCCGCGCGTG AAAACGGAGA GCTGCTGCAC
GGGAAATGGT ATGAGTTTTT GTGGCAGCAA GTCCGTCAGT ACAGCGTGAT TGTGCGCGAC
CGAATATTTC TATTGTTTAT CATTGCAGGT ATTCTTGTCG CACAGACGGT TATGCAGCTG
GATTTACTGA TTCCTGTGTA TACAAAAGAT GCGGTGGAGA GGCAAACGTT ATTTTCCTTC
GGTGATTGGT CGTTTACGCT TACTGGAGAG AAAGCATTTG GCCTTCTTAT TTCCGAAAAC
GGCTTGCTTG TTGTGCTGTT TACCGTGTGG GTAACGAAAT GGATGGAACG TTATCATGAA
CGGACGGCGT TTGTTGGGTC GTCCATTGTT TATGGAATTG CGATTTTTTT GTTTGGACAA
ACGACATCGA TTTGGGGACT TATTTTGGTG ATGGGATTGT TTACGTTTGG GGAGCTGATG
ACAGTAGGAA TCCAGCAAAC GTTTATTTCC AAGCTTGCTC CGGAGGAGAT GCGCGGGCAG
TATTTTGCGG CTGCGAGCTT GCGCTGGACG ATCGGACGGG CGATTGCCCC GCTTTCGATC
ACGGCAACGA TATGGCTCGG GTATGAATGG ACGTTTTTCT TGTTAAGTAT GCTTTCTTTC
ATAAGTGCGG CCTTATATAT GATTATGTTT CAATTGCTTG AAAAACGGCA AACACACAAA
GCATTAACAT AA
 
Protein sequence
MRIRDWDWNL KVRLFGEALV HIAFWMFFPF MAIYFTESFG KDKAGLLLIV SQIFSVIASL 
MGGYCADVFG RKRMMVLSAY GQGAAFFFFA LASSPWFTSP FVGYLCFTIA GICGAFYWPA
SQAMVADVVP EEHRSSVFAV FYMSINVAVV VGPIIGGVFY EHYLFELLLA TAFLFLFLAA
VLTKWLRETV PARENGELLH GKWYEFLWQQ VRQYSVIVRD RIFLLFIIAG ILVAQTVMQL
DLLIPVYTKD AVERQTLFSF GDWSFTLTGE KAFGLLISEN GLLVVLFTVW VTKWMERYHE
RTAFVGSSIV YGIAIFLFGQ TTSIWGLILV MGLFTFGELM TVGIQQTFIS KLAPEEMRGQ
YFAAASLRWT IGRAIAPLSI TATIWLGYEW TFFLLSMLSF ISAALYMIMF QLLEKRQTHK
ALT