Gene GWCH70_3374 details

Gene Information

Locus tag: GWCH70_3374
Symbol:
ID: 7977130
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Geobacillus sp. WCH70
Kingdom: Bacteria
Replicon accession: NC_012793
Strand:
Start bp: 3401032
End bp: 3402228
Gene Length: 1197 bp
Protein Length: 398 aa
Translation table: 11
GC content: 47%
IMG OID: 644800141
Product: major facilitator superfamily MFS_1
Protein accession: YP_002951280
Protein GI: 239828656
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2271] Sugar phosphate permease
TIGRFAM ID:
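
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes a single terminal stop codon. The function names are hypothetical and not part of any database tool.

```python
# Values copied from the Gene Information fields above.
start_bp = 3401032
end_bp = 3402228
gene_length_bp = 1197
protein_length_aa = 398


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length_aa(gene_length: int) -> int:
    """Residues encoded by a CDS, assuming its length includes one stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Both comparisons hold for this record under the stated assumptions.
print(span_length(start_bp, end_bp) == gene_length_bp)
print(coding_length_aa(gene_length_bp) == protein_length_aa)
```

Under these assumptions, 3402228 - 3401032 + 1 gives 1197 bp, and 1197 / 3 - 1 gives 398 aa, matching the listed values.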


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 5.27195e-05
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGCTTCAAC AAACGATTTC GCAGCGAAAG CTGTTAGGAG TTGCCGGGCT TGGCTGGATG 
TTTGACGCGA TGGATGTTGG CATGCTGTCG TTTATTATCG CGGCGTTGCA AAAAGATTGG
AACTTAAGTG TCGAACAAAT GGGATGGATC GGCAGCATCA ACTCCATCGG TATGGCAGTT
GGCGCGCTGT TATTCGGATT GTTGGCCGAC CGCATCGGCA GAAAAAACGT GTTTATTATT
ACACTATTAT TATTCTCGAT CGGAAGCGGA CTATCTGCCT TAACGACGAC ATTGACGGCG
TTTTTAGTTT TACGGTTTCT CATAGGCATG GGGTTGGGCG GAGAACTGCC GGTTGCTTCG
ACGCTTGTAT CGGAAAGCGT GCCGGCGCAA GAACGTGGAA AAGCTGTTGT GCTGCTGGAA
AGCTTTTGGG CAGTCGGCTG GTTGTTATCC GCGTTAATTT CATATTTTGT CATTCCAACA
TACGGTTGGC AAACAGCGCT ATTGCTTGCG GCGATTCCGG CGTTATACGC CCTATATTTA
CGATGGGGAT TGCCTGATTC GCCAAGGTTT ACAAGTGCGC GCAAAGAAGA AACCGTATGG
GACAACATCG TCAAGGTTTG GTCGTCTTCT TACCGGAAAG AAACGTTCAT GCTTTGGGTG
CTTTGGTTTT GCGTCGTATT TTCTTACTAC GGTATGTTTT TATGGCTGCC AAGCGTAATG
GTTATGAAAG GGTTTAGCTT AATTAAAAGC TTCGAGTATG TCTTGATTAT GACGCTGGCG
CAATTGCCTG GCTATTTTAG CGCCGCATGG CTTATTGAAC GAGCGGGTCG GAAATTTGTG
CTCATCACGT ATTTGATTGG TACGGCCGTT AGTGCCTATT TCTTTGGCAA CGCGGATTCG
CTTGCACTGC TCATGACCTT TGGCATTTTA CTATCGTTTT TTAACCTTGG CGCATGGGGA
GCGTTATACG CCTATACTCC AGAGCTTTAC CCGACTTCGA TTCGCGGCAC GGGAGCTGGG
ATGGCGGCGT CATTTGGACG CATCGGCGGC ATTTTAGGAC CGCTTTTCGT CGGCTATCTT
GTCAATAGAC ATATTACGAT TACAACGATT TTTCTGATTT TCTGTATTTC TATTTTCATT
GGCGTTATTG CGGTATGGGT GTTAGGAAAA GAAACGAAGC AACAGGAATT GGCATAG
 
Protein sequence
MLQQTISQRK LLGVAGLGWM FDAMDVGMLS FIIAALQKDW NLSVEQMGWI GSINSIGMAV 
GALLFGLLAD RIGRKNVFII TLLLFSIGSG LSALTTTLTA FLVLRFLIGM GLGGELPVAS
TLVSESVPAQ ERGKAVVLLE SFWAVGWLLS ALISYFVIPT YGWQTALLLA AIPALYALYL
RWGLPDSPRF TSARKEETVW DNIVKVWSSS YRKETFMLWV LWFCVVFSYY GMFLWLPSVM
VMKGFSLIKS FEYVLIMTLA QLPGYFSAAW LIERAGRKFV LITYLIGTAV SAYFFGNADS
LALLMTFGIL LSFFNLGAWG ALYAYTPELY PTSIRGTGAG MAASFGRIGG ILGPLFVGYL
VNRHITITTI FLIFCISIFI GVIAVWVLGK ETKQQELA
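
The sequences above are laid out in space-separated blocks of 10 characters, 60 per row, so they need to be joined before any downstream use. The following is a minimal sketch of one way to do that and to compute a GC fraction. The helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first row of the gene sequence is used as sample input, so the printed fraction describes that row and not the whole gene.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First row of the gene sequence, copied as displayed (six blocks of 10 bases).
first_row = "ATGCTTCAAC AAACGATTTC GCAGCGAAAG CTGTTAGGAG TTGCCGGGCT TGGCTGGATG"

seq = join_blocks(first_row)
print(len(seq))                  # number of bases in this row
print(seq.startswith("ATG"))     # whether the row begins with an ATG start codon
print(round(gc_fraction(seq), 3))
```

Applying `join_blocks` to all rows of the gene sequence at once (pasted as one multi-line string) would yield the full coding sequence for the same calculation.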