Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_0603 |
Symbol | |
ID | 7978792 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 669655 |
End bp | 670830 |
Gene Length | 1176 bp |
Protein Length | 391 aa |
Translation table | 11 |
GC content | 44% |
IMG OID | 644797592 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002948766 |
Protein GI | 239826142 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.0000000000645746 |
Plasmid hitchhiking | No |
Plasmid clonability | unclonable |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGGGAAC CACAAAGTCA AAACATCATA GATGCGACGA AGGAAAAAAT TTGGACAAAA GATTTTATAC TTATTTGTTT AGCTAATTTT TTTGTTTTTC TTGGATTTCA AATGACATTG CCGACGATTC CGCTGTTTGT CGAGCATCTT GGCGGCAATG ACCAGTTGAT CGGTTTAGTG GTCGGGATTT TCACATTTGC GGCATTAATG GTGCGGCCGT TTGCCGGGCA TGCGTTAGAA ACAAAGGGGC GGCGTTTTGT ATTTTTGCTA GGGCTTGCGA TCTTTGTCCT TTCGGTCGGA TCGTACAGCT TTATTGCGAG CATTTTTTTG TTGTTTATGA TGAGGGTCAT TCAAGGCATT GGGTGGGGGT TTTCGACTAC TGCCTCCGGA ACGATTGCCA CGGACATTAT CCCTGCCAGC CGGAGAGGCG AAGGAATGGG GTATTATGGA CTATCAGGGA ATTTAGCACT TGCTTTTGGG CCATCTATGG GGTTATTGCT CGCCGGAATG TTGTCATTCC GCCATCTTTT TTTGATTTGT GCCGTTTTAG GTTTGGCGTC ACTATTGTTT GCTTCGAATA TTACATATAA AAAAATCGGA CAGCCGCAAG CGCAAGCGCG TAATAAGTGG GATATTTATG AAAAAAGCGC GCTAGAGCCT TCAATTTTGC TATTTTTCCT TACGGTGACA TTTGGAGGAA TTGCCTCATT TCTGCCACTA TATACTGCAC AAAAAGGAAT TTCAGGAATT CAATGGTATT TCTTGCTATA TGCGCTTGCG TTAATGGTAA CAAGAACGTT TGCCGGCCGG TTGTATGATA GAAAAGGCCA TCAGGCGGTG TTTATACCAG GTGCTGCACT GATTTTCATT GCAATGTTAT TGTTGGCTTG GCTGCCAAGC AATGCGATTT TGTTTATCGC AGCGATTCTT TACGGGCTAG GGTTTGGAAC GGTGCAGCCA GCACTGCAAG CATGGTCTGT GGAAAAAGCG GCAAAAAACC GAAAAGGAAT GGCAAATGCG ACCTTTTTCG CCTTTTTCGA TTTAGGGGTC GGAGTAGGAG CGATGGCTTT TGGGCAAATT GGCCATTGGT TCGGTTACTC TAGCATTTAT ATAACTGCCG CACTATCCGT GTTGATTTCT ATTAGCTTTT ACTTATATAT TTTGCATAAA AAATAA
|
Protein sequence | MGEPQSQNII DATKEKIWTK DFILICLANF FVFLGFQMTL PTIPLFVEHL GGNDQLIGLV VGIFTFAALM VRPFAGHALE TKGRRFVFLL GLAIFVLSVG SYSFIASIFL LFMMRVIQGI GWGFSTTASG TIATDIIPAS RRGEGMGYYG LSGNLALAFG PSMGLLLAGM LSFRHLFLIC AVLGLASLLF ASNITYKKIG QPQAQARNKW DIYEKSALEP SILLFFLTVT FGGIASFLPL YTAQKGISGI QWYFLLYALA LMVTRTFAGR LYDRKGHQAV FIPGAALIFI AMLLLAWLPS NAILFIAAIL YGLGFGTVQP ALQAWSVEKA AKNRKGMANA TFFAFFDLGV GVGAMAFGQI GHWFGYSSIY ITAALSVLIS ISFYLYILHK K
|
| |