Gene Information |
Locus tag | GWCH70_0421 |
Symbol | |
ID | 7978578 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | - |
Start bp | 475759 |
End bp | 476904 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 644797407 |
Product | major facilitator superfamily MFS_1 |
Protein accession | YP_002948607 |
Protein GI | 239825983 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
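The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not the database's own method. It assumes the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; both assumptions are consistent with the values in this record. The function names `gene_length` and `protein_length` are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(475759, 476904)
print(length, protein_length(length))  # prints: 1146 381
```

Both results agree with the Gene Length and Protein Length fields reported above.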
| |
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00000714938 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTTTT GGATTTTAAT CAGCATTGTC GCTATTTCTG GTTTATCACA AGGGATGCTG CTGCCGCTTC TTTCTATATT GCTTGAAAAA CATGGATTTT CCTCTTCTGC AAATGGCATA CATGCCACAG CGTTGTATAT CGGTGTCCTA TTGATCTCTC CATTTTTAGA AAAACCGTTG CGTAAATATG GATACCGGCC TATGATTATC CTTGGTGGTT TTATTGTAAT ATTATCGCTC GCCTTATTTT CAGCTTTTCA TTCATTTTTG ATTTGGTTTT TCTTGCGCCT TTGCATCGGA ATTGGTGACC ATATGCTCCA TTTCGCAACA CAAACATGGA TTACCGATTT TTCTCCCGCA CAACGGCGAG GACGTAACTT GTCGCTATAC GGGCTATTTT TCGGCATCGG TTTTTCAGCT GGTCCGCTGT TAGCGTCACT CATTCAATTC CATGAATCGT TGCCGTTTTT CTTATCATCG CTACTTAGCC TTATAGGCTG GTGTAGCGTC TTTTTCTTGC CAAACGAACG GCCGCAGGAA AGCGAACAAT CTGGTTCGGC ACACACATTT CAACGCTTCG TTCATGCATG GAAATACGCG TGGGTTGCTT TATTGCTTCC GTTTACTTAC GGCTTTTTGG AAGCATCGAT TCATGCCATT TTCCCTGTTT ACGCTTTGCG GGAACATATT GGTATAGAAC ATGTAGCATT TATTTTACCA GCCTTTTCAC TTGGAGGTAT CATTTTTCAA TTGCCTCTTG GAGTATTAAG TGATCGTTTT CAGCGAAAAC GAGTCATTTC CGTTGCTTTA TTGATCGGAA GTGCCAGCTT TTTTAGTGCT TATTTATTCC ATCACTCCCT TGTCGGGCTT GCCGTTTGTT TCTTTATTGC CGGTATGTTT GTCGGTTCTT TGTTTTCACT CGGGATTACG TATATGGCAG ATTTGCTGCC AAAACAGCTC TTCCCAGCCG GAAACTTACT ATGTGGTATG CTATACAGCA TCGGCAGCAT GATCGGTCCG TTTATGACCG GCTTAATGAT TCAATTCGGG GCGAATCACA ACTTTTTCTT TACGATAAGC GCACTTCTCT TCCTTGTCTT TGTGCCGCTG TTATGGAATA AAACAAAGAA GGTCACTCAT GGATGA
|
Protein sequence | MRFWILISIV AISGLSQGML LPLLSILLEK HGFSSSANGI HATALYIGVL LISPFLEKPL RKYGYRPMII LGGFIVILSL ALFSAFHSFL IWFFLRLCIG IGDHMLHFAT QTWITDFSPA QRRGRNLSLY GLFFGIGFSA GPLLASLIQF HESLPFFLSS LLSLIGWCSV FFLPNERPQE SEQSGSAHTF QRFVHAWKYA WVALLLPFTY GFLEASIHAI FPVYALREHI GIEHVAFILP AFSLGGIIFQ LPLGVLSDRF QRKRVISVAL LIGSASFFSA YLFHHSLVGL AVCFFIAGMF VGSLFSLGIT YMADLLPKQL FPAGNLLCGM LYSIGSMIGP FMTGLMIQFG ANHNFFFTIS ALLFLVFVPL LWNKTKKVTH G
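The gene and protein sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that grouping, compute GC content, and translate the coding sequence. It is an illustration under stated assumptions, not the tool used to produce this record: the helper names (`clean`, `gc_percent`, `translate`) are hypothetical, and the codon table is the standard amino-acid assignment, which translation table 11 shares (table 11 differs from the standard code only in its permitted start codons).

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the same
# codon-to-amino-acid mapping ('*' marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq: str) -> str:
    """Remove the whitespace between ten-character blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
print(translate("ATGCGCTTTT GGATTTTAAT CAGCATTGTC"))  # prints: MRFWILISIV
```

The printed residues match the first block of the protein sequence in this record. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.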