Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | GWCH70_1491 |
Symbol | |
ID | 7976580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Geobacillus sp. WCH70 |
Kingdom | Bacteria |
Replicon accession | NC_012793 |
Strand | + |
Start bp | 1567160 |
End bp | 1568182 |
Gene Length | 1023 bp |
Protein Length | 340 aa |
Translation table | 11 |
GC content | 35% |
IMG OID | 644798393 |
Product | protein of unknown function UPF0118 |
Protein accession | YP_002949566 |
Protein GI | 239826942 |
COG category | [R] General function prediction only |
COG ID | [COG0628] Predicted permease |
TIGRFAM ID | |
| ![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_cp.jpg)
![](https://exploration.weizmann.ac.il/pandatox/images_new/ic_hh.jpg)
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.961051 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGGGATCAG AAATCATGCA ATTTTTCAAG AGATCGACGT GGAAGAGAAT CGTTATTTTT TCTATTTTAA TTTTTATTCT CTATTTAGTT CGAAGCATTT TAAATATTAT TTTGCTTACT TTTATTTTTT CTTTTTTAAT GAATGGGCTT GTTGACTTTA TTTCTAAAAG GATTCGCATC AATCGGAAAA TTATTGTGCT GCTATTGTAT GCCGTGGTCG TTTCCATCCT TGTCTACGGC GTCGTCAAAT ATTTGCCCGT CGTTATTAAC GAGATTTCAC AGCTGATCAA ACAGCTTACG GAATTTTATT TGAAGCCTAA AGATAATATT GTGTTTAATT ACATTATCGA GCAACTAAAG CAATATGAAA TTACTACTTA TCTGAATTTT GGAATGACGT ATTTAATCAA ATATTTCAGC GACATTAGCA GTATCGGTCT GCAAATTTTG CTTGCGTTAA TTTTAAGTTT ATTTTTCCTT CTGGAAAAAG AGCGGATTAT TTCGTTCACG AATCGTTTTA AATATAGCAA AATCAGCTCT TTTTATGACG AATTGGAATA TTTCGGAAAA AAATTTGCCC GTACGTTTGG GAAAGTTATT GAAGCTCAAT TTGTCATTGC AACGATTAAT ATGATAATTA CGGTAGCGTG TTTGTGGATG ATGGGATTCC CACAATTATT CGGGCTTGGG ATATTAATCT TTTTTCTCGG GTTGATTCCT GTTGCGGGCG TGATCATTTC GTTGTTTCCG CTGTGCTTTA TTGCTTATAG CATAGGCGGA GTAATGAAAG TCGTTTACGT TCTCATTATG ATCGCCGTTG TTCACGCGCT CGAAGCGTAT GTTTTAAATC CAAAATTGAT GTCCTCGAAG ACAAATTTGC CCGTTTTTTA TACGTTTATC GTTTTGATTT TTTCAGAGCA CTTTTTTGGG GTATGGGGGT TGATTTTAGG GATTCCCGTA TTTGTATTTA TTCTTGATGT GCTGGGAGTA AAGACAGTGC AGGGAAAACG GGATGATGAA TAA
|
Protein sequence | MGSEIMQFFK RSTWKRIVIF SILIFILYLV RSILNIILLT FIFSFLMNGL VDFISKRIRI NRKIIVLLLY AVVVSILVYG VVKYLPVVIN EISQLIKQLT EFYLKPKDNI VFNYIIEQLK QYEITTYLNF GMTYLIKYFS DISSIGLQIL LALILSLFFL LEKERIISFT NRFKYSKISS FYDELEYFGK KFARTFGKVI EAQFVIATIN MIITVACLWM MGFPQLFGLG ILIFFLGLIP VAGVIISLFP LCFIAYSIGG VMKVVYVLIM IAVVHALEAY VLNPKLMSSK TNLPVFYTFI VLIFSEHFFG VWGLILGIPV FVFILDVLGV KTVQGKRDDE
|
| |