Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_04170 |
Symbol | |
ID | 7314092 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | + |
Start bp | 437298 |
End bp | 440012 |
Gene Length | 2715 bp |
Protein Length | 904 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 643610840 |
Product | Cellobiose phosphorylase |
Protein accession | YP_002508170 |
Protein GI | 220931262 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3459] Cellobiose phosphorylase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 58 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTTGGG AATTCAACAA TAACAAGGGT ACCTTTAGTT TGAAGGGGGC TGATGATAAT AGTTATTTAT ATTTTCCACT GACTAACGAG AGTGGAATCA TGTCCTCAAT AACCCCGACA CTAAACGGGG ACATTAAAAC TGACCAGAAT AGTTTTTTGA TGCAGCCTGT ATCTGTTGAG GACCTACATA ATAACAGATC AAACCGTAAT TTCTGGTTAA AAATTGATGG TTATGGGCCG TGGTCAGTAA CGGGTAATTC TGCCAGACAA ATTTCTATGA AATATACCGG AGATAAAAAA ATAGAAACTG CTGATGTGGA AGCAGGGTTT TTATGGCATA AGGTGATAAG AAAAAATAGA AAACTTAACA TTAAAGCAGA AATAACCAAT TTTGTCCCGG TAAATGAGGA CCAGGTTGAG TTAATGAAAG TTACTATCGT TAATACGGGA CAGGAGACTA TAAAATTTAC CCCGACGGCA GCCATTCCAG TATACGGACG TTCTGCTGAT AATCTGAGGG ATCATAGACA TGTTACCTCT TTATTACATC GTATTACTAC TAATTCTAAC GGGGTTATAG TAACCCCTAC TTTGTCCTTT GATGAAAGAG GGCATCAGCC AAATAATACT TCTTATGGTG TATTTGGGCG ATGTTCAGAC TCTTCAGATC CGGTAGGTTT TTGTCCGGTT CTGGAGGACT TTATTGGAGA GGGAGGAACC CTGGACTGGC CCCGTTCTAT TGTGGAAGGT AATGTTGATT TTGTAAAACC CGGTTTTAAA TATGAGGGTT ATGAGAGTAT AGGGGCCATT AAGTTTGGTG ATAAAGAATT AAAAGCCGGG GAGTCTATTT CCTATATTAT TGTAATGGTG ATAGATCATA ATGGGAAAGA CATGAAAAAA TATATTGATA AATACTGTTC TGAAAAAGAT TTTGACAGGG AACTAAATGA AAATAAAAAA TACTGGCAAT CTACAATTGA GCAGCTCACT TTTTATACAG GGGATGAAAC CTTTGATAAC TGGATGAAGT GGGTTACCCT GCAGCCGGTA TTACGCAGGA TATATGGATG TTCATTTTTG CCATATCATG ATTACGGCCG GGGAGGCAGA GGCTGGAGAG ACCTCTGGCA GGATTGTCTT GCTTTATTAT TAATGGATGA TAAGAATGTC AGGTATCTAT TATACAATAA TTTTGCTGGT GTTAGAATAG ATGGCAGTAA TGCGACAATA ATTGGTAATA AACCAGGTGA ATTTATTGCC GACAGGAATA ATATTACCAG GTTATGGATA GACCATGGTG CCTGGCCCCT GCAGACGACA AGTCTATATA TTAATATGTC CGGTGATATA AAATTCTTGA TGGAAAAACA GTCATATTTT AAAGATAGAA TTGTATCCTT TGCCAGAGAG ATTGATAACA GGTGGCATCA GGGAATGGGT AATAAATTAA AAACAGAAGA GGGCCAGGTC TACCACGGTT CCATTCTTGA ACATATTCTC CTGCAAAATC TAACAGTGTT TTTTAATGTT GGGGAGCATA ATAACATAAG ACTGGAGGAT GCTGACTGGA ATGACGGCCT GGACATGGCT TCAGAAAAGG GAGAAAGTGT CGCTTTTACA GCCTTTTATG CCGGTAATTT GTTTGAGATT GTAAACCTGT TAGAAACATT AAGAGATAAA GAAAATGTGA CTGAAGTGAA GTTGCTGGAG GAAATCAAGG TTTTACTGGA TACACTTTAT AATCCGGTAG ATTATAATGA TGTCACTGCC AAAAATGAAG TGTTAAATAA ATATTTCGCC AGTTGTAAAT ATAATATATC CGGTAACCAG GTTACAATAA AAATTGATCA ACTCATCAAA GATATCAAAC GAAAAGCAAA CTGGTTACAG GAACATTTAA GGGAAAATGA ATATATTGGG GATAATAAGG GGCATAAGTG GTTTAATGGA TATTATGATA ATAAAGGTCA GCGGGTTGAA GGCAGGTTTG AAAAGGAAAC CAGGATGACC TTGACCGGAC AGGTATTTAC CACTATGTTT GGTATAGCAA CCGGAGAACA GGTAAAAAAT ATAATTTCTG CTGCTGATTA TTACCTCTAT GATGATGGTG TAGGAGGTTA CCGGTTAAAT ACAGATTTTA AGGACAAAGA TATTCAGCTT GGACGTTGTT TTGGTTTTGC CTATGGCCAT AAAGAAAATG GAGCCATGTT TAGCCATATG TCTGTAATGT ATGCAAACGC TCTTTACCGG AGAAATTTTG TTAAAGAAGG ATTCAAGGTG TGGCATAGTA TTTATAATCA TTGCCTTGAT TTTGACAGAA GCCGTATTTA TCCCGGTATA CCGGAATACA TCAGCCAACG GGGCAGGGGT ATGTATAGTT ATTTAACGGG TTCTGCCAGC TGGTTATTAC TTACCTTTGC TACGGAAGTC TTTGGTGTTA AAGGGTATTA CGGGGATTTG AGGCTGGAAC CAAAGCTATT AAAGGAGCAA TTTGATGAAG AAGGAAAGGC TACCATTGGT ATTAAATTTG CCGGGAAGAG ATTAACAATA ACTTATGTTA ATCCTCAACT GTTAGACTAT GATTCCTATA AGATAGATAA AGTAATGGTT AATGATGAAA TAATAGATTA TTATGAAGAT AAACAGGTAA CTATTAAAAG GGAAGATATT GAAGAAAGGG ATGAAGTAAT AGATATTACG GTAAGACTGG CATAA
|
Protein sequence | MGWEFNNNKG TFSLKGADDN SYLYFPLTNE SGIMSSITPT LNGDIKTDQN SFLMQPVSVE DLHNNRSNRN FWLKIDGYGP WSVTGNSARQ ISMKYTGDKK IETADVEAGF LWHKVIRKNR KLNIKAEITN FVPVNEDQVE LMKVTIVNTG QETIKFTPTA AIPVYGRSAD NLRDHRHVTS LLHRITTNSN GVIVTPTLSF DERGHQPNNT SYGVFGRCSD SSDPVGFCPV LEDFIGEGGT LDWPRSIVEG NVDFVKPGFK YEGYESIGAI KFGDKELKAG ESISYIIVMV IDHNGKDMKK YIDKYCSEKD FDRELNENKK YWQSTIEQLT FYTGDETFDN WMKWVTLQPV LRRIYGCSFL PYHDYGRGGR GWRDLWQDCL ALLLMDDKNV RYLLYNNFAG VRIDGSNATI IGNKPGEFIA DRNNITRLWI DHGAWPLQTT SLYINMSGDI KFLMEKQSYF KDRIVSFARE IDNRWHQGMG NKLKTEEGQV YHGSILEHIL LQNLTVFFNV GEHNNIRLED ADWNDGLDMA SEKGESVAFT AFYAGNLFEI VNLLETLRDK ENVTEVKLLE EIKVLLDTLY NPVDYNDVTA KNEVLNKYFA SCKYNISGNQ VTIKIDQLIK DIKRKANWLQ EHLRENEYIG DNKGHKWFNG YYDNKGQRVE GRFEKETRMT LTGQVFTTMF GIATGEQVKN IISAADYYLY DDGVGGYRLN TDFKDKDIQL GRCFGFAYGH KENGAMFSHM SVMYANALYR RNFVKEGFKV WHSIYNHCLD FDRSRIYPGI PEYISQRGRG MYSYLTGSAS WLLLTFATEV FGVKGYYGDL RLEPKLLKEQ FDEEGKATIG IKFAGKRLTI TYVNPQLLDY DSYKIDKVMV NDEIIDYYED KQVTIKREDI EERDEVIDIT VRLA
|
| |