Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Hore_20490 |
Symbol | |
ID | 7314373 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halothermothrix orenii H 168 |
Kingdom | Bacteria |
Replicon accession | NC_011899 |
Strand | - |
Start bp | 2211551 |
End bp | 2213803 |
Gene Length | 2253 bp |
Protein Length | 750 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 643612493 |
Product | Beta-galactosidase |
Protein accession | YP_002509789 |
Protein GI | 220932881 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00000040433 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACGGA AAATATTAAA CTTTAATACG GACTGGTTTT TTCTTGATAA GGATATAGAA GGAGCAAAAG GAATAGATTT TAGTCAATCA GGGATGGAGA AAGTAAATCT ACCTCATCCC AACAGGATTT TACCACACCA TTATTTTGAA GAATCTGATT ATCAGTTTGT TTCCTGGTAC AGGCGCCCTT TTTACCTGGA AGAAGAGTAT AAAGGGAAAA GGGTGATAGT AGAGTTTGAT GGAGTAATGA CGGTTGCTGA AATATATGTT AATGGGCAGT TTGTGGGTGA GCATAAAGGG GGCTATACTT CTTTTAGTTT TGATATAACA GATTATCTGC TATCTGGAGA AAATAATCTG CTGGCAGTCA GGGTGGATTC CAGCCAGAGA AAGGATATTC CCCCGGAAGG CAACCTGGTC GATTACCTTT TGTTTGGAGG TATATACCGG GATGTTAAAA TGGTGATAGT AGACCCTGTT TATATTAACT GGTCTTTTAT TGAACTTAAA GATGTAAACC TGGAAGCAGG TGTTATAAAA CCCAGGTTTG AGCTCGTAAA CACAACAGGT AACCGGCAAA AAATAGTCTT AAATAGTCAG GTTATTAATA AAGAAGGTAA GGTTGTGGCA ATGGTAGAAT CCAGACACCT GCTTGAGCCC GGTGTAACCT CTCTGGAGCA GCCTGAGGTC AAGATAAAAG AACCTGAGTT ATGGCATCCT GACCATCCCT ATCTCTATCA TGTTTATACT GAAGTTAAAG TTGAAGGAAA GCTGGTTGAT GATTATAAAA CCAGAATCGG ACTCAGGAAA GTGGAATTTA AAGAGGATGG AAAGTTTTAT ATCAACGATA AGCCCCTTAA ACTCAGGGGA CTTAACAGGC ATCAAATGTT TCCGTATCTA GGTAATGCTA TGCCTGACCG GGGCCAGAGG AAGGATGCTG AGATTTTGAA GTATGAACTG GGGTTAAATT TTGTCCGTTC TTCCCATTAC CCGGCTGATT CTTCGTTTTT AGATAAGTGT GATGAAATAG GTTTATTAGT CCTGGAAGAG ATCCCCGGAT GGCAGCATAT CGGGAACAGG GACTGGCAGG AGTTATCTAA AAGAAATGTT GAAGAGATGA TAGTCAGAGA CCGGAACCAT CCCTGTATTT TCCTCTGGGG TGTTAGAATT AATGAATCTC CGGATAACCA TGATTTTTAC CTTGAGACAA ATGAAATTGC CCACAGACTG GACAGTACCA GGCCGACCTG TGGGATAAGG AATTTTCAGG ATAGTGAGTT TCTGGAAGAT GTATTTACTT ATAATGATTT TGAGTTAAAT CTCGAAGGAA AAATTAAATT ACCTAACCAC CAACCATATA TGATAACCGA ATATATGGGT CATATGTATC CAACCAAGGC CTATGATAGT GTCGAAAGGT TAATTAAACA CGCTGTCCGG CACGCCCATA TACAGGATAA GCAGTATGGG GTACCTTATC TGGCAGGGGC CTCAGGGTGG TGTGCCTTTG ATTATAATAC CCATGCTGAT TTTGGATCAG GTGACAGGGT ATGCTATCAC GGAGTCTGTG ATATGTTCAG GTTACCCAAA TTTGCTGCTT ATTTTTATAA AAGCCAGATA GACCCGGATG TGGAAAAGGT TGTATTTATT GCTCGATACC TGACCCCATC TTTTAATGAG GATTATGGCG ATGAGGTTAT TGTTTTTAGT AACTGTGAAG AGGTTGAACT ATATGTTGGT GATAAATTAA TAACATCAGC TAGACCAAAC CGGGTTGATT ACCCCAGTTT ACCCCACCCG CCCTTTACCT TTAAAGACTG TACCTGGTGG GAGTGGGGGG CCAGCACCAT TTCCTGCCTG AAAGCGGTCG GTAAAATAGA TGGGAAACAG GTTGCCGAGC ACACTATTTA TCCCTTTGGC AGGCCGGAGA GGTTAGTATT AAAGCCGGAT TACACTAAAC TTACGGCAGA TGGTGCTGAT TGTACCCGGG TTGTGGTTGA GCTTCAGGAT GAGCACGGAC AGGTCCTCCA TCTGGCCCAT CATCCGGTTT TCTTTGAACT GGAAGGGGTG GGGGAACTAA TTGGAGAAAA CCCCTTTAGC CTGGAAGTAG GGAGAGGTGC TGTCTTTATA AGGGCCGGGA GAACTCCAGG GAAAATACAG CTGACAGGTA AGGTCCAGGG ATTACCACCG GTCACAATAG TTGTATCTAC TGAACCTCTG GAAGATAAGA TAGTACCATT ACCCAGGAAA TAA
|
Protein sequence | MKRKILNFNT DWFFLDKDIE GAKGIDFSQS GMEKVNLPHP NRILPHHYFE ESDYQFVSWY RRPFYLEEEY KGKRVIVEFD GVMTVAEIYV NGQFVGEHKG GYTSFSFDIT DYLLSGENNL LAVRVDSSQR KDIPPEGNLV DYLLFGGIYR DVKMVIVDPV YINWSFIELK DVNLEAGVIK PRFELVNTTG NRQKIVLNSQ VINKEGKVVA MVESRHLLEP GVTSLEQPEV KIKEPELWHP DHPYLYHVYT EVKVEGKLVD DYKTRIGLRK VEFKEDGKFY INDKPLKLRG LNRHQMFPYL GNAMPDRGQR KDAEILKYEL GLNFVRSSHY PADSSFLDKC DEIGLLVLEE IPGWQHIGNR DWQELSKRNV EEMIVRDRNH PCIFLWGVRI NESPDNHDFY LETNEIAHRL DSTRPTCGIR NFQDSEFLED VFTYNDFELN LEGKIKLPNH QPYMITEYMG HMYPTKAYDS VERLIKHAVR HAHIQDKQYG VPYLAGASGW CAFDYNTHAD FGSGDRVCYH GVCDMFRLPK FAAYFYKSQI DPDVEKVVFI ARYLTPSFNE DYGDEVIVFS NCEEVELYVG DKLITSARPN RVDYPSLPHP PFTFKDCTWW EWGASTISCL KAVGKIDGKQ VAEHTIYPFG RPERLVLKPD YTKLTADGAD CTRVVVELQD EHGQVLHLAH HPVFFELEGV GELIGENPFS LEVGRGAVFI RAGRTPGKIQ LTGKVQGLPP VTIVVSTEPL EDKIVPLPRK
|
| |