Gene Information |
Locus tag | Nmul_A1940 |
Symbol | |
ID | 3784236 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosospira multiformis ATCC 25196 |
Kingdom | Bacteria |
Replicon accession | NC_007614 |
Strand | - |
Start bp | 2231010 |
End bp | 2232791 |
Gene Length | 1782 bp |
Protein Length | 593 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637812026 |
Product | glycoside hydrolase family protein |
Protein accession | YP_412627 |
Protein GI | 82703061 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG3250] Beta-galactosidase/beta-glucuronidase |
TIGRFAM ID | |
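
The coordinate and length fields above are related by simple arithmetic, and a short check makes that relationship explicit. The sketch below assumes that Start bp and End bp are 1-based and inclusive (an assumption, though it is consistent with the listed Gene Length) and that the gene length includes one terminal stop codon. The function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information table above.
START_BP = 2231010
END_BP = 2232791
GENE_LENGTH_BP = 1782
PROTEIN_LENGTH_AA = 593


def span_length(start: int, end: int) -> int:
    """Length of a closed interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Codon count minus one stop codon, for an unspliced CDS."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP))            # compare with Gene Length
    print(expected_protein_length(GENE_LENGTH_BP))  # compare with Protein Length
```

With the values in this record, both checks agree with the table: 2232791 - 2231010 + 1 = 1782 bp, and 1782 / 3 - 1 = 593 aa.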
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.590979 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGAAGTTCG CGCACCCTCG ACCGCAGATG CAGCGCTCCA ATTGGACCTC GCTCAACGGA GTCTGGCGCT TTCGCTACGA CGAGGCGCGT ACTTTTACTC ACCCCTCTCA GATAGAATCC TGGCCGATGG AAATCATCGT GCCGTTTCCA CCCGAATCCG AGGCAAGCGG AATCGGTGAC CGCAGTTTTC ATTCACTCTG CTGGTATGAG CGCGATTTTG ATTTCGAGCC CTCCTCGGAG CGGGTAATCC TGCATTTCGG TGCCGTGGAC TACTCCGCAA AGGTATGGGT CAACGGTCGT TTTGCCGCCA GCCACGAAGG GGGGCATACT CCATTCTGGG GGGACATCAC CCCTCTGCTC GATCCATCCG GAAAACAGAA GGTGACGGTA CAGGTGGAGG ACGATCCGCA CGAGCTGGCC AAACCGCGTG GAAAGCAGGA TTGGCAGCTC GAGCCTCATG CCATCTGGTA TCCCCGTACC ACTGGCATCT GGCAGATGGT CTGGATCGAG CGGGTATCCG AAAATTACAT TGAAAAGATT CGGTGGACGC CCCAGGTCGA GATATATGCG ATAGGCTTCG AAGCGCGCGT AATAGGAGAA GAGGCCGACG AACTGGCAGT AGACGTATGC TTGCGCCACG GGGAGCAGTT GCTCGCGCAT GACCGCTATC GGGTAGTCGA ACGGGAAGTA GACCGCGTCA TCATATTGTC TGACCCTGGA ATTGATGATT TTCGTAATGA GTTGCTATGG AGCCCGGAAC GTCCCACCTT GATCGATGCT GTTGTACGAC TGATGCGGGG CGAGGAGGTG GTCGACGAGT TCATCTCCTA CACTGCAATG CGTTCCGTCA ACATCCTGCG CGACCGCTTC ATGCTGAACG GTCGTCCCTA CACGCTGAGG CTCGTGCTTG ACCAGGGCTA CTGGCCGGAG ACGCTGCTGG CTGCGCCAAG CGACGACGCC CTGCGAGGCG ATGTGGAACT TGCAAAGGCA ATGGGCTTCA ACGGGGTGCG CAAGCATCAA AAGATAGAGG ACCCGCGCTA TCTTTATTGG GCGGACAGGC TCGGGCTGAT GGTATGGGAA GAAATGCCTT CCGCATATCG CTTCACCCGC AGCGCCATCA AGAGGATGGT GCGGGAATGG ACGGAGGCCA TCGAGCGGGA TTACAGCCAT CCCTGCGTCA TTGTATGGGT ACCTTTCAAT GAATCCTGGG GAGTACCGGA ACTTACCGCG GTCCGTAAAC AGCGGCACGC TGTCGAGGCA CTATATCACT TGACCAAAAC CCTGGATGCG ACGCGCCCGG TAATCGGCAA CGATGGATGG GAAAGCAGTG CTACGGATAT CATCGGTATT CACGATTATG ACGCAAACAT TGAACATCTG CGCCAGCGTT ATGGCGCCGA AATAAAACCT GAACAGTTGT TCGACCGTCG GCGCCCGGGG GGGCGGATTC TCACCCTTGA TGGCTACCCG CATCGAGGCC AGCCGATCAT GTTAAGCGAA TTCGGGGGGA TTGCTTTCGC CAAGTGCCCG CAACCCGGCG TCGAGCATAC TTGGGGCTAT ACTGTTGCCC ACGCCGAAGA GGAATTTGCG CGTATGTATG CCGAGTTGAT GCATACAGTG ATTCATACGG CTCTCTTCAG CGGCTTTTGC TATACCCAGT TTGCCGATAC CTTTCAGGAA GCGAACGGAC TGCTGTGCGC GGATCGTACT CCCAAGATTC CCATTGAGCA AATCGCCCGC GTCACGCGCA TCTCGCCCAC CTATATACCC GGGGGTGTTT AG
|
Protein sequence | MKFAHPRPQM QRSNWTSLNG VWRFRYDEAR TFTHPSQIES WPMEIIVPFP PESEASGIGD RSFHSLCWYE RDFDFEPSSE RVILHFGAVD YSAKVWVNGR FAASHEGGHT PFWGDITPLL DPSGKQKVTV QVEDDPHELA KPRGKQDWQL EPHAIWYPRT TGIWQMVWIE RVSENYIEKI RWTPQVEIYA IGFEARVIGE EADELAVDVC LRHGEQLLAH DRYRVVEREV DRVIILSDPG IDDFRNELLW SPERPTLIDA VVRLMRGEEV VDEFISYTAM RSVNILRDRF MLNGRPYTLR LVLDQGYWPE TLLAAPSDDA LRGDVELAKA MGFNGVRKHQ KIEDPRYLYW ADRLGLMVWE EMPSAYRFTR SAIKRMVREW TEAIERDYSH PCVIVWVPFN ESWGVPELTA VRKQRHAVEA LYHLTKTLDA TRPVIGNDGW ESSATDIIGI HDYDANIEHL RQRYGAEIKP EQLFDRRRPG GRILTLDGYP HRGQPIMLSE FGGIAFAKCP QPGVEHTWGY TVAHAEEEFA RMYAELMHTV IHTALFSGFC YTQFADTFQE ANGLLCADRT PKIPIEQIAR VTRISPTYIP GGV
|
| |