Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_3939 |
Symbol | ureC |
ID | 7873585 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 4335062 |
End bp | 4336813 |
Gene Length | 1752 bp |
Protein Length | 583 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | 643700876 |
Product | urease subunit alpha |
Protein accession | YP_002890899 |
Protein GI | 237654585 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.743234 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCGACGA TCTCGCGCCG CGCGTACGCG GAGATGTTCG GCCCCACGGT GGGCGACCGG CTCCGCCTGG CCGACACCGA GCTCGTGATC GAGGTGGAGA AGGACTTCAC CATCTACGGC GAAGAGGTGA AGTTCGGCGG CGGCAAGGTG ATCCGCGACG GCATGGGCCA GGGCCAGGGC GTGGCCGCCG ACGTGGCGGA CACCATCGTC ACCAACGCGC TGGTCATCGA CGCGGTGGCG GGCATCGTGA AGGCCGACGT GGGTCTCAAG GACGGCCGCA TCTGGAGGAT CGGCAAGGGC GGCAACCCCG ACATCCAGCC CGGGGTGACG ATCCCGATCG GCGCCGGCAC CGAGGTGATC GCCGGCGAGG GCATGATCCT CACCGCGGGC GGCATCGACA GCCACATCCA CTGGATCTGC CCGCAGCAGA TCGAGGAGGC GCTGAGCTCG GGCGTCACCA CCATGCTGGG CGGCGGCACC GGGCCGGCGA CCGGCACCTT CGCCACCACC TGCACGCCGG GCCCCTGGCA CATCCACCGC ATGCTGGAGG CTGCCGAGGC CTTCCCGATG AACCTCGGCT TCTTCGGCAA GGGCAACGCC AGCCTGCCGG CGCCGCTGAA GGAGCAGGTG CAGGCCGGCG TGATCGGGCT GAAGCTGCAC GAGGACTGGG GCACGACGCC GGCGGCGATC GACAACTGCC TGACGGTGGC CGAGCAGATG GACATCCAGG TCGCCATCCA CACCGACACG CTCAACGAGT CGGGCTTCGT CGAGACCACG CTGGCGGCGT TCAAGGATCG CACCATCCAC ACCTTCCACA CCGAGGGCGC GGGCGGCGGC CACGCGCCGG ACATCATCCG CGCGATCAGC CGCCCCAACG TGCTGCCGTC CTCGACCAAC CCGACGCGGC CCTACACGGT CAACACCATC GACGAGCACC TCGACATGCT GATGGTGTGC CACCACCTCG ATCCGAGCAT CGCCGAGGAC GTGGCCTTCG CCGAATCGCG CATCCGCCGC GAGACCATCG CCGCCGAGGA CATCCTGCAC GACACCGGCG CGTTCTCGAT GATGTCGTCC GACTCGCAGG CCATGGGCCG GGTGGGCGAG GTGGTGATCC GCACCTGGCA GACCGCGCAC AAGATGAAGC TGCAGCGCGG CTGGCTGGCG CCGCGGCGCA GCGCGGCGGC GGCGGTGCCG GACGCGGTGG CCGTGGTCGA GGGCAGCACC GCCAACGACA ACTTCCGCGT CAAGCGCTAC ATCGCCAAGT ACACCATCAA TCCCGCGCTG ACCCACGGCA TCGCCCACGA GGTGGGCTCG ATCGAGCCGG GCAAGCTCGC CGACCTGGTG CTGTGGCGCC CGGCCTTCTT CGGCGTCAAG CCGAGCCTGG TGATCAAGGG CGGCATGATC GCCGCGGCCG CGATGGGCGA CCCCAACGCC TCGATCCCGA CCCCGCAGCC GGTGCATTGG CGGCCGATGT TCGGCAGCTT CGGACGCGCG CTGAAATGCG CGGTGACCTT CGTGTCGCAG GCCGCGCTGC ACAACGCCGC GGTGGCCGCG CTCGGGCTGC AGAAGCCGCT TGTGGCGGTG AAGGGCTGCC GCACGCTGAC GAAGGCCGAC ATGGTGCTCA ACGACGCCAC GCCGCAGATC GAGGTCGATC CCGAGACCTA TGTCGTGCGC GCCGACGGCG AGCACCTCGC CTGCGAACCC GCCACCGAGC TGCCGCTGGC GCAACGCTAT TTCCTGTTCT GA
|
Protein sequence | MATISRRAYA EMFGPTVGDR LRLADTELVI EVEKDFTIYG EEVKFGGGKV IRDGMGQGQG VAADVADTIV TNALVIDAVA GIVKADVGLK DGRIWRIGKG GNPDIQPGVT IPIGAGTEVI AGEGMILTAG GIDSHIHWIC PQQIEEALSS GVTTMLGGGT GPATGTFATT CTPGPWHIHR MLEAAEAFPM NLGFFGKGNA SLPAPLKEQV QAGVIGLKLH EDWGTTPAAI DNCLTVAEQM DIQVAIHTDT LNESGFVETT LAAFKDRTIH TFHTEGAGGG HAPDIIRAIS RPNVLPSSTN PTRPYTVNTI DEHLDMLMVC HHLDPSIAED VAFAESRIRR ETIAAEDILH DTGAFSMMSS DSQAMGRVGE VVIRTWQTAH KMKLQRGWLA PRRSAAAAVP DAVAVVEGST ANDNFRVKRY IAKYTINPAL THGIAHEVGS IEPGKLADLV LWRPAFFGVK PSLVIKGGMI AAAAMGDPNA SIPTPQPVHW RPMFGSFGRA LKCAVTFVSQ AALHNAAVAA LGLQKPLVAV KGCRTLTKAD MVLNDATPQI EVDPETYVVR ADGEHLACEP ATELPLAQRY FLF
|
| |