Gene Tmz1t_3939 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTmz1t_3939 
SymbolureC 
ID7873585 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThauera sp. MZ1T 
KingdomBacteria 
Replicon accessionNC_011662 
Strand
Start bp4335062 
End bp4336813 
Gene Length1752 bp 
Protein Length583 aa 
Translation table11 
GC content70% 
IMG OID643700876 
Producturease subunit alpha 
Protein accessionYP_002890899 
Protein GI237654585 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0804] Urea amidohydrolase (urease) alpha subunit 
TIGRFAM ID[TIGR01792] urease, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.743234 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCGACGA TCTCGCGCCG CGCGTACGCG GAGATGTTCG GCCCCACGGT GGGCGACCGG 
CTCCGCCTGG CCGACACCGA GCTCGTGATC GAGGTGGAGA AGGACTTCAC CATCTACGGC
GAAGAGGTGA AGTTCGGCGG CGGCAAGGTG ATCCGCGACG GCATGGGCCA GGGCCAGGGC
GTGGCCGCCG ACGTGGCGGA CACCATCGTC ACCAACGCGC TGGTCATCGA CGCGGTGGCG
GGCATCGTGA AGGCCGACGT GGGTCTCAAG GACGGCCGCA TCTGGAGGAT CGGCAAGGGC
GGCAACCCCG ACATCCAGCC CGGGGTGACG ATCCCGATCG GCGCCGGCAC CGAGGTGATC
GCCGGCGAGG GCATGATCCT CACCGCGGGC GGCATCGACA GCCACATCCA CTGGATCTGC
CCGCAGCAGA TCGAGGAGGC GCTGAGCTCG GGCGTCACCA CCATGCTGGG CGGCGGCACC
GGGCCGGCGA CCGGCACCTT CGCCACCACC TGCACGCCGG GCCCCTGGCA CATCCACCGC
ATGCTGGAGG CTGCCGAGGC CTTCCCGATG AACCTCGGCT TCTTCGGCAA GGGCAACGCC
AGCCTGCCGG CGCCGCTGAA GGAGCAGGTG CAGGCCGGCG TGATCGGGCT GAAGCTGCAC
GAGGACTGGG GCACGACGCC GGCGGCGATC GACAACTGCC TGACGGTGGC CGAGCAGATG
GACATCCAGG TCGCCATCCA CACCGACACG CTCAACGAGT CGGGCTTCGT CGAGACCACG
CTGGCGGCGT TCAAGGATCG CACCATCCAC ACCTTCCACA CCGAGGGCGC GGGCGGCGGC
CACGCGCCGG ACATCATCCG CGCGATCAGC CGCCCCAACG TGCTGCCGTC CTCGACCAAC
CCGACGCGGC CCTACACGGT CAACACCATC GACGAGCACC TCGACATGCT GATGGTGTGC
CACCACCTCG ATCCGAGCAT CGCCGAGGAC GTGGCCTTCG CCGAATCGCG CATCCGCCGC
GAGACCATCG CCGCCGAGGA CATCCTGCAC GACACCGGCG CGTTCTCGAT GATGTCGTCC
GACTCGCAGG CCATGGGCCG GGTGGGCGAG GTGGTGATCC GCACCTGGCA GACCGCGCAC
AAGATGAAGC TGCAGCGCGG CTGGCTGGCG CCGCGGCGCA GCGCGGCGGC GGCGGTGCCG
GACGCGGTGG CCGTGGTCGA GGGCAGCACC GCCAACGACA ACTTCCGCGT CAAGCGCTAC
ATCGCCAAGT ACACCATCAA TCCCGCGCTG ACCCACGGCA TCGCCCACGA GGTGGGCTCG
ATCGAGCCGG GCAAGCTCGC CGACCTGGTG CTGTGGCGCC CGGCCTTCTT CGGCGTCAAG
CCGAGCCTGG TGATCAAGGG CGGCATGATC GCCGCGGCCG CGATGGGCGA CCCCAACGCC
TCGATCCCGA CCCCGCAGCC GGTGCATTGG CGGCCGATGT TCGGCAGCTT CGGACGCGCG
CTGAAATGCG CGGTGACCTT CGTGTCGCAG GCCGCGCTGC ACAACGCCGC GGTGGCCGCG
CTCGGGCTGC AGAAGCCGCT TGTGGCGGTG AAGGGCTGCC GCACGCTGAC GAAGGCCGAC
ATGGTGCTCA ACGACGCCAC GCCGCAGATC GAGGTCGATC CCGAGACCTA TGTCGTGCGC
GCCGACGGCG AGCACCTCGC CTGCGAACCC GCCACCGAGC TGCCGCTGGC GCAACGCTAT
TTCCTGTTCT GA
 
Protein sequence
MATISRRAYA EMFGPTVGDR LRLADTELVI EVEKDFTIYG EEVKFGGGKV IRDGMGQGQG 
VAADVADTIV TNALVIDAVA GIVKADVGLK DGRIWRIGKG GNPDIQPGVT IPIGAGTEVI
AGEGMILTAG GIDSHIHWIC PQQIEEALSS GVTTMLGGGT GPATGTFATT CTPGPWHIHR
MLEAAEAFPM NLGFFGKGNA SLPAPLKEQV QAGVIGLKLH EDWGTTPAAI DNCLTVAEQM
DIQVAIHTDT LNESGFVETT LAAFKDRTIH TFHTEGAGGG HAPDIIRAIS RPNVLPSSTN
PTRPYTVNTI DEHLDMLMVC HHLDPSIAED VAFAESRIRR ETIAAEDILH DTGAFSMMSS
DSQAMGRVGE VVIRTWQTAH KMKLQRGWLA PRRSAAAAVP DAVAVVEGST ANDNFRVKRY
IAKYTINPAL THGIAHEVGS IEPGKLADLV LWRPAFFGVK PSLVIKGGMI AAAAMGDPNA
SIPTPQPVHW RPMFGSFGRA LKCAVTFVSQ AALHNAAVAA LGLQKPLVAV KGCRTLTKAD
MVLNDATPQI EVDPETYVVR ADGEHLACEP ATELPLAQRY FLF