Gene Information |
Locus tag | Hoch_1704 |
Symbol | |
ID | 8544086 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haliangium ochraceum DSM 14365 |
Kingdom | Bacteria |
Replicon accession | NC_013440 |
Strand | - |
Start bp | 2317309 |
End bp | 2319018 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646386412 |
Product | urease, alpha subunit |
Protein accession | YP_003266147 |
Protein GI | 262194938 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
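The length fields above agree with the coordinates, assuming that Start bp and End bp are 1-based inclusive positions on the replicon and that the CDS ends in a single stop codon not counted in the protein length. A minimal sketch of that arithmetic follows; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information table above.
start_bp = 2317309
end_bp = 2319018

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that does not
# contribute a residue to the protein.
codon_count, remainder = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp, remainder, protein_length_aa)  # 1710 0 569
```

The zero remainder indicates that the stated span is a whole number of codons, which is consistent with the "Is pseudo gene | No" entry.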
| |
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.000469415 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCCCACA ACGCCAAGAT CTCGCGCCGC GAATACGCGG AGAAATTCGG ACCGACGGTC GGCGACCGCA TCCGGCTCGC CGACACCGAG CTGTTCATCG AGGTCGAAAA AGACCACGCC ATCTACGGCG AGGAGGTCAA ATTCGGCGGC GGCAAGGTGA TCCGCGACGG CATGGGCCAG TCGCCGCGCG CGCACGCCGA GGGCGCGGTC GACACCGTCA TCACCAACGC CGTGATCCTC GACTGGTGGG GCGTGGTCAA AGCCGATATC GGCATCAAAG ACGGCCTCAT CGCCGCCATC GGCAAGGCCG GTAACCCCGA CATCCAGCCC GGCGTCGACA TCATCATCGG CCCCGGCACC GAGATCATCG CCGGCGAGGG TCGCATCGTC ACCGCCGGCG GCATCGACGC GCACATCCAC TTCATCGCCC CGCAGCAGAT CGAAGAGGCG CTGTGCTCGG GCATCACCAC CATGCTCGGC GGCGGCACCG GACCCGCGGC CGGCACCACG GCGACCACCT GCACCCCGGG CCCGTGGCAC ATCGAGCGCA TGCTCATGGC CGCGCCCGCG TTCCCGATGA ACCTGGGCTT CTTCGGCAAG GGCAACACCT CGCAGCCGGC CGCCCTGGTC GAGCAGATCG AGGCCGGCGC CTGCGGCCTC AAGCTGCACG AGGACTGGGG CTCGACCCCG GCCGCGATCC GCAACTGCCT CAGCGTCGCC GAGCAGTACG ACGTCCAGGT CGCCCTGCAC GCCGACACCC TCAACGAGGC CGGCTTCGTC GAGGACACGC GCGCCGCCTT CGAGGACCGC ACCGTGCACG TGTTCCACAC CGAGGGCGCC GGCGGCGGTC ACGCGCCCGA CGTGATCCGT CTGGTCGGCG AGCCCAACGT GCTGCCCTCG TCGACGAACC CGACCCGGCC GTTCACGGTC AACACCGTGG CCGAGCACCT CGACATGCTC ATGGTCTGCC ACCACCTCGA CCCCCGGATC GAGGAGGACC TGGCCTTTGC CGAGAGCCGC ATCCGCCGCG AGACCATCGC CGCCGAGGAC ATCCTCCAGG ACATGGGCGC GATCTCGATG ATGACCTCGG ACAGCCAGGC CATGGGACGC GTGGGCGAAG TCATCCTGCG CACCTGGCAG ACCGCGCACA AGATGAAGCT GCAGCGCGAC GGCGACGCCG CGCCGCCCAA CGACAACGCG CGCGCCAAGC GCTACGTCGC CAAGTACACC ATCAACCCGG CCATCGCCCA GGGCATCGCG GCGCATGTTG GCTCGGTCGA GGTCGGCAAG CTGGCCGACC TGGTCGTGTG GCAGCCGGCC TTCTTCGGCG TCAAGCCGGA GCTGGTGATC AAGGGCGGCG CTATCGCCAT GGCGCCGATG GGCGACCCCA ACGCGTCCAT CTCCACGCCG CAGCCCGTGC ACTACCGGCC CATGTTCGGC AGCTACAGCC GCTCGCGCGA GGTTGGCTCG CTGCTGTTCG TGAGCCAGGC CAGCATCGAC GGCGGCATCG AGCAGCGGCT CGGTCTGGGC CGGCGCTGCG TGGCCGTCAA GGGCACGCGC GGGCTGCGCA AGGCCGATAT GAAGCTCAAC GACCTCGCCC CCGCGATGGA GGTGGACCCG CTCAGCTACG AGGTCCGCGC CGACGGCGAG CTGCTCACCT GCGAGCCGCT GGCGGTGCTG CCCATGGCTC AGCGCTACTT CCTGTTCTAG
|
Protein sequence | MAHNAKISRR EYAEKFGPTV GDRIRLADTE LFIEVEKDHA IYGEEVKFGG GKVIRDGMGQ SPRAHAEGAV DTVITNAVIL DWWGVVKADI GIKDGLIAAI GKAGNPDIQP GVDIIIGPGT EIIAGEGRIV TAGGIDAHIH FIAPQQIEEA LCSGITTMLG GGTGPAAGTT ATTCTPGPWH IERMLMAAPA FPMNLGFFGK GNTSQPAALV EQIEAGACGL KLHEDWGSTP AAIRNCLSVA EQYDVQVALH ADTLNEAGFV EDTRAAFEDR TVHVFHTEGA GGGHAPDVIR LVGEPNVLPS STNPTRPFTV NTVAEHLDML MVCHHLDPRI EEDLAFAESR IRRETIAAED ILQDMGAISM MTSDSQAMGR VGEVILRTWQ TAHKMKLQRD GDAAPPNDNA RAKRYVAKYT INPAIAQGIA AHVGSVEVGK LADLVVWQPA FFGVKPELVI KGGAIAMAPM GDPNASISTP QPVHYRPMFG SYSRSREVGS LLFVSQASID GGIEQRLGLG RRCVAVKGTR GLRKADMKLN DLAPAMEVDP LSYEVRADGE LLTCEPLAVL PMAQRYFLF
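The protein sequence can be spot-checked against the gene sequence by translating it codon by codon. The sketch below is a hedged illustration, not the database's own pipeline: the helper names (`clean`, `translate`, `reverse_complement`) are hypothetical, and it relies on translation table 11 assigning the same amino acids to codons as the standard code (the tables differ in permitted start codons). Only the first and last 30 bases of the gene sequence are used, to keep the example short. Since the record lists the strand as "-" and the displayed sequence begins with ATG, the sequence appears to be given in coding orientation; `reverse_complement` shows how the corresponding forward-strand text could be derived under that assumption.

```python
from itertools import product

BASES = "TCAG"
# Amino acid assignments in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
# Assumption: table 11 shares these assignments with the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-base blocks and uppercase the result."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate frame 1 of a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]  # raises KeyError on ambiguous bases
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an unambiguous DNA string."""
    return clean(dna).translate(str.maketrans("ACGT", "TGCA"))[::-1]


# First and last 30 bases of the gene sequence shown above.
first_30 = "ATGGCCCACA ACGCCAAGAT CTCGCGCCGC"
last_30 = "CCCATGGCTC AGCGCTACTT CCTGTTCTAG"

print(translate(first_30))  # MAHNAKISRR
print(translate(last_30))   # PMAQRYFLF
```

Both fragments match the start and end of the listed protein sequence. Passing the full gene sequence to `translate` in the same way would allow the complete 569-residue comparison.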
|
| |