Gene Hoch_1704 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHoch_1704 
Symbol 
ID8544086 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaliangium ochraceum DSM 14365 
KingdomBacteria 
Replicon accessionNC_013440 
Strand
Start bp2317309 
End bp2319018 
Gene Length1710 bp 
Protein Length569 aa 
Translation table11 
GC content69% 
IMG OID646386412 
Producturease, alpha subunit 
Protein accessionYP_003266147 
Protein GI262194938 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0804] Urea amidohydrolase (urease) alpha subunit 
TIGRFAM ID[TIGR01792] urease, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.000469415 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCCCACA ACGCCAAGAT CTCGCGCCGC GAATACGCGG AGAAATTCGG ACCGACGGTC 
GGCGACCGCA TCCGGCTCGC CGACACCGAG CTGTTCATCG AGGTCGAAAA AGACCACGCC
ATCTACGGCG AGGAGGTCAA ATTCGGCGGC GGCAAGGTGA TCCGCGACGG CATGGGCCAG
TCGCCGCGCG CGCACGCCGA GGGCGCGGTC GACACCGTCA TCACCAACGC CGTGATCCTC
GACTGGTGGG GCGTGGTCAA AGCCGATATC GGCATCAAAG ACGGCCTCAT CGCCGCCATC
GGCAAGGCCG GTAACCCCGA CATCCAGCCC GGCGTCGACA TCATCATCGG CCCCGGCACC
GAGATCATCG CCGGCGAGGG TCGCATCGTC ACCGCCGGCG GCATCGACGC GCACATCCAC
TTCATCGCCC CGCAGCAGAT CGAAGAGGCG CTGTGCTCGG GCATCACCAC CATGCTCGGC
GGCGGCACCG GACCCGCGGC CGGCACCACG GCGACCACCT GCACCCCGGG CCCGTGGCAC
ATCGAGCGCA TGCTCATGGC CGCGCCCGCG TTCCCGATGA ACCTGGGCTT CTTCGGCAAG
GGCAACACCT CGCAGCCGGC CGCCCTGGTC GAGCAGATCG AGGCCGGCGC CTGCGGCCTC
AAGCTGCACG AGGACTGGGG CTCGACCCCG GCCGCGATCC GCAACTGCCT CAGCGTCGCC
GAGCAGTACG ACGTCCAGGT CGCCCTGCAC GCCGACACCC TCAACGAGGC CGGCTTCGTC
GAGGACACGC GCGCCGCCTT CGAGGACCGC ACCGTGCACG TGTTCCACAC CGAGGGCGCC
GGCGGCGGTC ACGCGCCCGA CGTGATCCGT CTGGTCGGCG AGCCCAACGT GCTGCCCTCG
TCGACGAACC CGACCCGGCC GTTCACGGTC AACACCGTGG CCGAGCACCT CGACATGCTC
ATGGTCTGCC ACCACCTCGA CCCCCGGATC GAGGAGGACC TGGCCTTTGC CGAGAGCCGC
ATCCGCCGCG AGACCATCGC CGCCGAGGAC ATCCTCCAGG ACATGGGCGC GATCTCGATG
ATGACCTCGG ACAGCCAGGC CATGGGACGC GTGGGCGAAG TCATCCTGCG CACCTGGCAG
ACCGCGCACA AGATGAAGCT GCAGCGCGAC GGCGACGCCG CGCCGCCCAA CGACAACGCG
CGCGCCAAGC GCTACGTCGC CAAGTACACC ATCAACCCGG CCATCGCCCA GGGCATCGCG
GCGCATGTTG GCTCGGTCGA GGTCGGCAAG CTGGCCGACC TGGTCGTGTG GCAGCCGGCC
TTCTTCGGCG TCAAGCCGGA GCTGGTGATC AAGGGCGGCG CTATCGCCAT GGCGCCGATG
GGCGACCCCA ACGCGTCCAT CTCCACGCCG CAGCCCGTGC ACTACCGGCC CATGTTCGGC
AGCTACAGCC GCTCGCGCGA GGTTGGCTCG CTGCTGTTCG TGAGCCAGGC CAGCATCGAC
GGCGGCATCG AGCAGCGGCT CGGTCTGGGC CGGCGCTGCG TGGCCGTCAA GGGCACGCGC
GGGCTGCGCA AGGCCGATAT GAAGCTCAAC GACCTCGCCC CCGCGATGGA GGTGGACCCG
CTCAGCTACG AGGTCCGCGC CGACGGCGAG CTGCTCACCT GCGAGCCGCT GGCGGTGCTG
CCCATGGCTC AGCGCTACTT CCTGTTCTAG
 
Protein sequence
MAHNAKISRR EYAEKFGPTV GDRIRLADTE LFIEVEKDHA IYGEEVKFGG GKVIRDGMGQ 
SPRAHAEGAV DTVITNAVIL DWWGVVKADI GIKDGLIAAI GKAGNPDIQP GVDIIIGPGT
EIIAGEGRIV TAGGIDAHIH FIAPQQIEEA LCSGITTMLG GGTGPAAGTT ATTCTPGPWH
IERMLMAAPA FPMNLGFFGK GNTSQPAALV EQIEAGACGL KLHEDWGSTP AAIRNCLSVA
EQYDVQVALH ADTLNEAGFV EDTRAAFEDR TVHVFHTEGA GGGHAPDVIR LVGEPNVLPS
STNPTRPFTV NTVAEHLDML MVCHHLDPRI EEDLAFAESR IRRETIAAED ILQDMGAISM
MTSDSQAMGR VGEVILRTWQ TAHKMKLQRD GDAAPPNDNA RAKRYVAKYT INPAIAQGIA
AHVGSVEVGK LADLVVWQPA FFGVKPELVI KGGAIAMAPM GDPNASISTP QPVHYRPMFG
SYSRSREVGS LLFVSQASID GGIEQRLGLG RRCVAVKGTR GLRKADMKLN DLAPAMEVDP
LSYEVRADGE LLTCEPLAVL PMAQRYFLF