Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_1558 |
Symbol | |
ID | 8742149 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 1616632 |
End bp | 1618338 |
Gene Length | 1707 bp |
Protein Length | 568 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 646512134 |
Product | urease, alpha subunit |
Protein accession | YP_003403117 |
Protein GI | 284164838 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0804] Urea amidohydrolase (urease) alpha subunit |
TIGRFAM ID | [TIGR01792] urease, alpha subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCGCA ACCTTTCCCG CGAGGAGTAC ACGGAACTGT TCGGCGCGAC CGAGGGCGAT CGCCTCCGAC TCGGCGATAC GAACCTCTTC GCGGAGATCG AGACCGACTA CGGCGTTCCC GGCGAGGAGG CCGTCTTCGG CGGCGGGAAG ACGATGCGCG ACGGGATGGG GATGCAGTCG GGAACGACCC AGGCCGAAGG GACCCTCGAC TGGGCCTTTA CGAACGTCGT GATAATCGAT CCCGTGCTGG GGATTTGCAA GGGCGATATC GGCGTCCGCG ACGGGAAGAT CGTCGGCGTC GGGAAGGCCG GCAATCCAGA CACCATGGAC GGCGTCGACA TGGTAATCGG TCCGAGCACC GATACCATCC CCGCTGACGG GTTGATCGCG ACGCCCGGCG CGCTGGACAT CCACGTTCAC TTCAACAGCC CGCAACTGGT CGATCACGCG CTCGGATCGG GCGTCACGAC GATGCTCGGC GGCGGCTTCG GCGGCGGTGC AACGACCTGT ACGCCCGGGC CGCGGAACAT CCAGCGGTTC CTGCAGGCCG CCGAAGAGTG GCCCGTCAAC GTCGGCTTCT ACGCGAAGGG CAACAGCAGC CGGCCCGAGG CGCTCGTCGA GCAGATCGAG GCCGGCGCCT GCGGCATGAA GCTCCACGAG GACTGGGGGT CGACGCCCGC CGCGATCGAC ACCTGCCTCG AGGTCGCCGA CGAGGAGGAC GTTCAGGTCT GCATCCACAC GGACACGCTG AACGAGTCGG GCTTCGTCGA GCACACCTTC GACGCCATCG ACGGGCGCGC GATCCACACC TTCCACATCG AGGGCGCCGG CGGCGGCCAC GCGCCGGACG TCCTCGAGTT GATCGGCCAC GAGCACATGC TGCCGTCGTC GACGAACCCG TCGATGCCCT ACACCGAGAA CACGTTTGAC GAGCACCTCG ACATGGTGAT GGTCTGTCAC CACCTCGATC CGGACATCCC CGAGGACGTC GCCTTCGCCG AGTCGCGCAT CCGCGCGGAG ACGATCGGCG CCGAGGACGT GCTCCACGAC ACGGGGGCCA TCTCGATGAT GACCACCGAC TCCCAGGCGA TGGGCCGGAT GGCCGAACTA ATCAGTCGGA CGTGGCAGAC CGCCCACAAG ATGAAGGCCC AGCGCGGCCC GCTCTCCGCC GACGAGGGGA CCGACGCCGA CAACGCCCGC ATCGAACGCT ACGTCGCCAA GTACACGATC AACCCCGCCA TTACGGCGGG GATCGACGAC TACGTCGGCT CGCTCGAGCC CGGCAAACTC GCCGACATCG CCCTGTGGGA TCCGGCCTTC TTCGGCGTCA AGCCGAAGGC CGTGATCAAG GGCGGCTTCC CGGTCTGGTC CCAGATGGGC GAGGCCAACG GCTCGCTGAT GACCTGTGAA CCGGTGATCG GCCGTGAGCG CGCCGGCGCG CAGGGCCGGG CGAAACACGG CCTCTCGGTG ACGTTCGTCA GCGAGGCCGC CTACGAGAAC GAGGTCGGCG ACGCCTACGA CCTGAAAACG CCCGTTCGAC CCGTCACAGG CACGCGCGAG GTCTGTAAGT CCGACATGGT CCACAACGAC CACTGTCCGG ACGATATCGA GATCGACGCC CAGACGTTCG AGGTCGAAGT GGACGGCGAA CACGTCACCT GCGATCCGGC CGACGAGATT CCGCTCGCAC AGCGCTACCT ACTCTAA
|
Protein sequence | MSRNLSREEY TELFGATEGD RLRLGDTNLF AEIETDYGVP GEEAVFGGGK TMRDGMGMQS GTTQAEGTLD WAFTNVVIID PVLGICKGDI GVRDGKIVGV GKAGNPDTMD GVDMVIGPST DTIPADGLIA TPGALDIHVH FNSPQLVDHA LGSGVTTMLG GGFGGGATTC TPGPRNIQRF LQAAEEWPVN VGFYAKGNSS RPEALVEQIE AGACGMKLHE DWGSTPAAID TCLEVADEED VQVCIHTDTL NESGFVEHTF DAIDGRAIHT FHIEGAGGGH APDVLELIGH EHMLPSSTNP SMPYTENTFD EHLDMVMVCH HLDPDIPEDV AFAESRIRAE TIGAEDVLHD TGAISMMTTD SQAMGRMAEL ISRTWQTAHK MKAQRGPLSA DEGTDADNAR IERYVAKYTI NPAITAGIDD YVGSLEPGKL ADIALWDPAF FGVKPKAVIK GGFPVWSQMG EANGSLMTCE PVIGRERAGA QGRAKHGLSV TFVSEAAYEN EVGDAYDLKT PVRPVTGTRE VCKSDMVHND HCPDDIEIDA QTFEVEVDGE HVTCDPADEI PLAQRYLL
|
| |