Gene Information |
Locus tag | Htur_4107 |
Symbol | |
ID | 8744735 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 371693 |
End bp | 373489 |
Gene Length | 1797 bp |
Protein Length | 598 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514664 |
Product | urocanate hydratase |
Protein accession | YP_003405611 |
Protein GI | 284167333 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
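
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of residues encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon occupies one codon but encodes no residue.
    return codons - 1 if includes_stop else codons


# Values copied from the record for Htur_4107.
length = gene_length_bp(371693, 373489)
print(length)                     # 1797
print(protein_length_aa(length))  # 598
```

Under these assumptions the computed values match the Gene Length (1797 bp) and Protein Length (598 aa) fields.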
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGCGAAT CAGAATCTCA AGCGTCGATC GGTAGCGAAC CGAGCGACCA GTGGCGCGAG TATCGCGGGG CACCGACGGG AACCGATCGG GAGTGTCGCG GCTGGCGTCA GGAGGCCGCC CTTCGCCTGT TGAACAACAA CCTCGATCCC GAGGTGGCCG AAAAGCCCGA GGAGCTGGTC GTCTATGGTG GCACAGGTCG GGCGGCGCGA TCGTGGGACG CATACGATGC GATCTGCGAC GAACTCCGCG ACCTCGGCAA TGACGAGACG CTGCTCGTTC AGTCCGGCAA ACCCGTGGGC CGGTTCAAGA CCCATAAGCG AGCCCCTCGG GTCCTCATCG CCAACTCGAA TCTCGTGGGA AAGTGGGACG ACTGGGAGCA CTTCCACGAA CTCGAAGCGA AAGGGCTGAT CATGTACGGC CAGATGACTG CCGGCTCGTG GGCGTATATC GGCACCCAAG GCATCATTCA AGGGACCTAC GAGACGCTGG CGGAACTGGC AAACAAAAAT TACCCAGATT CTGACGGGTT ACGCGGCAAG ATTGTCGTCA CCGGTGGGCT CGGTGGGATG GGTGGTGCCC AGCCGCTCGC GGTAACAATG AACCACGGTG TTTGCATTGC TGCCGAAGTC GACGCGACAC GGATCGACCG TCGCATCGAG ACGGGCTACT GTCAGGCGAA AACCGACGAC CTCGACGAGG CGATCGAGCG CGCCGAAGAC GCTGCCGATG CCGGTGAGCC CTACAGCGTC GGCGTCCACA TGAATGCGGC CGACATGCTC GAGGCGATGC TCGATCGAGG GTTCGTTCCG CATGTCGTGA CCGACCAGAC CAGTGCTCAT GACGAACTCG AGGGGTACTA CCCATCTGGA TATACGGTCG CCGAGGCCGA TCGGCTCCGT GAAGAGGAGC CCGATCGGTA CGTTGCGGAG AGCCTCGCGA CGATGGACCG CCATGTGGAC GCGATCCTCG AGCTGCAGGA CCAGGGCGCG ATCGCGTTTG AGTACGGGAA CAACATCCGC GGACAGGTCG CAGAGCACTG CGATCGCGAG AACGCCTTCG ACTATCCGGG GTTCGTGCCG GCGTACATTC GTCCGCTCTT TTGTCGTGGG AAGGGGCCGT TTCGCTGGGT GGCGCTGTCG GGTGATCCCA CCGATATCCA TCGTACCGAC GAAGCCATCA AAGAGCTCTT TCCCGAGAAG GGCCACCTGC ACCGCTGGAT CGATCTCGCA CAGAAACAAG TCGAGTTCCA AGGGTTGCCG GCGCGTGTCT GTTGGCTCGG GTATCAGACG AGTGAGGCTC CACAGGAGAC CGGGTCTCGA ACCGATGGCA GCGACGACGC CAGCGCCGAC GATTCGGGCG GATTGACGGA ACGGGCTCGC TTCGCCCTCC GGATCAACGA GCTCGTCGCC GCCGGCGAGA TTTCGGCCCC GATCGTCGTC ACGCGAGATC ACCTCGATGC GGGGTCAGTC GCAAGTCCGA ATCGTGAGAC CGAGGCCATG GCTGACGGTT CCGACGCGAT CGCCGACTGG CCGATCCTCA ACGCCCTGCT CAACTGCGCC GCCGGCGCAG ACATCGTGAG CGTCCATGAC GGTGGGGGCG TCGGTATCGG AAACGCCATC CATGCGAACA ACCACGTCGT CCTTGACGGG TCCGACCTCG CCGCCGAGAC GGCCCGCCGC GTATTTACCA CGGATCCCGG AACGGGAGTG ATCCGCCACG TCGATGCTGG CTACGAGGAG GCGCTCGCAG AGGCACAGGA GTCGGCCGTT CACATCCCGA TGGAGAGTCG CAAGTAA
|
Protein sequence | MGESESQASI GSEPSDQWRE YRGAPTGTDR ECRGWRQEAA LRLLNNNLDP EVAEKPEELV VYGGTGRAAR SWDAYDAICD ELRDLGNDET LLVQSGKPVG RFKTHKRAPR VLIANSNLVG KWDDWEHFHE LEAKGLIMYG QMTAGSWAYI GTQGIIQGTY ETLAELANKN YPDSDGLRGK IVVTGGLGGM GGAQPLAVTM NHGVCIAAEV DATRIDRRIE TGYCQAKTDD LDEAIERAED AADAGEPYSV GVHMNAADML EAMLDRGFVP HVVTDQTSAH DELEGYYPSG YTVAEADRLR EEEPDRYVAE SLATMDRHVD AILELQDQGA IAFEYGNNIR GQVAEHCDRE NAFDYPGFVP AYIRPLFCRG KGPFRWVALS GDPTDIHRTD EAIKELFPEK GHLHRWIDLA QKQVEFQGLP ARVCWLGYQT SEAPQETGSR TDGSDDASAD DSGGLTERAR FALRINELVA AGEISAPIVV TRDHLDAGSV ASPNRETEAM ADGSDAIADW PILNALLNCA AGADIVSVHD GGGVGIGNAI HANNHVVLDG SDLAAETARR VFTTDPGTGV IRHVDAGYEE ALAEAQESAV HIPMESRK
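
The gene sequence is printed in space-separated groups of 10 bases, so it needs light cleanup before any computation. The sketch below shows one way to strip the grouping and compute a GC percentage. The helper names are hypothetical, and the example uses only the first three groups of the gene sequence rather than the full record, so its result is not expected to equal the GC content field (63%), which presumably describes the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Drop the whitespace used for 10-base grouping and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three 10-base groups of the gene sequence in this record.
prefix = "ATGGGCGAAT CAGAATCTCA AGCGTCGATC"
print(len(clean_sequence(prefix)))   # 30
print(round(gc_percent(prefix), 1))  # 50.0
```

Passing the full gene sequence string to `gc_percent` in place of `prefix` would allow a comparison against the reported GC content.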