Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_4043 |
Symbol | |
ID | 8744671 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | - |
Start bp | 297153 |
End bp | 298910 |
Gene Length | 1758 bp |
Protein Length | 585 aa |
Translation table | 11 |
GC content | 59% |
IMG OID | 646514609 |
Product | urocanate hydratase |
Protein accession | YP_003405556 |
Protein GI | 284167278 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG2987] Urocanate hydratase |
TIGRFAM ID | [TIGR01228] urocanate hydratase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.487767 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATATACG TACGCATGAA ACAGCAACAG ACGGACGATG ACTCGACAGA TAAGATCGGC AGTCCCTCGT CGGAGTGGCT AGAGTATCAA GGTTCACCCA CCGGGACGGA TATCGAGTGT GAGGGGTGGA GACAGGAAGC CGCCCTCCGA ATGCTCAACA ACAATCTCGA TCCGGAAGTC GGAGAGAAAC CGGAAGATCT CGTAGTGTAC GGCGGTACTG GTCGGGCAGC CCGGAGCTGG GATGCATACG ATACGATTCT CTCGGAGCTA CGGACGCTTG CCGATGACGA AACGCTACTA GTCCAGTCGG GGAAACCGGT CGGCCGATTC AAGACTCACG AACGTTCGCC ACAGGTGCTT ATTGCTAATT CGAACCTGGT CGGCGCTTGG GACGACTGGG AGCACTTCCA CGAACTTGAA TCAAAGGGGC TTATCATGTA CGGCCAGATG ACCGCCGGAT CGTGGGCGTA TATCGGAACG CAGGGTATTA TTCAAGGGAC CTTCGAGACA TTGGCCGAAG CGGCTCGCCA ACACTTCCCC GAGCGGGAGG GGCTGGAAGG AACGGTCACA GTCACCGCTG GCCTCGGTGG TATGGGTGGA GCACAACCGC TAGCTGTGAC GATGAATCAT GGCGTGTGTA TCGCGGCTGA GGTCGACGAA CACCGGATCG ACAGACGGAT CGAGACGGAC TACTGTATGG AGAAGACAGA CGACCTGGAC GAGGCTATTG AGATGGCCGA AGATGCGGCA GCGAACGGTG AACCGCTCTC TATCGCCCTC CACATGAATG CTGCCGATAT GTTCGACGGG CTGCTTGAAC GCGACTTCGT CCCGGACATC GTCACCGACC AGACAAGCGC GCACGACGAA CTCGAAGGCT ACTACCCCGC CGGTTACACT GTGGAGGAGG CCGACGCGTT ACGCGAGGCG GATCCCGACC GGTACGTCGA AGAGAGCCTC GACACGATGC AACGTCACGT CGAAGGCATC CTCGAAATGC AAGAGCGCGG TGCGGTGGCC TTCGAGTACG GGAACAACAT TCGTGGACAA GTCGAGGAGC ACCGAGAGAT GGCGAGCGCA TTCGATTTCC CGGGGTTCGT CCCGGCGTAC ATCCGACCCC TGTTCTGCCA GGGGAAGGGA CCCTTCCGTT GGGTCGCTCT CTCTGGAGAC GAAGAGGACA TCCATCGGAC TGACGACGCT ATCAAGGAAC TCTTTCCCGA GAAAGACCAA CTGCACCGCT GGATCGATCT CGCACAGGAA CAGGTTTCGT TCCAAGGTCT CCCCAGCCGT GTCTGCTGGC TCGGGTACCA GAGCGACGAC GGCCTAACCG AGCGTGCGCG ATTTGCGCTT CGAATTAACG AACTCGTCGA CGAAGGCGAG ATTGCGGCGC CCGTCGTCGT CACACGCGAT CACCTCGATG CTGGCAGTGT GGCTAGCCCG AACCGAGAGA CTGAAGCTAT GCGAGACGGC TCGGATGCCG TCGCCGACTG GCCTATCCTG AACGCTCTGC TCAACTGCGC TGCCGGCGCT GATATCGTCA GCGTTCACGA TGGGGGTGGC GTCGGTATTG GCAACTCCCT TCATACCAAC AACCACGTCG TCCTCGACGG CTCCGACCTC GCTGCTGAAA AGGCCCGCCG CGTGTTTACG ACTGACCCCG GCATGGGTGT CATCCGCCAC GCCGACGCTG GGTACGACGA AGCGCTCAAC GAGGCAACGA CTTCGGATGT CCACGTCCCG ATGGCTGAGA ACAAATGA
|
Protein sequence | MIYVRMKQQQ TDDDSTDKIG SPSSEWLEYQ GSPTGTDIEC EGWRQEAALR MLNNNLDPEV GEKPEDLVVY GGTGRAARSW DAYDTILSEL RTLADDETLL VQSGKPVGRF KTHERSPQVL IANSNLVGAW DDWEHFHELE SKGLIMYGQM TAGSWAYIGT QGIIQGTFET LAEAARQHFP EREGLEGTVT VTAGLGGMGG AQPLAVTMNH GVCIAAEVDE HRIDRRIETD YCMEKTDDLD EAIEMAEDAA ANGEPLSIAL HMNAADMFDG LLERDFVPDI VTDQTSAHDE LEGYYPAGYT VEEADALREA DPDRYVEESL DTMQRHVEGI LEMQERGAVA FEYGNNIRGQ VEEHREMASA FDFPGFVPAY IRPLFCQGKG PFRWVALSGD EEDIHRTDDA IKELFPEKDQ LHRWIDLAQE QVSFQGLPSR VCWLGYQSDD GLTERARFAL RINELVDEGE IAAPVVVTRD HLDAGSVASP NRETEAMRDG SDAVADWPIL NALLNCAAGA DIVSVHDGGG VGIGNSLHTN NHVVLDGSDL AAEKARRVFT TDPGMGVIRH ADAGYDEALN EATTSDVHVP MAENK
|
| |