Gene Htur_4043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_4043 
Symbol 
ID8744671 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp297153 
End bp298910 
Gene Length1758 bp 
Protein Length585 aa 
Translation table11 
GC content59% 
IMG OID646514609 
Producturocanate hydratase 
Protein accessionYP_003405556 
Protein GI284167278 
COG category[E] Amino acid transport and metabolism 
COG ID[COG2987] Urocanate hydratase 
TIGRFAM ID[TIGR01228] urocanate hydratase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.487767 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATACG TACGCATGAA ACAGCAACAG ACGGACGATG ACTCGACAGA TAAGATCGGC 
AGTCCCTCGT CGGAGTGGCT AGAGTATCAA GGTTCACCCA CCGGGACGGA TATCGAGTGT
GAGGGGTGGA GACAGGAAGC CGCCCTCCGA ATGCTCAACA ACAATCTCGA TCCGGAAGTC
GGAGAGAAAC CGGAAGATCT CGTAGTGTAC GGCGGTACTG GTCGGGCAGC CCGGAGCTGG
GATGCATACG ATACGATTCT CTCGGAGCTA CGGACGCTTG CCGATGACGA AACGCTACTA
GTCCAGTCGG GGAAACCGGT CGGCCGATTC AAGACTCACG AACGTTCGCC ACAGGTGCTT
ATTGCTAATT CGAACCTGGT CGGCGCTTGG GACGACTGGG AGCACTTCCA CGAACTTGAA
TCAAAGGGGC TTATCATGTA CGGCCAGATG ACCGCCGGAT CGTGGGCGTA TATCGGAACG
CAGGGTATTA TTCAAGGGAC CTTCGAGACA TTGGCCGAAG CGGCTCGCCA ACACTTCCCC
GAGCGGGAGG GGCTGGAAGG AACGGTCACA GTCACCGCTG GCCTCGGTGG TATGGGTGGA
GCACAACCGC TAGCTGTGAC GATGAATCAT GGCGTGTGTA TCGCGGCTGA GGTCGACGAA
CACCGGATCG ACAGACGGAT CGAGACGGAC TACTGTATGG AGAAGACAGA CGACCTGGAC
GAGGCTATTG AGATGGCCGA AGATGCGGCA GCGAACGGTG AACCGCTCTC TATCGCCCTC
CACATGAATG CTGCCGATAT GTTCGACGGG CTGCTTGAAC GCGACTTCGT CCCGGACATC
GTCACCGACC AGACAAGCGC GCACGACGAA CTCGAAGGCT ACTACCCCGC CGGTTACACT
GTGGAGGAGG CCGACGCGTT ACGCGAGGCG GATCCCGACC GGTACGTCGA AGAGAGCCTC
GACACGATGC AACGTCACGT CGAAGGCATC CTCGAAATGC AAGAGCGCGG TGCGGTGGCC
TTCGAGTACG GGAACAACAT TCGTGGACAA GTCGAGGAGC ACCGAGAGAT GGCGAGCGCA
TTCGATTTCC CGGGGTTCGT CCCGGCGTAC ATCCGACCCC TGTTCTGCCA GGGGAAGGGA
CCCTTCCGTT GGGTCGCTCT CTCTGGAGAC GAAGAGGACA TCCATCGGAC TGACGACGCT
ATCAAGGAAC TCTTTCCCGA GAAAGACCAA CTGCACCGCT GGATCGATCT CGCACAGGAA
CAGGTTTCGT TCCAAGGTCT CCCCAGCCGT GTCTGCTGGC TCGGGTACCA GAGCGACGAC
GGCCTAACCG AGCGTGCGCG ATTTGCGCTT CGAATTAACG AACTCGTCGA CGAAGGCGAG
ATTGCGGCGC CCGTCGTCGT CACACGCGAT CACCTCGATG CTGGCAGTGT GGCTAGCCCG
AACCGAGAGA CTGAAGCTAT GCGAGACGGC TCGGATGCCG TCGCCGACTG GCCTATCCTG
AACGCTCTGC TCAACTGCGC TGCCGGCGCT GATATCGTCA GCGTTCACGA TGGGGGTGGC
GTCGGTATTG GCAACTCCCT TCATACCAAC AACCACGTCG TCCTCGACGG CTCCGACCTC
GCTGCTGAAA AGGCCCGCCG CGTGTTTACG ACTGACCCCG GCATGGGTGT CATCCGCCAC
GCCGACGCTG GGTACGACGA AGCGCTCAAC GAGGCAACGA CTTCGGATGT CCACGTCCCG
ATGGCTGAGA ACAAATGA
 
Protein sequence
MIYVRMKQQQ TDDDSTDKIG SPSSEWLEYQ GSPTGTDIEC EGWRQEAALR MLNNNLDPEV 
GEKPEDLVVY GGTGRAARSW DAYDTILSEL RTLADDETLL VQSGKPVGRF KTHERSPQVL
IANSNLVGAW DDWEHFHELE SKGLIMYGQM TAGSWAYIGT QGIIQGTFET LAEAARQHFP
EREGLEGTVT VTAGLGGMGG AQPLAVTMNH GVCIAAEVDE HRIDRRIETD YCMEKTDDLD
EAIEMAEDAA ANGEPLSIAL HMNAADMFDG LLERDFVPDI VTDQTSAHDE LEGYYPAGYT
VEEADALREA DPDRYVEESL DTMQRHVEGI LEMQERGAVA FEYGNNIRGQ VEEHREMASA
FDFPGFVPAY IRPLFCQGKG PFRWVALSGD EEDIHRTDDA IKELFPEKDQ LHRWIDLAQE
QVSFQGLPSR VCWLGYQSDD GLTERARFAL RINELVDEGE IAAPVVVTRD HLDAGSVASP
NRETEAMRDG SDAVADWPIL NALLNCAAGA DIVSVHDGGG VGIGNSLHTN NHVVLDGSDL
AAEKARRVFT TDPGMGVIRH ADAGYDEALN EATTSDVHVP MAENK