Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Htur_3952 |
Symbol | |
ID | 8744580 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 213371 |
End bp | 214639 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 646514533 |
Product | amidohydrolase |
Protein accession | YP_003405480 |
Protein GI | 284167202 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGATG AGCGAGAGGT CAGAGAGGAG GTTCACGTCG TCGTCGAGGA CGGAGAAATC GTCGACATCG CCGACGGCTA CGAGTCCGCC GAGGAGACGA TCGACGCACG CGACGAGGTC GTTATTCCGG GACTCGTGAA CTGTCACACG CACATGTACG CGCTTCCGAT CCGCGGAGCA CCGCTGACCG CGTCCCCGGA GAGCTTCTAC GAAGCGCTGG TCGATATCTG GTGGGAAGTC GACGAAGCGT TCACGACGCG TGACGCTCGG CTGTCCTCGC TCGGCTCGTG TGCCGAAATG GTTCGGGGCG GGGTCACGAC GTTCTGTGAT AACTATTCCG GGCCGAACAC GCTGCCCGGG GCGCTCGACG CCGTCGCCGA CGGCGTCTCG CAGACGCCGA TCCGCGGTAT GATAACGTTC GAAACGACCG CACGGAACTC CGAAGAAGAG GCCATCGAGG GGATCAGCGA GAACCAGCGG TACATTCGGG AGTCGGAAGA CGAGTACGAC GGCGTCACCG GCCACTACTG TCTCCACACC CTGTTTACGA ACACGGAGAG CGTCGTCGAC GAGTGCGTCC GGCGCGCAGT CAGCGACGAC CGGCCTATCC AGATCCATCT CGAGGAAGGT CTGGTCGACG TCCACGAATC GATCAAGGAG TACGGAGTAC GACCCGTCCC CGCGCTCGAC TCGATGGGAT TCTTCGAGGC GGACGTCATC GCCGCCCACT GCGTCCACTC CACGGAACGC GAACTCGAGA TTCTCGCCGA AAACGATGTG AGGGTCGCGC ACAACCCGTA CTCGAATATC AACAACGCGG TCGGAATCGC CGACGTCGAA ACGATGGAAG CGCACGACAT GACGATCGGC ATCGGGGACG ATGGCTGGGA CCCCGATATG TTCGAAACGA TGCGATCGGC CGTCGGCATT CACAAGTTGA AGGAGAACGA TCCGAGCGGC TTCGACGGAG CGAAAGCGCT CGAGTGGGCG ACCATCGGAA GCGCGGGCGT CCTCGGAATG GACGATCGGA TCGGCAGCAT CGAAGTCGGC AAGCGCGGCG ACTTCGTCTC GCTCGACCTC GGGCCGAACC CCGTGCTTCC CGAGAGCGCA CCGTACTACG TCGTCAGTGC CGCGAGCGGG GCCGACGTGA CGCGGACGGT CATCGACGGT GAGATCGCGT ATAGCCCGGA CCGGGGTGTA CGCGGCGTAG ACGAAGCGGA CATGGAGACC GTCGGCGAAG CGAGCGCCGA ACTCTGGGAG CGCCTTTGA
|
Protein sequence | MNDEREVREE VHVVVEDGEI VDIADGYESA EETIDARDEV VIPGLVNCHT HMYALPIRGA PLTASPESFY EALVDIWWEV DEAFTTRDAR LSSLGSCAEM VRGGVTTFCD NYSGPNTLPG ALDAVADGVS QTPIRGMITF ETTARNSEEE AIEGISENQR YIRESEDEYD GVTGHYCLHT LFTNTESVVD ECVRRAVSDD RPIQIHLEEG LVDVHESIKE YGVRPVPALD SMGFFEADVI AAHCVHSTER ELEILAENDV RVAHNPYSNI NNAVGIADVE TMEAHDMTIG IGDDGWDPDM FETMRSAVGI HKLKENDPSG FDGAKALEWA TIGSAGVLGM DDRIGSIEVG KRGDFVSLDL GPNPVLPESA PYYVVSAASG ADVTRTVIDG EIAYSPDRGV RGVDEADMET VGEASAELWE RL
|
| |