Gene Htur_3952 details


Gene Information

Locus tag: Htur_3952
Symbol:
ID: 8744580
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Haloterrigena turkmenica DSM 5511
Kingdom: Archaea
Replicon accession: NC_013744
Strand:
Start bp: 213371
End bp: 214639
Gene Length: 1269 bp
Protein Length: 422 aa
Translation table: 11
GC content: 63%
IMG OID: 646514533
Product: amidohydrolase
Protein accession: YP_003405480
Protein GI: 284167202
COG category: [F] Nucleotide transport and metabolism; [R] General function prediction only
COG ID: [COG0402] Cytosine deaminase and related metal-dependent hydrolases
TIGRFAM ID:
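
The start, end, gene length, and protein length fields are related by simple arithmetic. The following is a minimal sketch of that consistency check, assuming 1-based inclusive coordinates and a CDS whose length includes one terminal stop codon. The function names are hypothetical and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(213371, 214639)
    print(length)                  # 1269
    print(protein_length(length))  # 422
```

With the values listed in this record, the sketch reproduces the reported gene length of 1269 bp and protein length of 422 aa.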


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGATG AGCGAGAGGT CAGAGAGGAG GTTCACGTCG TCGTCGAGGA CGGAGAAATC 
GTCGACATCG CCGACGGCTA CGAGTCCGCC GAGGAGACGA TCGACGCACG CGACGAGGTC
GTTATTCCGG GACTCGTGAA CTGTCACACG CACATGTACG CGCTTCCGAT CCGCGGAGCA
CCGCTGACCG CGTCCCCGGA GAGCTTCTAC GAAGCGCTGG TCGATATCTG GTGGGAAGTC
GACGAAGCGT TCACGACGCG TGACGCTCGG CTGTCCTCGC TCGGCTCGTG TGCCGAAATG
GTTCGGGGCG GGGTCACGAC GTTCTGTGAT AACTATTCCG GGCCGAACAC GCTGCCCGGG
GCGCTCGACG CCGTCGCCGA CGGCGTCTCG CAGACGCCGA TCCGCGGTAT GATAACGTTC
GAAACGACCG CACGGAACTC CGAAGAAGAG GCCATCGAGG GGATCAGCGA GAACCAGCGG
TACATTCGGG AGTCGGAAGA CGAGTACGAC GGCGTCACCG GCCACTACTG TCTCCACACC
CTGTTTACGA ACACGGAGAG CGTCGTCGAC GAGTGCGTCC GGCGCGCAGT CAGCGACGAC
CGGCCTATCC AGATCCATCT CGAGGAAGGT CTGGTCGACG TCCACGAATC GATCAAGGAG
TACGGAGTAC GACCCGTCCC CGCGCTCGAC TCGATGGGAT TCTTCGAGGC GGACGTCATC
GCCGCCCACT GCGTCCACTC CACGGAACGC GAACTCGAGA TTCTCGCCGA AAACGATGTG
AGGGTCGCGC ACAACCCGTA CTCGAATATC AACAACGCGG TCGGAATCGC CGACGTCGAA
ACGATGGAAG CGCACGACAT GACGATCGGC ATCGGGGACG ATGGCTGGGA CCCCGATATG
TTCGAAACGA TGCGATCGGC CGTCGGCATT CACAAGTTGA AGGAGAACGA TCCGAGCGGC
TTCGACGGAG CGAAAGCGCT CGAGTGGGCG ACCATCGGAA GCGCGGGCGT CCTCGGAATG
GACGATCGGA TCGGCAGCAT CGAAGTCGGC AAGCGCGGCG ACTTCGTCTC GCTCGACCTC
GGGCCGAACC CCGTGCTTCC CGAGAGCGCA CCGTACTACG TCGTCAGTGC CGCGAGCGGG
GCCGACGTGA CGCGGACGGT CATCGACGGT GAGATCGCGT ATAGCCCGGA CCGGGGTGTA
CGCGGCGTAG ACGAAGCGGA CATGGAGACC GTCGGCGAAG CGAGCGCCGA ACTCTGGGAG
CGCCTTTGA
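
A GC content figure such as the one in the Gene Information section can be recomputed from a sequence laid out in space-separated blocks of ten. The sketch below shows one way to do that, assuming the value is simply the fraction of G and C bases over all bases. The helper name `gc_fraction` is hypothetical, and how the database rounds the percentage is not stated in this record.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


if __name__ == "__main__":
    # First two ten-base blocks of the gene sequence, spacing preserved.
    print(gc_fraction("ATGAACGATG AGCGAGAGGT"))
```

Pasting the full gene sequence, line breaks and spaces included, into `gc_fraction` gives the fraction for the whole gene.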
 
Protein sequence
MNDEREVREE VHVVVEDGEI VDIADGYESA EETIDARDEV VIPGLVNCHT HMYALPIRGA 
PLTASPESFY EALVDIWWEV DEAFTTRDAR LSSLGSCAEM VRGGVTTFCD NYSGPNTLPG
ALDAVADGVS QTPIRGMITF ETTARNSEEE AIEGISENQR YIRESEDEYD GVTGHYCLHT
LFTNTESVVD ECVRRAVSDD RPIQIHLEEG LVDVHESIKE YGVRPVPALD SMGFFEADVI
AAHCVHSTER ELEILAENDV RVAHNPYSNI NNAVGIADVE TMEAHDMTIG IGDDGWDPDM
FETMRSAVGI HKLKENDPSG FDGAKALEWA TIGSAGVLGM DDRIGSIEVG KRGDFVSLDL
GPNPVLPESA PYYVVSAASG ADVTRTVIDG EIAYSPDRGV RGVDEADMET VGEASAELWE
RL
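
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below illustrates this, assuming that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons they permit). The names `CODON_TABLE` and `translate` are hypothetical, and the function stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(sequence: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence.
    print(translate("ATGAACGATG AGCGAGAGGT CAGAGAGGAG"))  # MNDEREVREE
```

The first thirty bases of the gene translate to MNDEREVREE, the first block of the protein sequence, and the final nine bases (CGCCTTTGA) translate to RL followed by a stop codon.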