Gene Htur_3918 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHtur_3918 
Symbol 
ID8744546 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaloterrigena turkmenica DSM 5511 
KingdomArchaea 
Replicon accessionNC_013744 
Strand
Start bp175108 
End bp176436 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content60% 
IMG OID646514499 
Productamidohydrolase 
Protein accessionYP_003405446 
Protein GI284167168 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTACTCG CTGGCACCGT AATCGTCGAC TCGAGCACCG TTATCAACGA CGGTGCCGTC 
GTCGTGACCG ATTCGATCAT CGAAGCCGTC GGAGAATACG CGGTCCTCGC GGATCGATAT
CCGGATCACG ATCAGCGGGA GTACGACGTC CTCCTACCTG GCCTCGTCGG TGGTCATATT
CACTCCGTAC AGAGTCTAGG CCGCGGTATC GCCGACGATA CGGAGCTCCT CGACTGGTTG
TTCGACTATA TTCTACCGAT GGAAGCGTCG CTTTCGGCAG AAGAGATGGA AGTGGCCGCG
AAACTCGGAT ATCTAGAGAT GATAGAGAGC GGGACGACGA CGTGCGTGGA CCATCTCTCC
GTCGACCACG CGGATCGAGC GTTCCAGGCC GCGGGAGAAA TCGGCATTCG CGGCGTCCTC
GGAAAAGTGC TGATGGATCG CCGGTCACCG ACGAATCTTC TGGAAGACAC GTCGGATGCG
CTGGCGGAAA CGGAACGCCT GATCGAGGAG TACCACGGTT CGTTCGACGA CCGAATCCGA
TACGCTGTTA CTCCTCGGTT CGCCGTTTCT TGTACCGAGG AGTGTCTGCG CGGCGCTCGC
GAACTCGCCG ACGAGTACGA AGGCGTCAGA ATCCACACGC ACGCGAGCGA GAATCAGAGC
GAAATCGAGA CCGTCAAAGA AGACACCGGG ATGCGAAATA TCCACTGGCT CGACGAGGTC
GGTCTCACTG GCGAGGATGT CGTCCTCGCT CACTGCGTTT GGACGGACGA GAGCGAACGG
CAGGTCCTCG AAGAAACGGG GACACACGTC ACCCACTGTC CGTCTTCGAA TATGAAACTC
GCGAGCGGTA TCGCCCCCGT CTGGGACTAC CTCGAGCGAG GTATCAACGT CGCGCTCGGC
AACGACGGGC CACCCTGTAA CAACACGCTC GACCCGTTCA CCGAAATGCG ACAGGCGAGC
CTCCTGCAGA AAGTGGATCG ACTCGATCCG ACCGCGACCC CCGCGAGTGA GATATTCGAA
ATGGCCACGA TAAACGGCGC GAAAGCGGCC GGGTTCGACC GTCTGGGAGC AATCCGCGAA
GGATGGCGCG CCGACATCGT GGGCATTCGA ACGGATATCA CGCGTGCGAC TCCGCTTCAC
GACGTCCTCT CTCACCTCGT GTTCGGCGCT CACGGAGAGG ACGTGGTGTT CTCGATGGTC
GACGGGAACG TGCTCATGGA AGACGGCGAA GTAACGACGG TGGACGCGGA AACGGTTCGA
CGGAGGGCCG ACGAGATCGG TCTCTCACTC GAGTCTCACC GCGAGGCGGC GAAGGAAGTG
AAACCGTGA
 
Protein sequence
MLLAGTVIVD SSTVINDGAV VVTDSIIEAV GEYAVLADRY PDHDQREYDV LLPGLVGGHI 
HSVQSLGRGI ADDTELLDWL FDYILPMEAS LSAEEMEVAA KLGYLEMIES GTTTCVDHLS
VDHADRAFQA AGEIGIRGVL GKVLMDRRSP TNLLEDTSDA LAETERLIEE YHGSFDDRIR
YAVTPRFAVS CTEECLRGAR ELADEYEGVR IHTHASENQS EIETVKEDTG MRNIHWLDEV
GLTGEDVVLA HCVWTDESER QVLEETGTHV THCPSSNMKL ASGIAPVWDY LERGINVALG
NDGPPCNNTL DPFTEMRQAS LLQKVDRLDP TATPASEIFE MATINGAKAA GFDRLGAIRE
GWRADIVGIR TDITRATPLH DVLSHLVFGA HGEDVVFSMV DGNVLMEDGE VTTVDAETVR
RRADEIGLSL ESHREAAKEV KP