Gene Information |
Locus tag | Htur_3918 |
Symbol | |
ID | 8744546 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013744 |
Strand | + |
Start bp | 175108 |
End bp | 176436 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 646514499 |
Product | amidohydrolase |
Protein accession | YP_003405446 |
Protein GI | 284167168 |
COG category | [F] Nucleotide transport and metabolism [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
|
|
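The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon is a stop codon. Neither convention is stated in the record, but both agree with the reported lengths. The variable names are illustrative only.

```python
# Values copied from the Gene Information table above.
start_bp = 175108
end_bp = 176436

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# One codon per three bases; the last codon is assumed to be a stop codon,
# which contributes no residue to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # 1329 442
```

Both results match the Gene Length (1329 bp) and Protein Length (442 aa) rows.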
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTACTCG CTGGCACCGT AATCGTCGAC TCGAGCACCG TTATCAACGA CGGTGCCGTC GTCGTGACCG ATTCGATCAT CGAAGCCGTC GGAGAATACG CGGTCCTCGC GGATCGATAT CCGGATCACG ATCAGCGGGA GTACGACGTC CTCCTACCTG GCCTCGTCGG TGGTCATATT CACTCCGTAC AGAGTCTAGG CCGCGGTATC GCCGACGATA CGGAGCTCCT CGACTGGTTG TTCGACTATA TTCTACCGAT GGAAGCGTCG CTTTCGGCAG AAGAGATGGA AGTGGCCGCG AAACTCGGAT ATCTAGAGAT GATAGAGAGC GGGACGACGA CGTGCGTGGA CCATCTCTCC GTCGACCACG CGGATCGAGC GTTCCAGGCC GCGGGAGAAA TCGGCATTCG CGGCGTCCTC GGAAAAGTGC TGATGGATCG CCGGTCACCG ACGAATCTTC TGGAAGACAC GTCGGATGCG CTGGCGGAAA CGGAACGCCT GATCGAGGAG TACCACGGTT CGTTCGACGA CCGAATCCGA TACGCTGTTA CTCCTCGGTT CGCCGTTTCT TGTACCGAGG AGTGTCTGCG CGGCGCTCGC GAACTCGCCG ACGAGTACGA AGGCGTCAGA ATCCACACGC ACGCGAGCGA GAATCAGAGC GAAATCGAGA CCGTCAAAGA AGACACCGGG ATGCGAAATA TCCACTGGCT CGACGAGGTC GGTCTCACTG GCGAGGATGT CGTCCTCGCT CACTGCGTTT GGACGGACGA GAGCGAACGG CAGGTCCTCG AAGAAACGGG GACACACGTC ACCCACTGTC CGTCTTCGAA TATGAAACTC GCGAGCGGTA TCGCCCCCGT CTGGGACTAC CTCGAGCGAG GTATCAACGT CGCGCTCGGC AACGACGGGC CACCCTGTAA CAACACGCTC GACCCGTTCA CCGAAATGCG ACAGGCGAGC CTCCTGCAGA AAGTGGATCG ACTCGATCCG ACCGCGACCC CCGCGAGTGA GATATTCGAA ATGGCCACGA TAAACGGCGC GAAAGCGGCC GGGTTCGACC GTCTGGGAGC AATCCGCGAA GGATGGCGCG CCGACATCGT GGGCATTCGA ACGGATATCA CGCGTGCGAC TCCGCTTCAC GACGTCCTCT CTCACCTCGT GTTCGGCGCT CACGGAGAGG ACGTGGTGTT CTCGATGGTC GACGGGAACG TGCTCATGGA AGACGGCGAA GTAACGACGG TGGACGCGGA AACGGTTCGA CGGAGGGCCG ACGAGATCGG TCTCTCACTC GAGTCTCACC GCGAGGCGGC GAAGGAAGTG AAACCGTGA
|
Protein sequence | MLLAGTVIVD SSTVINDGAV VVTDSIIEAV GEYAVLADRY PDHDQREYDV LLPGLVGGHI HSVQSLGRGI ADDTELLDWL FDYILPMEAS LSAEEMEVAA KLGYLEMIES GTTTCVDHLS VDHADRAFQA AGEIGIRGVL GKVLMDRRSP TNLLEDTSDA LAETERLIEE YHGSFDDRIR YAVTPRFAVS CTEECLRGAR ELADEYEGVR IHTHASENQS EIETVKEDTG MRNIHWLDEV GLTGEDVVLA HCVWTDESER QVLEETGTHV THCPSSNMKL ASGIAPVWDY LERGINVALG NDGPPCNNTL DPFTEMRQAS LLQKVDRLDP TATPASEIFE MATINGAKAA GFDRLGAIRE GWRADIVGIR TDITRATPLH DVLSHLVFGA HGEDVVFSMV DGNVLMEDGE VTTVDAETVR RRADEIGLSL ESHREAAKEV KP
|
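The protein sequence can be reproduced from the gene sequence by codon translation. The following is a minimal sketch, not the database's own pipeline: it builds the codon table by hand (translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs only in permitted start codons), strips the spaces that separate the 10-base blocks, and stops at the first stop codon. The function name `translate` and the constants are hypothetical helpers introduced for illustration.

```python
from itertools import product

# Codon assignments in the conventional T, C, A, G ordering.
# Table 11 shares these assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a DNA string, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the gene sequence listed above.
first_blocks = "ATGTTACTCG CTGGCACCGT AATCGTCGAC"
print(translate(first_blocks))  # MLLAGTVIVD
```

The output matches the first block of the protein sequence. Passing the full gene sequence in place of `first_blocks` should yield the full protein under the same assumptions.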