Gene Information |
Locus tag | Huta_0755 |
Symbol | |
ID | 8383025 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 739335 |
End bp | 740354 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 644971818 |
Product | amidohydrolase |
Protein accession | YP_003129673 |
Protein GI | 257051840 |
COG category | [F] Nucleotide transport and metabolism; [R] General function prediction only |
COG ID | [COG0402] Cytosine deaminase and related metal-dependent hydrolases |
TIGRFAM ID | |
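
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive and that the CDS ends in a single stop codon. Neither convention is stated in the record, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 739335
END_BP = 740354
GENE_LENGTH_BP = 1020
PROTEIN_LENGTH_AA = 339


def span_length(start, end):
    """Length of a coordinate span, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_bp):
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if cds_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1020
print(expected_protein_length(GENE_LENGTH_BP))  # 339
```

Under these assumptions the reported gene length (1020 bp) and protein length (339 aa) agree with the coordinates.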
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.219334 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAACTGG AAGGGACGAT TCTCCGCGGG CGGGAGTTCG CGCCGGTCGA AGGCCGAGTC GTCGTCGAGG ACGGTGAGAT CACAGCCGTC GAAGAAGCCA GCATCGACAC CGATCAGATC ATCCTGCCGG CGTTCGTCAA CGCCCACACG CACATCGGCG ATTCCATCGC AAAAGAAGCC GGGGCCGGCC TCAGCCTCGA CGAACTCGTT GCGCCGCCGG ACGGGTTGAA ACACCAGCTC CTCCGTGAGG CGAGTCGCGA GGAGCTTGTC GCTGCCATGC GGCGCTCGCT TTCGTTCATG GAACGGGCCG GAACTGGTGC GTTCCTCGAG TTCCGGGAGG GCGGTGTCTC CGGCGTCGAA GCGATCGAAG CGGCCCTCGA CGGGCTCGCG ATCGAGGGCG TGATCCTGGG CCGGGAGACG ATCGACGCGA TGGCCGCCGC CGACGGCTTC GGCGCAAGCG GCGCACGCGA CGGCGAGTTC GGGGCCGAAC GCTCGGCGAC ACGCGAGGCC GGCAAACTCT TTGGCATCCA CGCCGGTGAG CGTGACAGCG ACGACGTCAA CCCGGCGCTG GATCTGGATC CGGACTTCCT CGTTCATATG GTCCACCTCG ACGCGATTCA CTACGAACGC CTCGACGACG AGGGGACACC GGTCGTCCTC TGTCCACGTT CGAACCTCGT GACCGACGCC GGGGTCGCGC CGGCCCGCGA ACTCTTCGAC CGCACGACAG TCGCACTCGG GACGGACAAC GTGTTCCTCA ACAGCCCGTC GATGTTCCGC GAGATGGAAT TCGCCGCGAA ACTCTACGAC GTCTCGGCGC GGGAAGTCCT GCAGATGGCG ACGGTCGCCG GTGCCGAGAT CGCCGGGCTG GACGCCGGCG TGATCGAGCC GGGACGCGAG GCTCGGCTGC TGGTTCTGGA CGGCGACTCG GACAATCTCG CCGGCGCGGA AGACGTCGTC CGCGCCGTGG TTCGGCGGGC CGGCGTCGAC GACGTGACTG ACGTCCTCCT CGCGAACTGA
|
Protein sequence | MQLEGTILRG REFAPVEGRV VVEDGEITAV EEASIDTDQI ILPAFVNAHT HIGDSIAKEA GAGLSLDELV APPDGLKHQL LREASREELV AAMRRSLSFM ERAGTGAFLE FREGGVSGVE AIEAALDGLA IEGVILGRET IDAMAAADGF GASGARDGEF GAERSATREA GKLFGIHAGE RDSDDVNPAL DLDPDFLVHM VHLDAIHYER LDDEGTPVVL CPRSNLVTDA GVAPARELFD RTTVALGTDN VFLNSPSMFR EMEFAAKLYD VSAREVLQMA TVAGAEIAGL DAGVIEPGRE ARLLVLDGDS DNLAGAEDVV RAVVRRAGVD DVTDVLLAN
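
The sequence fields can be checked against the GC content and translation table entries with a short script. This is a sketch in plain Python, not the method the database uses. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as start codons; since this gene starts with ATG, no special start handling is included. The helper names are hypothetical, and only the first 30 bases of the gene sequence are used as the example input.

```python
# Codon table built from the standard NCBI ordering (bases T, C, A, G).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-letter groups and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate complete codons from position 0, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - len(s) % 3, 3):
        residue = CODON_TABLE[s[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three 10-base groups of the gene sequence above.
fragment = "ATGCAACTGG AAGGGACGAT TCTCCGCGGG"
print(translate(fragment))              # MQLEGTILRG
print(round(gc_fraction(fragment), 2))  # 0.6
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the protein sequence and the GC content reported in the record.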