Gene Information |
Locus tag | Huta_0035 |
Symbol | |
ID | 8382295 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 34293 |
End bp | 35381 |
Gene Length | 1089 bp |
Protein Length | 362 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 644971093 |
Product | nitrite reductase, copper-containing |
Protein accession | YP_003128957 |
Protein GI | 257051124 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2132] Putative multicopper oxidases |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence; [TIGR02376] nitrite reductase, copper-containing |
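The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the reported 1089 bp) and that the CDS is unspliced and ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_len_bp // 3 - 1


length = gene_length_bp(34293, 35381)
print(length, protein_length_aa(length))  # prints: 1089 362
```

Both values agree with the Gene Length and Protein Length rows of this record.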
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.208873 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCATCTA TCCCGGTCGC CACACGACGG CGCGTTCTCC AGGCCCTCGG AGTCGGCGGC GCCGCGGCCC TCGCCGGCTG TGGTGCGTCG AGCGACACCG ACGACCAGCC TGCCGAGAAT ACGTCACCCA CCGAACCTCA CATGAACGAT CCCCAGATGA CAGACGTCGA CCGTATCGCC GCCGATCCGA CCGACGTCCC TGATCCGATC GACCGGTCCT CGCCGGCGAC AGTCAACGTC GAACTCGAGA CGCGCGAACT CGTCGCCGAG GTCGAACCGG GCGTCACCTA CACCTACATG ACCTTCGACA ATCAGGTTCC GGGGCCACTG ATCCGGGTCC GCAAAGGTGA CACGGTGAAT ATGACGGTCA CCAGCCACGA GGACAACACG ATGCCCCACA ACATCGACCT CCACGCGGTC AGGGGGCCAG GTGGCGGGGC CGAGGCGTCG ATGGTCGCCC CCGGCGAAAC CGAGACCTTC CAGTTCAAGG CGACCTACCC CGGCGCATTC ATCTATCACT GTGCCGTCCC GAACTTGGAC TATCACATCG CCTCGGGGAT GTACGGTCTC ATTCTCGTCG AACCCGAGGA CGGGCTTCCC GAGGTCGATC ACGAACTCTA CTTCGGCCAG AACGAACTCT ATACCACGGG CGACGTATCC CAGGATGGCC ACCACGACTT CGACATGGAT GCGATGACGG CCGAAGAGCC GACGTACGTG CTCATGAACG GTGAGAGTCG CGCTATTACG GAAAATCGGT ATGGCCCGGT AACCGTCGAC GTCGGCGACA CCGCCCGCGT GTACTTCGTC AACGGTGGTC CAAACCTCAC GTCGAGTTTC CACCCGATCG GGTGTGTCTG GGACGAAGTC CATCCCCAGG GTGGGATCGG GGGGCCGCCC CATCGAAACA TCCAGACGAC GCCCGTCATG CCCGGCTCGG CGACCATCGC GACGATGCAC TTCGAGGTCC CCGGCCCGGT GAAACTCGTC GATCACGCCC TCTCGCGGGT CGCCCGGAAA GGGCTCCTGG CCGTCGTCGA AGCCGAGGGC GACGCCCGTC CTGATCTCTT TGATCCCGAT CCGGACTGA
|
Protein sequence | MSSIPVATRR RVLQALGVGG AAALAGCGAS SDTDDQPAEN TSPTEPHMND PQMTDVDRIA ADPTDVPDPI DRSSPATVNV ELETRELVAE VEPGVTYTYM TFDNQVPGPL IRVRKGDTVN MTVTSHEDNT MPHNIDLHAV RGPGGGAEAS MVAPGETETF QFKATYPGAF IYHCAVPNLD YHIASGMYGL ILVEPEDGLP EVDHELYFGQ NELYTTGDVS QDGHHDFDMD AMTAEEPTYV LMNGESRAIT ENRYGPVTVD VGDTARVYFV NGGPNLTSSF HPIGCVWDEV HPQGGIGGPP HRNIQTTPVM PGSATIATMH FEVPGPVKLV DHALSRVARK GLLAVVEAEG DARPDLFDPD PD
|
| |
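The sequences above are grouped in space-separated blocks of ten, so they need cleaning before any downstream use. The following is a minimal sketch of how one might strip the grouping, compute a GC fraction, and check the reading frame. The helper names are hypothetical, and the frame check only tests for an ATG start and the three stop codons of translation table 11; alternative start codons are deliberately not handled.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the whitespace that groups residues into blocks; upper-case."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA sequence (0.0 to 1.0)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_atg_start_and_stop(dna: str) -> bool:
    """True if the CDS is in frame, begins with ATG, and ends with a stop codon."""
    seq = clean_sequence(dna)
    return len(seq) % 3 == 0 and seq[:3] == "ATG" and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence in this record.
fragment = "ATGTCATCTA TCCCGGTCGC"
print(len(clean_sequence(fragment)), gc_fraction(fragment))  # prints: 20 0.55
```

Passing the full Gene sequence field to `gc_fraction` should give a value comparable to the GC content row, and `has_atg_start_and_stop` can confirm that the sequence ends in a stop codon (here TGA).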