Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_1954 |
Symbol | |
ID | 8384247 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 1978078 |
End bp | 1979739 |
Gene Length | 1662 bp |
Protein Length | 553 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 644973023 |
Product | O-sialoglycoprotein endopeptidase/protein kinase |
Protein accession | YP_003130855 |
Protein GI | 257053022 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGGATTC TCGGCATCGA AGGCACGGCG TGGGCGGCGA GTGCCGCCGT CTACGAACGG ACGGACAGCG GTGAATCCGT CGTTATCGAG ACTGACGCCT ACGAACCCGA CAGCGGCGGG ATTCACCCGC GGGAAGCCGC CGAGCACATG CGCGAGGCGA TCCCGCAGGT CGTCGAACGG GCACTCGACA TCGCCCGCGA GCAGGCTGCC GACGCGGGCG AAGACCCCGA CGAATCGCCG GTCGACGCCG TCGCTTTCTC ACGCGGTCCG GGACTGGGGC CCTGTCTGCG GATCGTCGCC ACGGCCGCCA GGGCACTGGC ACAGCGGCTG GACGTCCCGC TGGTCGGCGT CAATCACATG GTTGCGCATC TGGAGATCGG CCGTCATCGC TCGGGCTTTT CCGCGCCGGT GTGTCTGAAC GCCTCCGGCG CGAACGCCCA CATTCTGGGG TATCGAAACG GGCGGTATCG CGTCCTCGGG GAGACGATGG ACACCGGCGT CGGCAACGCC ATCGACAAGT TCACCCGCCA CCTCGGGTGG TCCCATCCCG GCGGGCCGAA GGTCGAAAAG CGGGCAAAAG ACGGCGAGTA CATCGACCTG CCCTACGTCG TCAAGGGGAT GGACTTCTCC TTTTCGGGAA TCATGAGCGC CGCCAAGCAA GCGATTGACG ATGGGGAGGC AGTAGAGGAC GTCTGCTACT CGCTCCAGGA GAACATCTTC GCGATGCTGA CGGAAGTCGC GGAGCGGGCC CTCTCCCTGA CCGACGCCGA CGAACTCGTC CTCGGCGGGG GTGTCGGGCA GAACGAGCGC CTCAGAGAGA TGCTCGGCAA GATGTGCGAC CAGCGCGGGG CTGATTTTTA CGCGCCCGAA CCCAGATTTC TCCGGGACAA CGCGGGGATG ATCGCCGTCC TCGGCGCGAA GATGTACGAC GCGGGCGACA CGATTCCGAT CGAGGACTCA CGCGTCCGGC CGGACTTCCG GCCCGACGAG GTTGACGTGA CCTGGCGATC CGACGAGGCC GTCGGTTCGT GGGGCGGGTC GAGCGACGAC GGGACGGTCG GTGCCCGGGA CGGAGCGGGA GCCGACGATG CCGTCCAGGG GGCCGAAGCG ACCGTCACCG TCGAGGACGG CCGGGTCAGG AAGGAGCGCC AGCCACGGAC CTACCGCCAT CCGACGCTCG ACGAGCGCCT CCGGACCGAG CGAACGCGCG AAGAAGCCCG ACTCACGAGC GAAGCGCGCC GCGTCGGCGT CCCGACGCCG GTCGTCCACG ACGTCGACCC GCAGGAAGGC GTCCTGGTCT TCGAGCGCGT GGGCGAGCGG GATCTCCGTG AGGCCCTGAC ACTCGATCGG GTCCGGGACG TCGGGCGACA CCTGGCGACG ATCCACGACG CGGGGTTCGT CCACGGCGAT CCGACGACGC GAAATGTCAG AGTTTCAGAA GATCGCACTC ACCTCATCGA CTTCGGCCTG GGCTACTACA CCGGCCACGC CGAGGATCAC GCGATGGACC TCCACGTCTT CGCCCAGTCG CTGGCTGGAA CCGCTGACGA CCCCGAGGCA CTGCGATCGG CCGCCGAGGA CGCCTATCGC GAGACGGCAG ACGAAGGCGG GGCGGTGCTG GATCGTCTCC GCGAGATCGA GGGACGCGGC CGGTATCAGT GA
|
Protein sequence | MRILGIEGTA WAASAAVYER TDSGESVVIE TDAYEPDSGG IHPREAAEHM REAIPQVVER ALDIAREQAA DAGEDPDESP VDAVAFSRGP GLGPCLRIVA TAARALAQRL DVPLVGVNHM VAHLEIGRHR SGFSAPVCLN ASGANAHILG YRNGRYRVLG ETMDTGVGNA IDKFTRHLGW SHPGGPKVEK RAKDGEYIDL PYVVKGMDFS FSGIMSAAKQ AIDDGEAVED VCYSLQENIF AMLTEVAERA LSLTDADELV LGGGVGQNER LREMLGKMCD QRGADFYAPE PRFLRDNAGM IAVLGAKMYD AGDTIPIEDS RVRPDFRPDE VDVTWRSDEA VGSWGGSSDD GTVGARDGAG ADDAVQGAEA TVTVEDGRVR KERQPRTYRH PTLDERLRTE RTREEARLTS EARRVGVPTP VVHDVDPQEG VLVFERVGER DLREALTLDR VRDVGRHLAT IHDAGFVHGD PTTRNVRVSE DRTHLIDFGL GYYTGHAEDH AMDLHVFAQS LAGTADDPEA LRSAAEDAYR ETADEGGAVL DRLREIEGRG RYQ
|
| |