Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2284 |
Symbol | |
ID | 8384581 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | - |
Start bp | 2333304 |
End bp | 2334407 |
Gene Length | 1104 bp |
Protein Length | 367 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 644973356 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003131184 |
Protein GI | 257053351 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.135107 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTCCAG AGACATCTAC ACGGCGGCGG TTTCTGGCGC TCGGCGGAAC GGCACTCGCG TCCGCGATCG CGGGGTGTTC GACGTCGCTG TCGGCACCCG AGTCGTCGCC AGGGGAGTCG CCGTCGCCCT CCGCAGCCGG AGAGTCTCCC GAGACGGACA CAGCCGAGAG TCCCTACACC CGGGTGTACC GCGAGACCAT CCCCTCGGTC ACGACCGTCC AGACGACGAC CGGCCAGGGA ACCGGCTTCC AGTACGACGA GGGGCACGTC GTCACGAACG CCCACGTCGT CGGGACGGCG AGTGAGGCGC AGGTCCGCTT CCACGACGGG ACGTGGGCGA GTGGCCCGGT GGTCGGGACG GATCCACACA GCGACCTCGC CGTGATCGAA CCCGGGACAG TGCCCGATTC CGCAGCGCCG CTCCCGTTCG GCGACCAGCC GCCGACGATC GGCCGGGAGG TCGTCGTCAT CGGGAACCCG TACAACCTCG ACGGGTCGGT CACGTCGGGG ATCGTCAGCG GCACCGACCG GCTGATCCCG TCGCCGGCGG GCTATCAGAT TCCGGACGCG ATCCAGACCG ACGCGGCGGT GAACCCGGGC AACAGCGGCG GCCCGCTGAT GGATCTCGAC GGTTCCGTGG TGGGCGTCGT GAATTCGAAG CAGGGCGACA GCATCGCCTT CGGGATCTCG GCTGCGCTGA CGCAGCGCGT GGTCCCGGAA CTCATCGAGT CGGGAACCTA CGAGCACGCC TACATGGGCG TCTCACTCGA CGTGGTCACG CCGCGGCTCG CAGAGGCGAA CGACCTGGCG GAGCCACAGG GCCTGCTGGT CGTCCAGACC GTTCGTGGTG GCCCGGCTGA CAGCGTCCTC CAGCCCAGTA GCATCGAGTA CGTCGGTGGA CGCCAGGTCC CCGTGGGTGG CGACGTTATC CGCGCCATCG ACGGGACGCC GATGGAGACC TTCGAGGACC TGGCGAGCTA TCTGGCGCTG GAAACCCGGC CCGGTGATAC CATCGAGGTC GCGATCCTGC GGGACGGTGA CGAGCGAACC GTCGAGGTGA CGCTCGATGC CCGCCCGGAC CGCTCGACGT CACCGCTCCG TTGA
|
Protein sequence | MTPETSTRRR FLALGGTALA SAIAGCSTSL SAPESSPGES PSPSAAGESP ETDTAESPYT RVYRETIPSV TTVQTTTGQG TGFQYDEGHV VTNAHVVGTA SEAQVRFHDG TWASGPVVGT DPHSDLAVIE PGTVPDSAAP LPFGDQPPTI GREVVVIGNP YNLDGSVTSG IVSGTDRLIP SPAGYQIPDA IQTDAAVNPG NSGGPLMDLD GSVVGVVNSK QGDSIAFGIS AALTQRVVPE LIESGTYEHA YMGVSLDVVT PRLAEANDLA EPQGLLVVQT VRGGPADSVL QPSSIEYVGG RQVPVGGDVI RAIDGTPMET FEDLASYLAL ETRPGDTIEV AILRDGDERT VEVTLDARPD RSTSPLR
|
| |