Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Huta_2751 |
Symbol | |
ID | 8385057 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Halorhabdus utahensis DSM 12940 |
Kingdom | Archaea |
Replicon accession | NC_013158 |
Strand | + |
Start bp | 2822529 |
End bp | 2823602 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 63% |
IMG OID | 644973826 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003131645 |
Protein GI | 257053812 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGTGC TCGGTCTCGC TGGCTGCGTC GAGTCACCGA CCGGAGATGT CGCCACGTCT GCGTCCGAGC AGTCGGCACC GACGACGAAC ACTACCGTCG AAAACCAGGT TGACGAGCCC GCGACCGATG ATCGGTCACG GTACGCCGAC GTCTACCGAT CAGTTAGCGA TTCGGTTGTC CAGATCAGGG TCATCACGCC CTTTGGGACG GCGGGAACCG GCAGCGGGTT CGTCTACGAC GAGCGCCACC TCGTGACGAA CGAACACGTG GTTTCGAACG CCGAGGAACT GTACGTCCGC TATCCTTCGA CCGGGTGGCG GGAGGCGTCT GTCGTCGGCA CGGACAGCGA TAGCGACCTT GCGGTCTTGT CCGTTGACGA GCACCCACCC GCCGCCGGTT CGTTGTCGCT CGTCGAAGAC GAACCGGCTG TCGGAACCGA AGTGGTTGCG ATCGGGAATC CATATGGACT CTCGGGGTCA GTCTCGGCGG GCATTGTCAG CGGTGTCGAC CGGACACTGT CGAGCCCAGG CGAGTTCTCG ATCCCCGATA CGATTCAGAC CGACGCAGCC GTGAATCCGG GCAACAGCGG AGGGCCGCTC GTCAACCTGG AGGGAGAGGT CGTCGGCGTC ATCAGCGCCG GCCAGGGTGA CAATATCGGA CTGGCCATCT CCAGTGCGTT GACACGCAAC GTCGTCCCGG CGTTGATCGA GACGGGTTCG TACGAGCATC CCTACCTCGG TATCCGGCTG CTCGACGTCA CCCCAGCAGT GGCAGAAGCT AACGACCTTT CGGAAGCCTC GGGTGTCTAC GTGACGGAGA CCATCGAGGG CGATCCGTCG GACGGTGTTC TGCAAGGCGC GACCGAAGAA ACGGTCGTCA ATGGCCAATC GATACCCGTC GGCGGCGACG TGATCACGCA TATCGAAGGC GAACCGACGC CGACCAGCCA GCAGTTGGGG AGCGTCCTCG CACTCGAAAC CCAGGTCGGA CAGCCGGCCA CGATTCGGGT CCTTCGGGAT GGTGCGACCG AGACGCTGGA AGTGACGATC GGATCTCGGA GCGAGGCGGA GTGA
|
Protein sequence | MAVLGLAGCV ESPTGDVATS ASEQSAPTTN TTVENQVDEP ATDDRSRYAD VYRSVSDSVV QIRVITPFGT AGTGSGFVYD ERHLVTNEHV VSNAEELYVR YPSTGWREAS VVGTDSDSDL AVLSVDEHPP AAGSLSLVED EPAVGTEVVA IGNPYGLSGS VSAGIVSGVD RTLSSPGEFS IPDTIQTDAA VNPGNSGGPL VNLEGEVVGV ISAGQGDNIG LAISSALTRN VVPALIETGS YEHPYLGIRL LDVTPAVAEA NDLSEASGVY VTETIEGDPS DGVLQGATEE TVVNGQSIPV GGDVITHIEG EPTPTSQQLG SVLALETQVG QPATIRVLRD GATETLEVTI GSRSEAE
|
| |