Gene Information |
Locus tag | Htur_2644 |
Symbol | |
ID | 8743257 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haloterrigena turkmenica DSM 5511 |
Kingdom | Archaea |
Replicon accession | NC_013743 |
Strand | - |
Start bp | 2711883 |
End bp | 2713070 |
Gene Length | 1188 bp |
Protein Length | 395 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 646513232 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_003404193 |
Protein GI | 284165914 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | [TIGR01409] Tat (twin-arginine translocation) pathway signal sequence |
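
The start, end, gene length and protein length listed above are consistent with one another, and the relationship can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which the listed gene length implies) and that the protein length excludes the terminal stop codon. The function names are illustrative and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp rows of this record.
length = gene_length_bp(2711883, 2713070)
print(length, protein_length_aa(length))  # prints: 1188 395
```

Both numbers agree with the Gene Length and Protein Length rows. The strand does not affect this calculation, since start and end are given as positions on the replicon regardless of orientation.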
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGCTCG TCGGGGTGAC AATGGGCGTG ACGCCAGATC CAGATCAGTG TACTCGCAGA CGGATACTCG GCGCCGTCGG CGCAGCCGGG GCCGCAGCGG CGATCGGTCT CGGCGGTACC GGTGGCGGCA GTGCACAGAA CGAAACCAAC GGGTCGAACG CCACGGTCGG ACAGGACGAC GGTCCCGCCG TCGACAGTCC GTACACGGAG ACGTATCGCA ACACCATCGA TTCCGTGGTT CTGGTCACCG TCTCCGGCAC GGGTGGTCTC GAGGGCGGTG GCGGCGGACT CGGCACCGGG TTCGTCGTCG ACGACCAGCA CGTCGTGACC AACAACCACG TCGTCCAGAA CGCGAGCGAG GGCGGGATCG AAATCCAGTT CAACAATCAG GAGTGGCGGA CCGCGTCGGT CGTCGGGACC GACGGCTACA GCGATCTCGC CGTCCTGCGC GTCGACGACA TGCCCGACAT CGCCGCCGGG CTCTCGCTCT CGGAGTCCGA GCCCGTGATC GGGCAGGAGG TGCTCGCGAT CGGCAATCCC CTCGGGTTCG ACGCGTCGGT CTCGCAGGGA ATCGTCAGCG GAATCGACCG CTCGCTCCCC AGCCCTACTG GCTTCTCGAT TCCGGCCGCG ATCCAGACCG ACGCGCCGAT CAACCCCGGC AACAGCGGCG GCCCGCTGGT GAGCCTCGAG GGAGAGGTGC TCGGCGTGGT CTTCGCCGGG GCCGGCCAGA CCATCGGCTT CGCGATCTCC GCGCGCCTCG CGAACCGGGT CGTCCCCGCG CTCATCGAAG ACGGAACGTA CGAACACCCC TACATGGGCG TCGGGGTCCT CCCGGTCGGA CCGGAGATCG CCGACGAGAT CGGCCTCGAG GAAGCCAACG GCGTGCTGGT CGCCGAGGTC GTTCCGAACT CGCCGGCGGA CGGCGTTCTC CAGTCGGCCA ACCGCGTCCG GCCAGGCAGC GGCGACGTTA TCGTCGCCAT CAACGGAACG GAGATCCCGA ATCAGGATCA GCTCTCAGCG TACCTCGCGC TCGAGACCTC GCCCGGCGAC ACGATCGAAC TCGAAATCGT CCGGGACGGC GAGCAGCAAA CCGTCGAGTT GACCCTCGAG GAGCGGCCGG GTATCGAACG ACCCGGGACC AGGGTTCCGG GCGGCCCGGG CGAGCGACCG CCGGCGGCGG GGCCGTAA
|
Protein sequence | MPLVGVTMGV TPDPDQCTRR RILGAVGAAG AAAAIGLGGT GGGSAQNETN GSNATVGQDD GPAVDSPYTE TYRNTIDSVV LVTVSGTGGL EGGGGGLGTG FVVDDQHVVT NNHVVQNASE GGIEIQFNNQ EWRTASVVGT DGYSDLAVLR VDDMPDIAAG LSLSESEPVI GQEVLAIGNP LGFDASVSQG IVSGIDRSLP SPTGFSIPAA IQTDAPINPG NSGGPLVSLE GEVLGVVFAG AGQTIGFAIS ARLANRVVPA LIEDGTYEHP YMGVGVLPVG PEIADEIGLE EANGVLVAEV VPNSPADGVL QSANRVRPGS GDVIVAINGT EIPNQDQLSA YLALETSPGD TIELEIVRDG EQQTVELTLE ERPGIERPGT RVPGGPGERP PAAGP
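
The gene sequence, protein sequence, translation table and GC content rows are related, and that relationship can be sketched in code. The example below is a hedged illustration, not the method used by the database. It assumes the gene sequence is shown in coding orientation (it begins with ATG and ends with TAA), strips the spaces between the 10-base blocks, and uses the codon-to-amino-acid assignments of NCBI translation table 11, which are the same as those of the standard code. Table 11 additionally allows alternative start codons, which this sketch does not handle. The names `CODON_TABLE`, `clean`, `gc_content` and `translate` are hypothetical helpers.

```python
from itertools import product

# NCBI ordering: first base varies slowest, third base fastest, bases in order TCAG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace used for display grouping and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence in this record.
print(translate("ATGCCGCTCG TCGGGGTGAC AATGGGCGTG"))  # prints: MPLVGVTMGV
```

The printed residues match the first block of the protein sequence above. Passing the full gene sequence to `gc_content` gives the fraction that the GC content row reports as a rounded percentage.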