Gene Information |
Locus tag | Tneu_1199 |
Symbol | |
ID | 6165281 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thermoproteus neutrophilus V24Sta |
Kingdom | Archaea |
Replicon accession | NC_010525 |
Strand | + |
Start bp | 1084254 |
End bp | 1085264 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641668348 |
Product | glycoprotease family metalloendopeptidase |
Protein accession | YP_001794573 |
Protein GI | 171185654 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
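The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record for Tneu_1199.
length = gene_length_bp(1084254, 1085264)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```

Both results match the Gene Length and Protein Length fields of the record.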
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.834375 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGCTGGTCC TCGGCGTTGA GTCGACCGCC CACACCTTCA GCATAGGGGT CGTTAAAGAC GGCGTGGTGC TCGGCCAACT GGGGAAGACC TACATCCCGC CGGGGGGCGG GGGGATACAC CCCCGCGAGG CGGCTGAGCA CCACGCCAGG GTGGCCCCCT CCATACTCCG CCAGCTCCTG GGCCAGCTGG GGGTGGGGCT GTCGGACATC GGCGCGGTGG CCTACGCCGC CGGCCCAGGT CTAGGCCCCG CCCTCAGGGT GGGGGCCGTA CTGGCGAGGG CCTTGGCCAT TAGGCTGGGC GTGCCGGTTG TGCCCGTGCA CCACGGCGTG GCGCACATCG AGGTGGCCCG CTACGCCACC GGCGCGTGCG ACCCGCTGGT GGTTCTGATC TCCGGCGGCC ACACGGTGGT GGCGGGGTAC TCCGATGGGC GCTATAGGGT TTTCGGCGAA ACCCTCGACG TGGCTATCGG AAACGCCATT GACATGTTTG CGAGGGAGGT GGGGCTGGGC TTCCCGGGGG TGCCGGCGGT GGAGAAATGC GCCGAGTCCG CGGAGACGGT GGTGCCCTTC CCCATGCCGA TAGTTGGGCA GGACCTCTCC TATGCGGGGC TCGCCACCCA CGCGCTTCAG CTCGTGAAGA GGGGGGTCCC CCTCCCCGTG GTCTGCAGAT CGCTTGTGGA AACCGCCTAC TACATGCTTG CGGAGGTGGT GGAGAGGGCG CTGGCCTATA CGAGGAAGAG GGAGGTGGTG GTGGCGGGGG GCGTCGCGAG GAGCAGGCGG CTGAAGGAGA TCCTGCGGGC CGTGGGCGAG GAGCACGGCG CCGTTGTGAA GGTTGTCCCC GACGAATATG CGGGCGACAA CGGGGCCATG ATAGCCCTCA CCGGCTACTA CGCCTATAGA CGCGGCGTAT ACACCACGCC GGAGGGCAGC TTCGTGAGGC AGAGGTGGAG GCTAGACAGC GTGGACGTGC CCTGGTTCCG CGACCTCTGC CCGGTCACAA CGTATATATA G
Protein sequence | MLVLGVESTA HTFSIGVVKD GVVLGQLGKT YIPPGGGGIH PREAAEHHAR VAPSILRQLL GQLGVGLSDI GAVAYAAGPG LGPALRVGAV LARALAIRLG VPVVPVHHGV AHIEVARYAT GACDPLVVLI SGGHTVVAGY SDGRYRVFGE TLDVAIGNAI DMFAREVGLG FPGVPAVEKC AESAETVVPF PMPIVGQDLS YAGLATHALQ LVKRGVPLPV VCRSLVETAY YMLAEVVERA LAYTRKREVV VAGGVARSRR LKEILRAVGE EHGAVVKVVP DEYAGDNGAM IALTGYYAYR RGVYTTPEGS FVRQRWRLDS VDVPWFRDLC PVTTYI
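The gene sequence starts with GTG, yet the protein starts with M. Under translation table 11, GTG is an accepted alternative start codon, and an initiating codon is translated as methionine. The sketch below is a minimal way to sanity-check a sequence pasted from this record. It assumes the spaces are only 10-base display blocks, and the helper names and the chosen set of start codons are illustrative assumptions rather than anything defined by the database.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
# Assumption: a common subset of the start codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Drop the whitespace that splits the sequence into display blocks."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def looks_like_complete_cds(seq: str) -> bool:
    """True if length is a multiple of 3 and it has a start and a stop codon."""
    s = clean(seq)
    return len(s) % 3 == 0 and s[:3] in START_CODONS and s[-3:] in STOP_CODONS


# First two blocks of the gene sequence in this record.
fragment = "GTGCTGGTCC TCGGCGTTGA"
print(gc_fraction(fragment))  # 0.65
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field, and `looks_like_complete_cds` confirms the GTG start and TAG stop codon.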