Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1440 |
Symbol | |
ID | 5054656 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1296515 |
End bp | 1297513 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 640468981 |
Product | metalloendopeptidase glycoprotease family |
Protein accession | YP_001153650 |
Protein GI | 145591648 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0533] Metal-dependent proteases with possible chaperone activity |
TIGRFAM ID | [TIGR00329] metallohydrolase, glycoprotease/Kae1 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0361472 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0191854 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCTCGTTC TCGGGGTTGA GTCCACGGCG CATACCATCA GCTTGGGATT AGTGAAAGAT GGAGATGTCT TAGGACAAGT TGGGAAGACG TACGTGCCTC CGTCTGGGTT GGGGATTCAC CCCCGGGAGG CGGCGGACCA CCATTCCCAG ATGGCGCCGC AACTTCTCAG CCACTTGCTT TATAGGCACG GCGTAAGGCT CTCCGATGTC GACGTCGTTG CCTATGCGGC TGGGCCTGGG CTGGGCCCAG CGTTGAGGGT TGGCGCGGTG TTGGCAAGGG CTATTGCCAT AAAGCTCGGC GTGCCTATTG TGCCGGTACA CCACGGAATT GCCCACATTG AAATTGCAAG ATATGCGACT AAGTCGTGCG ATCCTCTCGT AGTGCTGATA TCTGGGGGGC ATACCGTAAT TGCCGGCTAC TCCGACAGGC GGTACAGAAT TTTTGGCGAG ACTCTTGATG TGGCTATTGG CAACGCCATT GACATGTTTG CCAGAGAGGC GGGACTGGGC TTCCCCGGAG TGCCCGCCGT GGAGAGGTGC GGAGAATCTG CAGATAGGCT TGTGGAGTTC CCAATGCCCA TTGTGGGACA GGATATGTCA TATGCCGGGC TGACTACCTA CGCGTTGAAG TTACTCAAAG AAGGAGTTCC TCTTTCTGTG ATCTGCAAAT CGCTAGTGGA GGCAGCCTAC TACATGTTGG CAGAGGTCAC CGAGCGGGCG CTTGCGTTTA CCAGGAAGAG CGAGTTGGTG GTGGCAGGAG GCGTGGCGAG GAGTAGGAGG CTAAGGGAAA TTCTCAGCCA AGTAGGGGCC TATCACGGGG CCGAGGTAAA GGTAGTACCT GACGAATATG CCGGCGATAA CGGGGCTATG ATAGCCCTAA CTGGTTATTA CGCATACAAG CGCGGCGTCT ACACAACGCC GGAGGAGAGC TTCGTGAGGC AGAGGTGGCG CCTCGACGCA GTGGATGTAC CGTGGTTCTG GGATCTGTGC AATAGATAA
|
Protein sequence | MLVLGVESTA HTISLGLVKD GDVLGQVGKT YVPPSGLGIH PREAADHHSQ MAPQLLSHLL YRHGVRLSDV DVVAYAAGPG LGPALRVGAV LARAIAIKLG VPIVPVHHGI AHIEIARYAT KSCDPLVVLI SGGHTVIAGY SDRRYRIFGE TLDVAIGNAI DMFAREAGLG FPGVPAVERC GESADRLVEF PMPIVGQDMS YAGLTTYALK LLKEGVPLSV ICKSLVEAAY YMLAEVTERA LAFTRKSELV VAGGVARSRR LREILSQVGA YHGAEVKVVP DEYAGDNGAM IALTGYYAYK RGVYTTPEES FVRQRWRLDA VDVPWFWDLC NR
|
| |