Gene Pars_1440 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1440 
Symbol 
ID5054656 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1296515 
End bp1297513 
Gene Length999 bp 
Protein Length332 aa 
Translation table11 
GC content57% 
IMG OID640468981 
Productmetalloendopeptidase glycoprotease family 
Protein accessionYP_001153650 
Protein GI145591648 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0533] Metal-dependent proteases with possible chaperone activity 
TIGRFAM ID[TIGR00329] metallohydrolase, glycoprotease/Kae1 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0361472 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value0.0191854 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGTTC TCGGGGTTGA GTCCACGGCG CATACCATCA GCTTGGGATT AGTGAAAGAT 
GGAGATGTCT TAGGACAAGT TGGGAAGACG TACGTGCCTC CGTCTGGGTT GGGGATTCAC
CCCCGGGAGG CGGCGGACCA CCATTCCCAG ATGGCGCCGC AACTTCTCAG CCACTTGCTT
TATAGGCACG GCGTAAGGCT CTCCGATGTC GACGTCGTTG CCTATGCGGC TGGGCCTGGG
CTGGGCCCAG CGTTGAGGGT TGGCGCGGTG TTGGCAAGGG CTATTGCCAT AAAGCTCGGC
GTGCCTATTG TGCCGGTACA CCACGGAATT GCCCACATTG AAATTGCAAG ATATGCGACT
AAGTCGTGCG ATCCTCTCGT AGTGCTGATA TCTGGGGGGC ATACCGTAAT TGCCGGCTAC
TCCGACAGGC GGTACAGAAT TTTTGGCGAG ACTCTTGATG TGGCTATTGG CAACGCCATT
GACATGTTTG CCAGAGAGGC GGGACTGGGC TTCCCCGGAG TGCCCGCCGT GGAGAGGTGC
GGAGAATCTG CAGATAGGCT TGTGGAGTTC CCAATGCCCA TTGTGGGACA GGATATGTCA
TATGCCGGGC TGACTACCTA CGCGTTGAAG TTACTCAAAG AAGGAGTTCC TCTTTCTGTG
ATCTGCAAAT CGCTAGTGGA GGCAGCCTAC TACATGTTGG CAGAGGTCAC CGAGCGGGCG
CTTGCGTTTA CCAGGAAGAG CGAGTTGGTG GTGGCAGGAG GCGTGGCGAG GAGTAGGAGG
CTAAGGGAAA TTCTCAGCCA AGTAGGGGCC TATCACGGGG CCGAGGTAAA GGTAGTACCT
GACGAATATG CCGGCGATAA CGGGGCTATG ATAGCCCTAA CTGGTTATTA CGCATACAAG
CGCGGCGTCT ACACAACGCC GGAGGAGAGC TTCGTGAGGC AGAGGTGGCG CCTCGACGCA
GTGGATGTAC CGTGGTTCTG GGATCTGTGC AATAGATAA
 
Protein sequence
MLVLGVESTA HTISLGLVKD GDVLGQVGKT YVPPSGLGIH PREAADHHSQ MAPQLLSHLL 
YRHGVRLSDV DVVAYAAGPG LGPALRVGAV LARAIAIKLG VPIVPVHHGI AHIEIARYAT
KSCDPLVVLI SGGHTVIAGY SDRRYRIFGE TLDVAIGNAI DMFAREAGLG FPGVPAVERC
GESADRLVEF PMPIVGQDMS YAGLTTYALK LLKEGVPLSV ICKSLVEAAY YMLAEVTERA
LAFTRKSELV VAGGVARSRR LREILSQVGA YHGAEVKVVP DEYAGDNGAM IALTGYYAYK
RGVYTTPEES FVRQRWRLDA VDVPWFWDLC NR