Gene Pars_1861 details

Gene Information

Locus tag: Pars_1861
Symbol:
ID: 5055996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1664300
End bp: 1665346
Gene Length: 1047 bp
Protein Length: 348 aa
Translation table: 11
GC content: 54%
IMG OID: 640469407
Product: peptidase M48, Ste24p
Protein accession: YP_001154064
Protein GI: 145592062
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0501] Zn-dependent protease with chaperone function
TIGRFAM ID:
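
The start and end coordinates, the gene length, and the protein length listed above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the gene length includes one stop codon (the record does not state either convention). The function names are hypothetical and chosen only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(1664300, 1665346)
print(length, protein_length(length))
```

Under these assumptions the sketch reproduces the listed values of 1047 bp and 348 aa.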


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGCCGG TGTTCTTAGA CCCTGTGGCA ATGGGCTTGT ACATCTTAGG ATACATCATC 
ATGCTAGCAG TGGCGGCTAC TGTGGCGCCC AAAGTAGCTA GTTCCGTGTC TGGGCGCTTC
ACGCTCTACG GCGCTATGGC GTTGACAGCT GTCCTAATCG TGTTGACGAC AGCCTTTGTT
ATCTACCTAA TAGCCGTGGT AGCGGCCCCG TCGGTGGCGG GGTATGGGTG GGGGTTCTTC
GCAGGGCTTA TCTTCTTCGT CGTTTTTATG AATCTGCTTA CCTACCTCGC GTCGCCGTTT
TTAATAAATG CATCATATGG CGCCAGGCCA GACCCCCGTC TGCAACAGAT AGTGGACGAG
GTGGCAGCCA GGCTGGGTGC GCCGTTCAAA ATCAAGGCGG TGGTGGTCGA CGGGCCTCCT
AACGCCTTCG CCTACGGCAA TATGCTCTCA GGTAGATACG TGGCAGTTAC GAGTTCAATG
CTGGCGATGA CTGAAAGGAG GGAGCTGGAG GCCGTGATAG GGCACGAGAT TGGGCACCAT
CTGCATAGAG ACAACGCGTT AATGCTACTC TTCGGCGTAC TCCCGTCAAT TCTCTACTAC
TTGGGCGTCA CTTCCGTACG TATGGCGATG GCGTCTTCCG GCAACAGGAA CAACAGCCCG
GCGCTTCTGG CCGCAGTGGG CGTGCTCGCC GTAATAGTAT CCTTCCTAGT CCAGCTTCTG
GTATTGGCGT TCAGCAGACT CAGGGAGTAC TACGCCGATA CAGAGGGTGC AAAGGCCGCC
GGCAAGGAGG CCATGCAATT CGCGTTGGCT AAGATTCACA AATTCTACTT CTCAAACCCT
GAGGCCCACG AGGTTGTCAG CAACGACAAG TTCAGGGCTC TGTTTATATA TGCGCTTGTC
CAAGCAGTGG CTAATCCCTT CGTGTCGGTT ACCAAGAGCG ATGTGGAGCA GATAAAGCGC
TCGGGCTATT CGGTGTTTCA AGAGATATTC TCGACACATC CGCCCATACC GAAGCGGTTG
AAATTCCTCG ACGAGCTACC TTATTAA
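
The GC content reported in the gene information can be recomputed from the nucleotide sequence. The following is a minimal Python sketch, assuming GC content is defined as the percentage of G and C bases over the full gene length (the record does not say how its figure was derived or rounded). The helper name `gc_percent` is hypothetical, and the sample input is only the first line of the sequence above, not the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First line of the gene sequence above, used here only as sample input.
sample = "ATGTTGCCGG TGTTCTTAGA CCCTGTGGCA ATGGGCTTGT ACATCTTAGG ATACATCATC"
print(f"{gc_percent(sample):.1f}% GC over {len(''.join(sample.split()))} bp")
```

Passing all 18 sequence lines as a single string gives the value to compare against the listed GC content.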
 
Protein sequence
MLPVFLDPVA MGLYILGYII MLAVAATVAP KVASSVSGRF TLYGAMALTA VLIVLTTAFV 
IYLIAVVAAP SVAGYGWGFF AGLIFFVVFM NLLTYLASPF LINASYGARP DPRLQQIVDE
VAARLGAPFK IKAVVVDGPP NAFAYGNMLS GRYVAVTSSM LAMTERRELE AVIGHEIGHH
LHRDNALMLL FGVLPSILYY LGVTSVRMAM ASSGNRNNSP ALLAAVGVLA VIVSFLVQLL
VLAFSRLREY YADTEGAKAA GKEAMQFALA KIHKFYFSNP EAHEVVSNDK FRALFIYALV
QAVANPFVSV TKSDVEQIKR SGYSVFQEIF STHPPIPKRL KFLDELPY
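
The protein sequence corresponds to a translation of the gene sequence under translation table 11. The following is a minimal Python sketch of that translation, not the method used to produce this record. It relies on the fact that table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons. Alternative start codons and ambiguous bases are not handled, and the names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (third base varies fastest).
# Table 11 shares these assignments with the standard genetic code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace is ignored; a codon containing a non-ACGT character
    raises KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last lines of the gene sequence above, as sample input.
first_line = "ATGTTGCCGG TGTTCTTAGA CCCTGTGGCA ATGGGCTTGT ACATCTTAGG ATACATCATC"
last_line = "AAATTCCTCG ACGAGCTACC TTATTAA"
print(translate(first_line))
print(translate(last_line))
```

The two sample lines translate to the first 20 and last 8 residues of the protein sequence above (both lines happen to begin in frame).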