Gene Pars_0841 details

Gene Information

Locus tag: Pars_0841
Symbol:
ID: 5054868
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 746780
End bp: 747964
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 64%
IMG OID: 640468402
Product: hypothetical protein
Protein accession: YP_001153079
Protein GI: 145591077
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1030] Membrane-bound serine protease (ClpP class)
TIGRFAM ID:
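
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon. The helper names `gene_length` and `protein_length` are hypothetical and not part of any database API.

```python
# Values copied from the record above; the helper names are hypothetical.
START_BP = 746780
END_BP = 747964


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1185
print(protein_length(length_bp))  # 394
```

Both results agree with the Gene Length and Protein Length fields. The calculation does not depend on the strand, which the record leaves blank.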


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 27
Fosmid unclonability p-value: 0.391136
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCTGGGT GGAGGGCTGT TGTACTTGCA CTGCTCGCCT CCGCCTTGGC GCTGGCCGCA 
TCCAACGCCG TGGCCTTGTA CCTCGATGGC ACCATTGACG GAACCGCAGT TACCCTAGCC
CGAGCGGCGC TGGCGGATGC CCAAAGCCGC GGGTTACCTC TCGTGGTGGT TATCAATACC
TACGGGGGTT TCCTTGCCCC AATGGATCAA ATTGTGGAGC TGTTCTTAAA CGCGGGGGTT
CCGGTATATG CCTACGTGCC GGAGGGGGGC AAGGCCGTTT CGGCGGGGGC CTTCGTGGCC
ATGGCGGCTA GGAGGATCTA CATGGCGCCC ACCGCCGAGA TAGGCGCCGC CGAGCCTAGG
CCGCCTGACC CCAAGGTGGT GAACTACGCC GCGGCCCGGA TGAGGGCGCT GGCGTCTGCC
AAGTGGAACG ACTCCAGGGT GGACATAGCC GAGTCTTTTG TCAGGGAGAA CAAGGTGCTC
ACAGGAGCGG AGGCCGTTAA GCTGGGAATC GCTGAGCCAC TGCCGTCGGG AGGCTGGGTT
TTTGTCGCCG AGTACCGGAG GGACCCCCTC TCCAGCCTCC TAAACGCCTT GTCTGATCCC
GCGGTGATAT CGCTACTGCT CCTGCTGGGC GTCGTGTTCA TTGGCTACGA GCTCCTAGCC
GGCGGCTTCC AAGGCGTCGG CGTAGTCGGG GGGCTTTTAC TAGTGCTCGC CCTCTACCTC
TTGGGCCAGC TGGGCTCTGA GTGGCTCTGG GCGGCGCTGG CCATCGGCGG GGCTACGCTC
ATCGCCGCGG AGATCTTCGC CGGCCACGGC GCCTTCGCCG CCACTGGTCT GGCCCTCTTC
GGCCTCTCCC TATACTTCGC AAGCGTCAGC CAGCCCTACT ACCAGCTCCA AGGCGCCTCC
TACGCCCTGT CCTCCTTGGC CGCGCTGGGC GCCTTGGCCG TGGCCTACCT GGGCTATAAG
GTGAGGCAGG CGATGCGGAG AAAGCCGCTT AACTACAAGG CACAGCTGGT GGGGGCCTTG
GGGGTCGCCA AGACTGAGAT AAGGCCGGGT CAGCCGGGGG TGGCGTACGT GGCGGAGGAG
GAGTGGACAG CTGTCTCAGA CGAGGAGATA AAGCCAGGGG AGAGGGTGGT GGTGGAGGGC
GTTGAGGGCC TCACCTTAAT GGTTAAAAAG GCCAAGTCTG CATAG
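
The GC content field in this record can be checked against the gene sequence. A minimal sketch follows, assuming GC content means the percentage of G and C bases among all bases. The function name `gc_percent` is hypothetical. Whitespace is stripped first because the sequence is printed in blocks of ten.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases; spaces and line breaks are ignored."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First ten-base block of the gene sequence above.
print(gc_percent("GTGGCTGGGT"))  # 70.0
```

Passing the full 1185 bp sequence to the same function gives the whole-gene figure to compare with the value in the record.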
 
Protein sequence
MAGWRAVVLA LLASALALAA SNAVALYLDG TIDGTAVTLA RAALADAQSR GLPLVVVINT 
YGGFLAPMDQ IVELFLNAGV PVYAYVPEGG KAVSAGAFVA MAARRIYMAP TAEIGAAEPR
PPDPKVVNYA AARMRALASA KWNDSRVDIA ESFVRENKVL TGAEAVKLGI AEPLPSGGWV
FVAEYRRDPL SSLLNALSDP AVISLLLLLG VVFIGYELLA GGFQGVGVVG GLLLVLALYL
LGQLGSEWLW AALAIGGATL IAAEIFAGHG AFAATGLALF GLSLYFASVS QPYYQLQGAS
YALSSLAALG ALAVAYLGYK VRQAMRRKPL NYKAQLVGAL GVAKTEIRPG QPGVAYVAEE
EWTAVSDEEI KPGERVVVEG VEGLTLMVKK AKSA
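
The protein sequence can be reproduced from the gene sequence using translation table 11. The sketch below is a minimal standard-library translator, not the database's own method. It assumes that the sequence is given in coding orientation, that the first codon is a valid start codon (here GTG) and is therefore read as M, and that translation stops at the first stop codon. Table 11 assigns the same amino acids to codons as the standard code, so the standard table layout is used. The name `translate_cds` is hypothetical.

```python
# Standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(seq: str) -> str:
    """Translate a CDS, reading the first codon as M and stopping at a stop codon."""
    dna = "".join(seq.split()).upper()
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        # Assumption: the first codon is a start codon, so it encodes M.
        residues.append("M" if pos == 0 else amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGGCTGGGT GGAGGGCTGT TGTACTTGCA"))  # MAGWRAVVLA
```

The output matches the first ten residues of the protein sequence listed above.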