Gene Pars_0014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0014 
Symbol 
ID5055744 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp9814 
End bp11097 
Gene Length1284 bp 
Protein Length427 aa 
Translation table11 
GC content50% 
IMG OID640467594 
Producthypothetical protein 
Protein accessionYP_001152283 
Protein GI145590281 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1030] Membrane-bound serine protease (ClpP class) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAATAG CGATTCTACT GTTGTTAGTA GCCTTTTCCT ATGCCTATGT GGCTACGTCA 
GTTTATGTAG TTGAAATTAA CGGGGTTGTA GGCCCCTATA CCTACTCGCA GATACAGCGA
GCTATATCCC TCGCTGAGCA GAACAACGGG CTTGTGCTCA TATTGTTATC AACTCCTGGA
GGCTTGGCAG ACCCCACTCT CCAGATAATA AGGGAGATTG GCAACTCCCC CGTGCCTGTT
GTGGGATATG TCTACCCGGA CTACAGCTAT GCCTGGTCTG CGGGGACTTA TATACTGCTA
TCGACCCACA TCGCCGCCAT GGCGCCGCAT ACTGTAATAG GCTCTTGTCA GCCGATATCT
GGTGGAACCC CGGTAAATGA GTCTAAGATT CTAAACGCAT TGATAGGATA TCTCGAAACT
GTGTCTAAGT CGTATGGCAG GAACGGTACT TTTGCCCGTC TCTGTATTAC CCAGAATATT
AACCTAGACG CCGAGACGGC GTTGAAATAT AGGGTAATAG ACGTGGTGGC CACCGGTGTA
GATGACCTTT TGAAGAAGAT AAACGGCATG ACGGTGTTGT TGCGAAACCA GCAAACAGAG
CTAGTGGTAG AGAGCCCGGT GATAAGGCGT GTCGAACCTT CACTTACTGA AACTTTACAG
ATGTGGCTAA GCGACCCAGT GATGTCCAGC GTCTTGTCTC TCTTGGCGTT TTTGTTACTG
CTTGCGGCGT TTATCACAGG CCACCCCGCC GCCGCGGTGG CGGCCATAGT ATTGCTGGTT
ATATCTATGT TCTCGATTTT GCCAACGGCG TGGTTGGGCC TCGCGCTTAT AATTATGGGC
GCCGTGTTGA TACTGGCAGA GATATTAATG GGCATGGCGG CACACGGCGC CGTGGCCGGC
GTAGGCGCCG TCCTGCTAGT AGTGGGATTC TTATCCGCCT ATCCTGCTAA CGTTTTCAGT
GGAGAGCTTA TCCACATCAG GGATTGGTGG CTCATCCAGC TTGGCCTATA TGTAAACATA
GCAATACTTC TAGGATTTCT CGTCTTTGTC GTGTACAAGG CAGTTATTAT CCATAAACAG
AGGCCGCCCT CTGAAATTTT GACAACTCTC AAGGGGGCAG AGGGGGTGGC AGTGGACGAT
ATAGGGCCTG GATCTCCCGG CTTTGTGATA GTCTTCGGAG AATACTGGAG GGCTGTTTCT
GATACACCGG TAAAGAAGGG TTGCAGAATA CGTGTGGTGG AGATTGCTGG GGAGATCTTG
AAAATAGAGC CGGTTCAGTG TTAG
 
Protein sequence
MRIAILLLLV AFSYAYVATS VYVVEINGVV GPYTYSQIQR AISLAEQNNG LVLILLSTPG 
GLADPTLQII REIGNSPVPV VGYVYPDYSY AWSAGTYILL STHIAAMAPH TVIGSCQPIS
GGTPVNESKI LNALIGYLET VSKSYGRNGT FARLCITQNI NLDAETALKY RVIDVVATGV
DDLLKKINGM TVLLRNQQTE LVVESPVIRR VEPSLTETLQ MWLSDPVMSS VLSLLAFLLL
LAAFITGHPA AAVAAIVLLV ISMFSILPTA WLGLALIIMG AVLILAEILM GMAAHGAVAG
VGAVLLVVGF LSAYPANVFS GELIHIRDWW LIQLGLYVNI AILLGFLVFV VYKAVIIHKQ
RPPSEILTTL KGAEGVAVDD IGPGSPGFVI VFGEYWRAVS DTPVKKGCRI RVVEIAGEIL
KIEPVQC