Gene Pars_1395 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1395 
Symbol 
ID5055958 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1258452 
End bp1259546 
Gene Length1095 bp 
Protein Length364 aa 
Translation table11 
GC content61% 
IMG OID640468938 
Producthypothetical protein 
Protein accessionYP_001153607 
Protein GI145591605 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1030] Membrane-bound serine protease (ClpP class) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.51965 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value0.883409 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGGTTTC TACTTCTTTT TGCGGCCTTA GCCGCGGCGG TAGCTGGCTA CAACGTATCT 
ACTGTGTACG TCGTCGATGT ACGCGGCGTC GTGGGGCCTC ACACCTACTG GCAGGTGGCT
AAGGCGGTTG AGGCGGCGGA GAGGGGCGGC GGCGCTGTGT TGTTGCTCTT GTCCACCCCT
GGTGGGCTGG CGGCGCCTGC TAAGCGCATA ATGGGCCTAG TCCTACATTC AAAGGTCCCG
GTGCTGGGCT ACGTATATGG CGAAGAGGCG GCGTCTGCGG GGACCTACAT CCTAATGGCT
ACCCACATTG CCGGGATGGC GCCCCACTCA AAAATAGGCG CTTGCCAGCC GGTGTTGTTG
GTATTCCTGG TGGAGGATCC TGGGGTTATC GCGCAGCACT TGAGCATCTT GGCGGAGGCG
ATGAGCAGAA GGGGGCGAAA CGTGGAGTTT GCGGAGAGGT GTGTCCGGTC TAAGGAATAC
CTAATCGGCG CCGAGGAGGC CCAGAAGATG GGGGTAGTGG AGGTGGTGGC GGGGAACTTC
GTCGAGTTTG TGAAGAGGGC TAACGGAACC GCGGTGAGCC TCGACGGGGT TGAGGACAAG
GTGTTTTTCC ACTCTCCAAA ATACGTCCTG GTGGCCCCAG GCCCCGTTGA GCTTTTCCAG
TCTTGGCATC TGCCCGAGTC GCCCGCCGCC TTGCTTTACT TCTCGACACT CCCCCTTCTG
CTACACGTTG CGCTGTTCCT CGCCGCGATG TACGCAGTAC TTCTCTACGC CAAGATGAGG
GGCTGGGCCG CAGTGGCCAA CCTGTCGGCG TTTGTCCTCG CGTTGTATGT CTCCCTTGCA
ACGTTGCCTC CGCCTTGGCT CTTGGCCTCC GTGGCAGGTG CCGTGGCTAT ACTCGCCGAT
CTTTTTATAA GTAGGCACAC GCGAGGCTTC GTTGCCTTTG CGGCGGCGTT TGTCCCCCAG
ACGGCGGTGT CCGCCTTCTA CCAAGAAGGC GCGGCGGCGG TGGCGTGGGC AATTGCCCTG
ATAATCTCAG CGTCAGCGGC CGGGGCTGTT ATTTACATAT CACGCAGAAA GAGGCCCCAG
GTGCCCTCCT GGTAG
 
Protein sequence
MRFLLLFAAL AAAVAGYNVS TVYVVDVRGV VGPHTYWQVA KAVEAAERGG GAVLLLLSTP 
GGLAAPAKRI MGLVLHSKVP VLGYVYGEEA ASAGTYILMA THIAGMAPHS KIGACQPVLL
VFLVEDPGVI AQHLSILAEA MSRRGRNVEF AERCVRSKEY LIGAEEAQKM GVVEVVAGNF
VEFVKRANGT AVSLDGVEDK VFFHSPKYVL VAPGPVELFQ SWHLPESPAA LLYFSTLPLL
LHVALFLAAM YAVLLYAKMR GWAAVANLSA FVLALYVSLA TLPPPWLLAS VAGAVAILAD
LFISRHTRGF VAFAAAFVPQ TAVSAFYQEG AAAVAWAIAL IISASAAGAV IYISRRKRPQ
VPSW