Gene Pars_2135 details


Gene Information

Locus tag: Pars_2135
Symbol:
ID: 5055702
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1908760
End bp: 1910412
Gene Length: 1653 bp
Protein Length: 550 aa
Translation table: 11
GC content: 55%
IMG OID: 640469687
Product: thermosome
Protein accession: YP_001154333
Protein GI: 145592331
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0459] Chaperonin GroEL (HSP60 family)
TIGRFAM ID: [TIGR02339] thermosome, various subunits, archaeal
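
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below shows that arithmetic. It assumes 1-based inclusive coordinates, an unspliced CDS, and a gene length that includes the stop codon. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1908760, 1910412)
print(length)                  # 1653
print(protein_length(length))  # 550
```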


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTCAGG CAGTGCTAAC CCAGATCGGT GGCGTTCCCG TGTTGGTGCT TAAAGAGGGG 
ACGCAGAGGG CGTTTGGTAA AGAGGCTCTT AGGCTTAACA TAATGATTGC CCGTGCTATT
GCTGAGGTAA TGCGCACGAC GCTTGGGCCT AAGGGTATGG ACAAGATGCT TATTGACAGC
TTGGGCGACA TCACTATTAC AAACGATGGT GCGACTATTT TGGACGAGAT GGACGTGCAG
CACCCCATTG CGAAGCTACT TGTCGAGATT AGCAAGTCTC AGGAGGAGGA GGCTGGAGAT
GGCACGACCA CCGCTGTTGT TCTTGCTGGA GCTTTGCTTG AGGAGGCTGA GAAGTTACTG
GAGAAGAATA TTCACCCGAC GGTGATTGTA AGCGGTTTTA AGAAGGCGCT TGACGTCGCT
ACTGAGCACC TCCGTAAGGT TGCTGTTCCC GTAAATAGGA GCGATGTGGA TACCCTTAAG
AAGATTGCCA TGACGTCGAT GGGCGGTAAG ATTTCTGAGA CTGTTAAGGA CTACTTCGCC
GACTTGGCTG TAAAGGCGGT GTTGCAGGTG GCTGAGCAGA GAGATGGGAA GTGGTATGTA
GACCTAGACA ATATACAGAT TGTTAAGAAA CACGGCGGCT CTCTGCTTGA CACCCAGCTT
GTCTACGGCA TTGTTGTGGA CAAGGAGGTG GTCCACGCGG CGATGCCTAA GCGTGTCATA
AATGCTAAGA TCGCACTCCT GGACGCCCCG CTGGAGGTTG AGAAGCCCGA AATCGACGCC
GAGATCCGCA TAAACGACCC AATGCAGATG AAGGCGTTCC TAGAGGAGGA GGAGAAGATC
TTGAAGAGTT ATGTAGATAA GCTGAAGTCT CTTGGAGTGA CTGCCCTCTT CACCACCAAG
GGGATTGACG ACATTGCGCA GTACTACCTT GCCAAGGCCG GCATCCTAGC AGTAAGGCGT
GTCAAGAGGT CAGATATCGA GAAGCTGGTG AGGGCTACCG GCGCGAGGCT GGTGACGTCT
CTTGAAGACT TAACCGAGGC AGACCTCGGC TTCGCTGGCC TCGTCGAGGA GAGGAGGGTG
GGCGACGAGA AGATGGTGTT TGTAGAGCAG TGCAAGAACC CGAGGGCCGT GTCAATACTG
GTGCGTGGCG GCTTCGAGCG GCTTGTGGAC GAGGCTGAGA GGAACCTGGA CGACGCCCTT
AGCGTGGTTG CCGACGTGGT GGAGGAGCCG TACATACTGC CCGCGGGCGG TGCCGCCGAG
ATAGAGGCCG CCAAGTCTGT GAGGGCCTTC GCGCCTAAGG TGGGCGGGAG GGAGCAGTAC
GCCGTTGAGG CATTCGCCAG GGCTCTGGAG GCTATACCGA AGGCGCTGGC CGAGAACGCC
GGTCTTGACC CAATCGACAT CGTGACAGAG CTGACACACA AACACGAGCT AGCAGACGGC
TGGAAATACG GCCTAGACGT GTACCAAGGC AAGGTTGTGG ACATGCTAGC CCTCGGCCTA
ATTGAGCCGC TTTCGGTGAA GATAAACGCG CTGAAAGTGG CCGTGGAGGC CGCCTCAGCG
ATTCTAAGAA TAGATGAGAT AATCGCTGCC AGTAAATTAG AGAAGGAGGA GAAGGGAGAG
AAGAAGGAGG AAAAGAAGGA AGAGTTCGAC TAA
 
Protein sequence
MSQAVLTQIG GVPVLVLKEG TQRAFGKEAL RLNIMIARAI AEVMRTTLGP KGMDKMLIDS 
LGDITITNDG ATILDEMDVQ HPIAKLLVEI SKSQEEEAGD GTTTAVVLAG ALLEEAEKLL
EKNIHPTVIV SGFKKALDVA TEHLRKVAVP VNRSDVDTLK KIAMTSMGGK ISETVKDYFA
DLAVKAVLQV AEQRDGKWYV DLDNIQIVKK HGGSLLDTQL VYGIVVDKEV VHAAMPKRVI
NAKIALLDAP LEVEKPEIDA EIRINDPMQM KAFLEEEEKI LKSYVDKLKS LGVTALFTTK
GIDDIAQYYL AKAGILAVRR VKRSDIEKLV RATGARLVTS LEDLTEADLG FAGLVEERRV
GDEKMVFVEQ CKNPRAVSIL VRGGFERLVD EAERNLDDAL SVVADVVEEP YILPAGGAAE
IEAAKSVRAF APKVGGREQY AVEAFARALE AIPKALAENA GLDPIDIVTE LTHKHELADG
WKYGLDVYQG KVVDMLALGL IEPLSVKINA LKVAVEAASA ILRIDEIIAA SKLEKEEKGE
KKEEKKEEFD
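
As a rough consistency check, the gene sequence above can be translated and its GC content recomputed. The sketch below is a minimal, library-free example. It assumes the sequence blocks are pasted as a single string (whitespace is stripped) and uses the codon assignments shared by NCBI translation tables 1 and 11. The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; third position varies fastest).
# Tables 1 and 11 share these assignments and differ only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon.

    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First two blocks of the gene sequence; a trailing partial codon is ignored.
print(translate("ATGAGTCAGG CAGTGCTAAC"))  # MSQAVL
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_content` should come out close to the reported 55%.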