Gene Information |
Locus tag | Pars_2135 |
Symbol | |
ID | 5055702 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1908760 |
End bp | 1910412 |
Gene Length | 1653 bp |
Protein Length | 550 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640469687 |
Product | thermosome |
Protein accession | YP_001154333 |
Protein GI | 145592331 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0459] Chaperonin GroEL (HSP60 family) |
TIGRFAM ID | [TIGR02339] thermosome, various subunits, archaeal |
|
|
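The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1908760
end_bp = 1910412
gene_length_bp = 1653
protein_length_aa = 550

# Assumption: coordinates are inclusive at both ends, so add 1 to the difference.
span = end_bp - start_bp + 1

# An unspliced CDS should be a whole number of codons.
codons, remainder = divmod(span, 3)

# Assumption: the last codon is a stop codon and is not counted in the protein.
expected_protein_length = codons - 1

print(span, remainder, expected_protein_length)  # 1653 0 550
```

Under those assumptions the span matches the listed gene length, and the codon count minus the stop matches the listed protein length.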
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 41 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTCAGG CAGTGCTAAC CCAGATCGGT GGCGTTCCCG TGTTGGTGCT TAAAGAGGGG ACGCAGAGGG CGTTTGGTAA AGAGGCTCTT AGGCTTAACA TAATGATTGC CCGTGCTATT GCTGAGGTAA TGCGCACGAC GCTTGGGCCT AAGGGTATGG ACAAGATGCT TATTGACAGC TTGGGCGACA TCACTATTAC AAACGATGGT GCGACTATTT TGGACGAGAT GGACGTGCAG CACCCCATTG CGAAGCTACT TGTCGAGATT AGCAAGTCTC AGGAGGAGGA GGCTGGAGAT GGCACGACCA CCGCTGTTGT TCTTGCTGGA GCTTTGCTTG AGGAGGCTGA GAAGTTACTG GAGAAGAATA TTCACCCGAC GGTGATTGTA AGCGGTTTTA AGAAGGCGCT TGACGTCGCT ACTGAGCACC TCCGTAAGGT TGCTGTTCCC GTAAATAGGA GCGATGTGGA TACCCTTAAG AAGATTGCCA TGACGTCGAT GGGCGGTAAG ATTTCTGAGA CTGTTAAGGA CTACTTCGCC GACTTGGCTG TAAAGGCGGT GTTGCAGGTG GCTGAGCAGA GAGATGGGAA GTGGTATGTA GACCTAGACA ATATACAGAT TGTTAAGAAA CACGGCGGCT CTCTGCTTGA CACCCAGCTT GTCTACGGCA TTGTTGTGGA CAAGGAGGTG GTCCACGCGG CGATGCCTAA GCGTGTCATA AATGCTAAGA TCGCACTCCT GGACGCCCCG CTGGAGGTTG AGAAGCCCGA AATCGACGCC GAGATCCGCA TAAACGACCC AATGCAGATG AAGGCGTTCC TAGAGGAGGA GGAGAAGATC TTGAAGAGTT ATGTAGATAA GCTGAAGTCT CTTGGAGTGA CTGCCCTCTT CACCACCAAG GGGATTGACG ACATTGCGCA GTACTACCTT GCCAAGGCCG GCATCCTAGC AGTAAGGCGT GTCAAGAGGT CAGATATCGA GAAGCTGGTG AGGGCTACCG GCGCGAGGCT GGTGACGTCT CTTGAAGACT TAACCGAGGC AGACCTCGGC TTCGCTGGCC TCGTCGAGGA GAGGAGGGTG GGCGACGAGA AGATGGTGTT TGTAGAGCAG TGCAAGAACC CGAGGGCCGT GTCAATACTG GTGCGTGGCG GCTTCGAGCG GCTTGTGGAC GAGGCTGAGA GGAACCTGGA CGACGCCCTT AGCGTGGTTG CCGACGTGGT GGAGGAGCCG TACATACTGC CCGCGGGCGG TGCCGCCGAG ATAGAGGCCG CCAAGTCTGT GAGGGCCTTC GCGCCTAAGG TGGGCGGGAG GGAGCAGTAC GCCGTTGAGG CATTCGCCAG GGCTCTGGAG GCTATACCGA AGGCGCTGGC CGAGAACGCC GGTCTTGACC CAATCGACAT CGTGACAGAG CTGACACACA AACACGAGCT AGCAGACGGC TGGAAATACG GCCTAGACGT GTACCAAGGC AAGGTTGTGG ACATGCTAGC CCTCGGCCTA ATTGAGCCGC TTTCGGTGAA GATAAACGCG CTGAAAGTGG CCGTGGAGGC CGCCTCAGCG ATTCTAAGAA TAGATGAGAT AATCGCTGCC AGTAAATTAG AGAAGGAGGA GAAGGGAGAG AAGAAGGAGG AAAAGAAGGA AGAGTTCGAC TAA
|
Protein sequence | MSQAVLTQIG GVPVLVLKEG TQRAFGKEAL RLNIMIARAI AEVMRTTLGP KGMDKMLIDS LGDITITNDG ATILDEMDVQ HPIAKLLVEI SKSQEEEAGD GTTTAVVLAG ALLEEAEKLL EKNIHPTVIV SGFKKALDVA TEHLRKVAVP VNRSDVDTLK KIAMTSMGGK ISETVKDYFA DLAVKAVLQV AEQRDGKWYV DLDNIQIVKK HGGSLLDTQL VYGIVVDKEV VHAAMPKRVI NAKIALLDAP LEVEKPEIDA EIRINDPMQM KAFLEEEEKI LKSYVDKLKS LGVTALFTTK GIDDIAQYYL AKAGILAVRR VKRSDIEKLV RATGARLVTS LEDLTEADLG FAGLVEERRV GDEKMVFVEQ CKNPRAVSIL VRGGFERLVD EAERNLDDAL SVVADVVEEP YILPAGGAAE IEAAKSVRAF APKVGGREQY AVEAFARALE AIPKALAENA GLDPIDIVTE LTHKHELADG WKYGLDVYQG KVVDMLALGL IEPLSVKINA LKVAVEAASA ILRIDEIIAA SKLEKEEKGE KKEEKKEEFD
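The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the amino acid assignments of translation table 11, which are the same as those of the standard code (the two tables differ only in which start codons are allowed). The `translate` function is a hypothetical helper, not part of any library. It removes the spaces that separate the 10-base blocks and stops at the first stop codon.

```python
from itertools import product

# Codon table in NCBI order (T, C, A, G at each position).
# Assumption: table 11 uses the same amino acid assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(grouped_dna: str) -> str:
    """Translate space-grouped DNA, reading in frame from the first base."""
    dna = "".join(grouped_dna.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base blocks of the gene sequence above.
first_blocks = "ATGAGTCAGG CAGTGCTAAC CCAGATCGGT"
print(translate(first_blocks))  # MSQAVLTQIG
```

The result matches the first block of the protein sequence. Passing the full gene sequence string to the same function should reproduce the listed protein, provided the sequence is copied without alteration.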