Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2335 |
Symbol | |
ID | 5055968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 2089083 |
End bp | 2090003 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640469887 |
Product | protein of unknown function RIO1 |
Protein accession | YP_001154531 |
Protein GI | 145592529 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG0478] RIO-like serine/threonine protein kinase fused to N-terminal HTH domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.861409 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 0.0151879 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGAGAG ACGTTGTGGA GGGCTACGGC AAATTGACGA AGCGGGATTT GCGCGTACTG AGATTAATCG AGGTTGGCCA CCGCAGATAC GAGTATGTCC CCGAAGAGCT CGTCGTTGAT TGGGCCAGAT ACAGGAAGGA GGAGGTTGTT CAGAGCATAA AGCGTTTGCA CTACTACGGC TTCTTGAGGA GGAACGTAGC GCCGTACAGG GGGTGGAAGA TCACCACGCA TGGGTACGAC GTGTTGGCCC TACACACTCT CGTCCAGAGA CGCAAGATTC TCTCCTTATC GCCTACGCCA GTCGGAGTCG GTAAGGAGTC TGTGGTCTAC GCCGGTGTGA CGCCGTCAGG TTTTAAAGTT GCGCTTAAGT TTCACAGAGG GGGGGTCTCC GTCTTTAGGT ACGAGCGGCC TTTTCTCAGA AAAGTGTCTA AATACAGACA CCTGGCCGAC GTGTTTGAGA CAAGGCTTTC CGCCCTGGCC GAGTACTTTG CCCTAAGCAA GGTGTTCGAG GCCGGCGGGT GGGTCCCGGA GCCGATAGCT TACAACCGGC ATGTAGTTCT TATGAGCTAT GTAGAAGGGG TGGAGCTGTA CAGGTCGGCG GAGGAGGATA TGAGAAAAAT CGCAGATGAT GTAGTGCACA CTATCTCCAC GGCGTTGAGA ATCGGGATCA TACACGGCGA TCTCTCTCCG TACAACATAA TTGTGGGGGA CCGGGGATAC GTCATAGACT GGCCCCAATG GGTCCCCACA AGGCATCCAA AGGCAGAATA TTACCTAAGA CGCGACCTCG CCACAATTTC TGCTTTTTTC AAGAAGTGGG GGGTAGAAAT CCCAGTAGAA GAGCTGTTGC TAGCTGTGGG AGAGTCCGGC GACAGCGGCG AGCAGTTCCT CTCAGAAATT TATAAACAAG ACTTCCTATA G
|
Protein sequence | MLRDVVEGYG KLTKRDLRVL RLIEVGHRRY EYVPEELVVD WARYRKEEVV QSIKRLHYYG FLRRNVAPYR GWKITTHGYD VLALHTLVQR RKILSLSPTP VGVGKESVVY AGVTPSGFKV ALKFHRGGVS VFRYERPFLR KVSKYRHLAD VFETRLSALA EYFALSKVFE AGGWVPEPIA YNRHVVLMSY VEGVELYRSA EEDMRKIADD VVHTISTALR IGIIHGDLSP YNIIVGDRGY VIDWPQWVPT RHPKAEYYLR RDLATISAFF KKWGVEIPVE ELLLAVGESG DSGEQFLSEI YKQDFL
|
| |