Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1655 |
Symbol | |
ID | 5054846 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1493052 |
End bp | 1494131 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640469198 |
Product | THUMP domain-containing protein |
Protein accession | YP_001153860 |
Protein GI | 145591858 |
COG category | [R] General function prediction only |
COG ID | [COG1818] Predicted RNA-binding protein, contains THUMP domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.57853 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 0.517535 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACGTAA TAGTAAAGAC GAGGAGGGGG TTTGAGAAAA TCGCCGCGTC GCACATTGGC GAGGTCTTGG GACCAGGTGC TGAGGTCGAG CCGGCTCCGT TCGGCTACCT AGGCATAGTG TTTGTGAAGG CCCCGGGTTC GGACAAGTGG GCGCTTGCCC AGCTCATAGC GCAGAAGGTG CCAGAAGCCG ACCGCGTGTT GGTGGTGGAA AAGCTGGTGC CCGCCGACCC CGCCGAGATA GAAAAGGCGG CGGTTGAGGT GGCCAAGCGG TACATAAATC CAGACGACAC CTTCGCCATT AGGACTACCA GGAGGGGCTC CCATACCTTC TCCTCAATTG ACATAAACGT CAGAGTGGGC GCCGCGGTGA AGGAGGTCAC CGGCGCCAAT GTAGACCTTG AAGAGCCCAC AAAGCCTCTA TACGTGGAGA TTTTCCAGGA CACCGCGGCT GTGTGCATCC CAGCGACGGG GGAGTATAGA AAGCTCCGCC GCGACAAGCC ACTTGCCTTG GGGTATCTTA GAAAGGTGGC TCTGGGACAA TTCGTATACG AGGGCGACGA AGAGGCCGTG CGCAAAATGG GCGAGAGAAT CGGGCGGGCT GTCCAGACGT TTGAAGTAGG CGAGCTGGTA ATCCTCCTCC ACAAGCCTAT CCCCGCCCGC ACGTTGCGCC TCTTCGCAGA GGCTGTGGAA GAGGGCATAG AAAGCAGGTA CCAGATACAG ACGAGGAGCT ATGGCAGGCC GGTTTGGAAA GTCCCCGTAC ACGTCTACGA GCTGTACCAG TGGGTGAGAG ACAGAGCCGG CGAGCCCCTC ATAGTGACGG ACCCAAAAGG CGACTACGTA ACACACGCAA AGGAGAGGCT GGCGGAGCTT TTTAAAAGCG GCAGAGTAAA CGTGCTGATA GGGGCGAGGG AGGGCGTCCC CACAGGTGTT TTCAGATTCG CCTCATTGGT AATCGACTTG ATACCAGAGG TCACGATAGC GACCGATTTC GTGGTCCCCG CCCTGGCCAT CGGGCTGATC AGTGCGCTGG AGGAGGCAGG CACACTGCCC AGATACCTAG GCAAGCGGAA GAAAAAATAA
|
Protein sequence | MDVIVKTRRG FEKIAASHIG EVLGPGAEVE PAPFGYLGIV FVKAPGSDKW ALAQLIAQKV PEADRVLVVE KLVPADPAEI EKAAVEVAKR YINPDDTFAI RTTRRGSHTF SSIDINVRVG AAVKEVTGAN VDLEEPTKPL YVEIFQDTAA VCIPATGEYR KLRRDKPLAL GYLRKVALGQ FVYEGDEEAV RKMGERIGRA VQTFEVGELV ILLHKPIPAR TLRLFAEAVE EGIESRYQIQ TRSYGRPVWK VPVHVYELYQ WVRDRAGEPL IVTDPKGDYV THAKERLAEL FKSGRVNVLI GAREGVPTGV FRFASLVIDL IPEVTIATDF VVPALAIGLI SALEEAGTLP RYLGKRKKK
|
| |