Gene Information |
Locus tag | Pars_1378 |
Symbol | |
ID | 5054423 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1239486 |
End bp | 1240754 |
Gene Length | 1269 bp |
Protein Length | 422 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640468923 |
Product | Pre-mRNA processing ribonucleoprotein, binding region |
Protein accession | YP_001153592 |
Protein GI | 145591590 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog |
TIGRFAM ID | |
|
|
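The coordinates, gene length, and protein length listed above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a single untranslated stop codon; both are assumptions rather than statements from the record, and the variable names are illustrative.

```python
# Values copied from the Gene Information table above.
start_bp = 1239486
end_bp = 1240754

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1269 0 422
```

Under these assumptions the result agrees with the listed Gene Length (1269 bp) and Protein Length (422 aa).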
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.00034062 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0041241 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGAAAA TACATATCGC AACGGACGTT CTCGGCTTCT TCGCGGTGGA CGAGGGGGGC AACCTCGTAG ACAAGGAACT ATTCGAGAAG AAGCCTGAAC TTATTGCGGA GAGGCTTATC GAGCTGGAGA AATCCAACCC GGTGCCGGAG CTTGTGAAGC TTGTGGAGAG GCTAAGGGGC AGGGCGGAGA AGATTGTGCT AGAAGACCCG GAGCTGGCGC GGAAGCTTGT ATCCACGGTG AAGTGGGCCG AGGTGGTGGG CGAGAGCCCC TCCCCCGTAT TGGTGGCGTT TAGGCAGAAT TTCCAGAGGC ATCTCTCCAG CATTGGCCTG AGCTGGGAGG AGTACACAAA GTTCCTCTTC GAGATAAGCG ATCTGGTAAC GAGGTTAAAG CTGAGGCAGG CTGTGGCCAA GCGCGACTTG TACATCGCCC AGGCCATAAG CGCGCTTGAC GACGTGGACA AGATCATGAA CCTAATCGCG TCGAGGATAA GGGAGTGGTA CGGCCTCCAC TTCCCCGAAC TTGAGGAGTT GGTGAGAGAC AACAAGGAGT ACGTCTCTAT CGTATACCAC ATAGGCCATA GGTCTAAGAT TACGGAAGAC GCCTTGAAGA AGGTGGCCCC CGAGGCGCCG GAGGACAGAG TCAAGAAGAT AGTGGAGGCG GCGAAGAGGA GCGTCGGCGC AGAGATGTCA GACTGGGATC TCGACCAGCT CAAGACGTAT GCTGACGTAT TCCTGAAGCT CAACGCTTAC AGAGACCAGC TGGCTGCGTA CATCGACGAG GCCATGAAGG AGGTGGCCCC CAACATCAGG GAGCTGGTGG GGCCTCTGCT GGGCGCGAGG CTGATAAAGC TCGCCGGCGG CTTGACGAGG ATGGCGTTTC TCCCCGCCTC GACGATACAG GTCCTCGGCG CAGAGAAGGC GCTGTTCAGG GCGTTGAGGA CAGGAGGAAA GCCTCCAAAA CACGGCGTCA TATTCCAGTA TCCGGACATC TTCCGCTCTC CCCGCTGGCA GAGGGGGAAA ATCGCCAGGG CCCTTGCGGC TAAGCTGGCG ATTGCTGCCA AGGCAGATGC CTTCACTGGG AATTTCATAG CGCCGAGGCT AAAAGAGGAG TTGTTGAAGC GTATACAGGA AATAAAGACG TTATATGCAA AGCCGCCTCC CAAAGCCCCC GCACAGCCAA GCGCCAAGAC GCCGCCTCCT CCACCGCCGT CACCGCCAAG AGGGGGCGAG AGGAGGCCTC CTCCGAGGAG GGAAAGGGGA AGGAGGTAA
|
Protein sequence | MAKIHIATDV LGFFAVDEGG NLVDKELFEK KPELIAERLI ELEKSNPVPE LVKLVERLRG RAEKIVLEDP ELARKLVSTV KWAEVVGESP SPVLVAFRQN FQRHLSSIGL SWEEYTKFLF EISDLVTRLK LRQAVAKRDL YIAQAISALD DVDKIMNLIA SRIREWYGLH FPELEELVRD NKEYVSIVYH IGHRSKITED ALKKVAPEAP EDRVKKIVEA AKRSVGAEMS DWDLDQLKTY ADVFLKLNAY RDQLAAYIDE AMKEVAPNIR ELVGPLLGAR LIKLAGGLTR MAFLPASTIQ VLGAEKALFR ALRTGGKPPK HGVIFQYPDI FRSPRWQRGK IARALAAKLA IAAKADAFTG NFIAPRLKEE LLKRIQEIKT LYAKPPPKAP AQPSAKTPPP PPPSPPRGGE RRPPPRRERG RR
|
| |
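The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA, so no reverse complement is applied despite the minus strand) and uses the amino acid assignments of translation table 11, which are the same as those of the standard code. The `translate` helper and the constant names are hypothetical and not part of the record.

```python
from itertools import product

# Amino acids for codons enumerated in TCAG order. NCBI translation table 11
# assigns the same amino acids as the standard code (it differs in start codons).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base blocks of the gene sequence above.
print(translate("ATGGCGAAAA TACATATCGC AACGGACGTT"))  # MAKIHIATDV
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same function is the natural next step, though a dedicated library would normally be preferred for real analyses.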