Gene Pars_1378 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1378 
Symbol 
ID5054423 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1239486 
End bp1240754 
Gene Length1269 bp 
Protein Length422 aa 
Translation table11 
GC content58% 
IMG OID640468923 
ProductPre-mRNA processing ribonucleoprotein, binding region 
Protein accessionYP_001153592 
Protein GI145591590 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1498] Protein implicated in ribosomal biogenesis, Nop56p homolog 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00034062 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0041241 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGGCGAAAA TACATATCGC AACGGACGTT CTCGGCTTCT TCGCGGTGGA CGAGGGGGGC 
AACCTCGTAG ACAAGGAACT ATTCGAGAAG AAGCCTGAAC TTATTGCGGA GAGGCTTATC
GAGCTGGAGA AATCCAACCC GGTGCCGGAG CTTGTGAAGC TTGTGGAGAG GCTAAGGGGC
AGGGCGGAGA AGATTGTGCT AGAAGACCCG GAGCTGGCGC GGAAGCTTGT ATCCACGGTG
AAGTGGGCCG AGGTGGTGGG CGAGAGCCCC TCCCCCGTAT TGGTGGCGTT TAGGCAGAAT
TTCCAGAGGC ATCTCTCCAG CATTGGCCTG AGCTGGGAGG AGTACACAAA GTTCCTCTTC
GAGATAAGCG ATCTGGTAAC GAGGTTAAAG CTGAGGCAGG CTGTGGCCAA GCGCGACTTG
TACATCGCCC AGGCCATAAG CGCGCTTGAC GACGTGGACA AGATCATGAA CCTAATCGCG
TCGAGGATAA GGGAGTGGTA CGGCCTCCAC TTCCCCGAAC TTGAGGAGTT GGTGAGAGAC
AACAAGGAGT ACGTCTCTAT CGTATACCAC ATAGGCCATA GGTCTAAGAT TACGGAAGAC
GCCTTGAAGA AGGTGGCCCC CGAGGCGCCG GAGGACAGAG TCAAGAAGAT AGTGGAGGCG
GCGAAGAGGA GCGTCGGCGC AGAGATGTCA GACTGGGATC TCGACCAGCT CAAGACGTAT
GCTGACGTAT TCCTGAAGCT CAACGCTTAC AGAGACCAGC TGGCTGCGTA CATCGACGAG
GCCATGAAGG AGGTGGCCCC CAACATCAGG GAGCTGGTGG GGCCTCTGCT GGGCGCGAGG
CTGATAAAGC TCGCCGGCGG CTTGACGAGG ATGGCGTTTC TCCCCGCCTC GACGATACAG
GTCCTCGGCG CAGAGAAGGC GCTGTTCAGG GCGTTGAGGA CAGGAGGAAA GCCTCCAAAA
CACGGCGTCA TATTCCAGTA TCCGGACATC TTCCGCTCTC CCCGCTGGCA GAGGGGGAAA
ATCGCCAGGG CCCTTGCGGC TAAGCTGGCG ATTGCTGCCA AGGCAGATGC CTTCACTGGG
AATTTCATAG CGCCGAGGCT AAAAGAGGAG TTGTTGAAGC GTATACAGGA AATAAAGACG
TTATATGCAA AGCCGCCTCC CAAAGCCCCC GCACAGCCAA GCGCCAAGAC GCCGCCTCCT
CCACCGCCGT CACCGCCAAG AGGGGGCGAG AGGAGGCCTC CTCCGAGGAG GGAAAGGGGA
AGGAGGTAA
 
Protein sequence
MAKIHIATDV LGFFAVDEGG NLVDKELFEK KPELIAERLI ELEKSNPVPE LVKLVERLRG 
RAEKIVLEDP ELARKLVSTV KWAEVVGESP SPVLVAFRQN FQRHLSSIGL SWEEYTKFLF
EISDLVTRLK LRQAVAKRDL YIAQAISALD DVDKIMNLIA SRIREWYGLH FPELEELVRD
NKEYVSIVYH IGHRSKITED ALKKVAPEAP EDRVKKIVEA AKRSVGAEMS DWDLDQLKTY
ADVFLKLNAY RDQLAAYIDE AMKEVAPNIR ELVGPLLGAR LIKLAGGLTR MAFLPASTIQ
VLGAEKALFR ALRTGGKPPK HGVIFQYPDI FRSPRWQRGK IARALAAKLA IAAKADAFTG
NFIAPRLKEE LLKRIQEIKT LYAKPPPKAP AQPSAKTPPP PPPSPPRGGE RRPPPRRERG
RR