Gene Pars_0986 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0986 
Symbol 
ID5055469 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp879137 
End bp880306 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content53% 
IMG OID640468542 
Productpeptide chain release factor 1 
Protein accessionYP_001153218 
Protein GI145591216 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1503] Peptide chain release factor 1 (eRF1) 
TIGRFAM ID[TIGR00108] peptide chain release factor eRF/aRF, subunit 1
[TIGR03676] peptide chain release factor 1, archaeal and eukaryotic forms 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.706075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.000607277 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGCTTTA ATAGGCCGCC TAATGGCGTT TACTACGTAA AGACCGCCAC GGAGCTGAGG 
GCGTTTGTGA ATGTCTTGAA AAAGTTCCGG GGCTACGCCA CTACCCTCAT TACTTTGTAC
ATCAACTCGG AGCGTCCTAT CCCCGATGTG TTGAATCTGC TAAGATCGGA GTGGTCCACC
GCGGCTAACA TTAAAGACAA GACGACTAGG ACGCACGTCC AGGACACCTT GGAGAGGATT
ATCAACAACC TCAAGGGCGA GGCTAAGGCT CCTGAAAACG GCATGGCCAT CTTCGCGGGG
TTCCACATGA TAAACCAGGG CAACTACGAG TGGGTGTACT ACGTCGTGGT TCCGCCCCAG
CCTATCTATA CGTTTAAATA CATCTGCGAC ACGGCTTTCC ACACGGAGAT CTTAGAAGAG
CAACTACACG CAGCTGTTAC CTACGGCATA GTGGTAGTGG AGAGAGGAGA GGCGGTGATC
GCGTTGCTAA AAGGCGGGCA GTGGGAAGTT GTTAAGACTG TTGAGTTCTT CGTGCCGGGG
AAGCACCACG CAGGAGGACA GTCCGCCAAC CGCTTTAAGC GCCAGACAGA GCACTTAGCC
GAAACTTTTT ACAAGGTGCT GGCTGAGGAG GTTAACAAGA TATTCTTACA GATCCCCACG
CTGAAGGGGA TCATCGTGGC GGGGCCTGGG CCCACGAAGG AGGACTTCTT AGAAGAGGGG
GGCTTGGACT ACCGCCTCAA GGACAAGGTT CTGGCCGTTG TACCTGCGTG CTGCGCCAAC
GAGTACGGTG TGGTGGAGGC TATTAGAAAC GCCCAGGACC AGCTCAAGGA GAGCGAATAC
GTCAAGGCAA AGGAGGTTAT GGACAAGGTC ATGTTCTACG CCGTTAAGAA AAGCGATTAC
ATTGTATATG GGAGGGACCG AACGCTTAAG GCACTTCAGA TGGGAATGGC GGAGCTGGTG
GTAATAGCCG AGGAACTCGG CGAAGACGTA GTTCTCGACG TAGTAATGAA GGCGGAGGAG
AAGGGAATTA AAGTAGAAGT TATACCAAAG GGCGTGGAGG AGTCCAAGAC GCTGATGCAG
GCATTTGGAG GATACGTGGC GCTTCTTTCA ACCCCCGTTT GGGTTCTTGA GCAACAAATT
GCGGCAGAGG CCGCCACAAC AACGTCATAA
 
Protein sequence
MSFNRPPNGV YYVKTATELR AFVNVLKKFR GYATTLITLY INSERPIPDV LNLLRSEWST 
AANIKDKTTR THVQDTLERI INNLKGEAKA PENGMAIFAG FHMINQGNYE WVYYVVVPPQ
PIYTFKYICD TAFHTEILEE QLHAAVTYGI VVVERGEAVI ALLKGGQWEV VKTVEFFVPG
KHHAGGQSAN RFKRQTEHLA ETFYKVLAEE VNKIFLQIPT LKGIIVAGPG PTKEDFLEEG
GLDYRLKDKV LAVVPACCAN EYGVVEAIRN AQDQLKESEY VKAKEVMDKV MFYAVKKSDY
IVYGRDRTLK ALQMGMAELV VIAEELGEDV VLDVVMKAEE KGIKVEVIPK GVEESKTLMQ
AFGGYVALLS TPVWVLEQQI AAEAATTTS