Gene Pars_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1987 
Symbol 
ID5055491 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1777429 
End bp1778499 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content57% 
IMG OID640469534 
ProductN(2),N(2)-dimethylguanosine tRNA methyltransferase 
Protein accessionYP_001154186 
Protein GI145592184 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG1867] N2,N2-dimethylguanosine tRNA methyltransferase 
TIGRFAM ID[TIGR00308] tRNA(guanine-26,N2-N2) methyltransferase 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones34 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGCCGCA TAGTACTGAG GAGGGAGGGC ACTGTAGAAT TCTACGCTCC AGATCCCCGT 
GCCTATGGCA GTATCTACTC CGCCCCTGTT TTTTACAACC CTGCCATGGA GAAAAACAGG
ACGCTCTCCG TCTTGTTCCT CAAGGCGTAT GGCGCCACTG GGCTGACTGT GTGCGAGCCG
CTCAGCGGCA CAGGCGTGAG GGGTATTAGA TACGCCGTGG AGACCGGCGC GGTTGGAAGA
CTTATCCTAA ACGACATCTC GCGCGAGGCT GTCGAGCTCA TAAAGAAAAA TCTAGAGGTA
AACGGCGTCG ACGCCGAGGT CTACAACGAG GATGCCAACG TGCTTCTCCA CAGGCTAAGG
GACAAGTGCG ACGTTGTCGA CATCGACCCC TTCGGCTCGC CGGCGCCCTT CTTGCAAGCC
GGCTTCCGTG CCTTGCGGGA AGAAGGCGTG ATCTGCGTCA CAGCCACGGA CACCGCCGTC
TTGGTAGGGC GCTATCCCAA GAAGTGTCTT AGGAGGTACG GCGCTGTTAT GTTCAAAACG
CCTTTCTACA TTGAGGCAGG TTTGAGGAAT CTCACTGGCT ACGTGGCCAG GGTGGCTGCG
TCTGAGGACT ACGGGTTCGA GCCGCTTTTT GCCTATTGGG AAGGCCACTA CTTCAGGCTC
TGTGCCCGGG CCGTGAGGGG GGCCCGCGAT GCAGACTCCA ATTTTAACTT CATCGGATAT
GTGGAGTACA AAAAGCCTTC CAGGAAAGTG GTCAGACGCC CTGGCGAAAG ATCTCTGGGC
CCTCTCTGGG TTGGGCCCAT GGGGGACCCG ATCTTGGTAA ACAAGATGGC CGAGTACGGG
CCGTATGGGG ACTTTTTACA GCTACTCTCT GAGGAGTACT CGGTGGCCGC GCCTTGGTTT
TTCAAAGTCC CCGAGTTCGC CTTAGGTGGG GTAAGCCGGG GTATAGAAGA GGCGCTCAAC
GCCTTGAAAA AAGGCGGTAT ATACGCCGTG CGGACACACA TGGCTCCTGA CGGCTTTAAA
ACAGAGGTGA GTGCCGGCGA GGTGGAGAGA GTCCTGAGAA TTGTTATATA G
 
Protein sequence
MSRIVLRREG TVEFYAPDPR AYGSIYSAPV FYNPAMEKNR TLSVLFLKAY GATGLTVCEP 
LSGTGVRGIR YAVETGAVGR LILNDISREA VELIKKNLEV NGVDAEVYNE DANVLLHRLR
DKCDVVDIDP FGSPAPFLQA GFRALREEGV ICVTATDTAV LVGRYPKKCL RRYGAVMFKT
PFYIEAGLRN LTGYVARVAA SEDYGFEPLF AYWEGHYFRL CARAVRGARD ADSNFNFIGY
VEYKKPSRKV VRRPGERSLG PLWVGPMGDP ILVNKMAEYG PYGDFLQLLS EEYSVAAPWF
FKVPEFALGG VSRGIEEALN ALKKGGIYAV RTHMAPDGFK TEVSAGEVER VLRIVI