Gene Pars_1844 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1844 
Symbol 
ID5056182 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1649924 
End bp1651093 
Gene Length1170 bp 
Protein Length389 aa 
Translation table11 
GC content51% 
IMG OID640469390 
ProductTGS domain-containing protein 
Protein accessionYP_001154047 
Protein GI145592045 
COG category[R] General function prediction only 
COG ID[COG1163] Predicted GTPase 
TIGRFAM ID[TIGR00231] small GTP-binding protein domain 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.262151 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCTA ATCTTCCTGC TGAGGCCAAG GCTGCTTGGC TGAAGGTGAT GGAGGCGAAG 
ACTCCTGAAG AGAAGCTACG GGCCATGGAA GAGTTCTTAT CCGCAGTGCC TAAGCACAAG
GGTACTGAGA AGCTGATTAA GCACATTCGG AGGAGGATGG CCGAGCTTAG GAGAGAGCTC
CAGGAGAGGC GTGAAAAGGC GCGTGCCGTG CGGGGCGGAG GGGGGGCGCG ACTTTACGTT
GCCAAAGAGG GGGATGTGCA AGTCGCTGTC GTGGGCCCGC CTATGTCTGG CAAAACGGCG
TTGTTGAGAT GCCTTACCAA CACTCACCTC GAGCCTGACG AACTCCCCTT TTCCACTGTC
GAGCCCATCC CGTCTATGTT TGTGGAGGAC GGCGTATATG TACAGCTGGT GAAAACGCCT
AGCTTAGTGC TGGACCAGAG TAGTGATCTT AACACAGTGA CGCTGGCGAC TGTGAGAAAT
GCCGATGCGG TGATGTTGGT GGTAGACGCT AACAATAACG CAACGCTTCT GCACAGAATA
ATACAGTTTT TTGAAGACGA GGGGATCTAC CTAACCCCTC CGACTAACTA CGTAAAAATT
GAGCGGAAGG GGCTGGGCGG CGTCCAGATA GTTGGTTCTG GCAAAATCGT AGGGGGGACG
TTGAGCGATG TTAAGAAGCT TCTACACGAG TACGGCATAT ACCACGCAGT GGTGCACATA
GAGGGGGTTG TATCTCTTGA CGAAGTAGAG GAGGCTTTGT ACTTAGACAA GATGTATAAG
CCTACTATAG TAATAATGTC AAAAGTCGAT TTATACCAAG TTAATAGAGA AGTAGAGGAG
TTTTTTACGA AGGCCGGCGT TAAGTACTAC AAGACCGATT TGAGGGTGTG TAATCTCGAT
AGGAGGAGAC TACTTGAGGA TATTCTACAA GCCACGGGGC GTATAAGAGT TTTTACAAAG
CCGGTTCATT CCAAGTGGTA CGTAGAGAAG CCAATTGTTG TGAAAGCAGG CTCAACAGTC
GGCGACGTTG CCGCCATGAT TCATTCATCG CTCGCCGAGA CGTTTAAGTA CGCTATTGTG
TGGCGCAGAG ATCAGTATCC CAACTGGCCT AAACGCGTGG GCCGCGACTA CGTCTTGTCC
GACAACGATG TAGTGGAAAT ACATGCATGA
 
Protein sequence
MPANLPAEAK AAWLKVMEAK TPEEKLRAME EFLSAVPKHK GTEKLIKHIR RRMAELRREL 
QERREKARAV RGGGGARLYV AKEGDVQVAV VGPPMSGKTA LLRCLTNTHL EPDELPFSTV
EPIPSMFVED GVYVQLVKTP SLVLDQSSDL NTVTLATVRN ADAVMLVVDA NNNATLLHRI
IQFFEDEGIY LTPPTNYVKI ERKGLGGVQI VGSGKIVGGT LSDVKKLLHE YGIYHAVVHI
EGVVSLDEVE EALYLDKMYK PTIVIMSKVD LYQVNREVEE FFTKAGVKYY KTDLRVCNLD
RRRLLEDILQ ATGRIRVFTK PVHSKWYVEK PIVVKAGSTV GDVAAMIHSS LAETFKYAIV
WRRDQYPNWP KRVGRDYVLS DNDVVEIHA