Gene Pars_1675 details


Gene Information

Locus tag: Pars_1675
Symbol:
ID: 5055888
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1512648
End bp: 1513955
Gene Length: 1308 bp
Protein Length: 435 aa
Translation table: 11
GC content: 55%
IMG OID: 640469216
Product: hypothetical protein
Protein accession: YP_001153878
Protein GI: 145591876
COG category: [R] General function prediction only
COG ID: [COG2403] Predicted GTPase
TIGRFAM ID:
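
The coordinates, gene length, and protein length in this record can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1512648
end_bp = 1513955

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)
```

Under these assumptions the script prints 1308 0 435, which agrees with the gene length of 1308 bp and protein length of 435 aa listed in the record.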


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTAGAG TTGCAATTAT CGGAGCTGCC GGAAGAGACT TCCACGTCTA TAACACCGTA 
TACCGAGGCT CGCGAGAATA TAAAGTCGTG GCGTTTCTAA TGACCCAGAT CCCTATCCCA
AACAGGAGAT ATCCGCCCTC TCTCTCGGGC GTGCCGGAAG GCGTTCCAAT ATACACCTGG
AAAAGTTACG AGGAGTTGAC TAGATATCTA AAAGAGCTCC GTGTCGACGA GGCAGTCTTG
GCGTTCAGCG ATTTGACCTA TGAGGATGTT GGCCACATAA TCTCAGCGGT GTTGGCCAGC
GGCGCCTCAT TTAAAATACA CGGGCCCAAC GACACGTACC TAAATTCTAT AAAGCCCGTC
ATCGCAGTAA CGGCAACTAG AACAGGCGCG GGAAAATCCA CTGTCTCCCG GGAGGTTGTA
AGAGAACTCA CCTCGCGTGG GTTAAGGGTC GTGGCTGTGA GGCACCCTAT GCCGTACAGA
GAGCTGGAGG ACAGCGTCGT GGAGGTATTC AAAAAGCCGG AAGACCTAGA GAAGCTTACC
TTCGAGGAGA GAGAGGAGTA CGAGCAGTAC GTCGAGATGG GCGTGCCCGT CCTTGCCGGC
GTGGACTACG GACTGGTGCT GAGGGAGGCA GAGAGGCACG GCGACGTTGT CTTGTGGGAC
GGCGGCAACA ACGACTTCCC CTTCTTCAAG CCGGGCTTCA TGATTGTAGT TACCGACGCC
AGGAGGGCTG GGCACGAGGT CGGCTCCTTC CCAGGAGAGG TCAACCTACG TCTAGCAGAC
GCCGTGATAA TTACAAAGGT CAGTGACGCC GGGAGGGAAA ACGTCGAAAA AGTTGTGGCC
AATGTCAAGA GGGTCAACCC CAGGGCCACC ATAACCAAGG CAGACCTAGA AGTCGGCGTC
GACAGCAACA TATCGGGCAA GAGGGTACTG GTGGTCGAAG ACGCGCCGAC AGTCACCCAC
GGAGGGTTGC CCTACGCCGC TGGCTACATT GCTGCGGTTA AATACGGCGC AGTTGTGGTA
GACCCAAGAC CCTACGCCGT GGGCGTAATT AAAAAAGTGT ACGAAGAGTA CGGCACAGGG
CCCGTCTTGC CAAGTCTGGG CTACACCGAG GAGCAGAAAC GTGACCTAGA AGAAACTATT
AGAAGGGCCG ACGCAGACCT CGTGTTGCTC GCTACTCCTG CGAAAATTGA GCGCGTCGTC
AAGATTGACA AGCCGATTGC GAGGGTCTCC TGGAGGCTTA AGGTAGTGGA AGGGCCGACA
GTCAAAGAAC TTATTGATCG GTTCCTCGAA ACGGCGTCTC TACGCTAG
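
The gene sequence is printed in blocks of ten bases separated by spaces, so it needs to be cleaned before its length or GC content can be checked against the record. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first printed line of the sequence is used as example input.

```python
def clean_sequence(text):
    """Remove all whitespace and upper-case a sequence printed in spaced blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First printed line of the gene sequence, used as a small example input.
first_line = "ATGCGTAGAG TTGCAATTAT CGGAGCTGCC GGAAGAGACT TCCACGTCTA TAACACCGTA"
seq = clean_sequence(first_line)

# Six blocks of ten bases give 60 bases for this line.
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full sequence block through the same two helpers would give values to compare with the Gene Length and GC content fields.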
 
Protein sequence
MRRVAIIGAA GRDFHVYNTV YRGSREYKVV AFLMTQIPIP NRRYPPSLSG VPEGVPIYTW 
KSYEELTRYL KELRVDEAVL AFSDLTYEDV GHIISAVLAS GASFKIHGPN DTYLNSIKPV
IAVTATRTGA GKSTVSREVV RELTSRGLRV VAVRHPMPYR ELEDSVVEVF KKPEDLEKLT
FEEREEYEQY VEMGVPVLAG VDYGLVLREA ERHGDVVLWD GGNNDFPFFK PGFMIVVTDA
RRAGHEVGSF PGEVNLRLAD AVIITKVSDA GRENVEKVVA NVKRVNPRAT ITKADLEVGV
DSNISGKRVL VVEDAPTVTH GGLPYAAGYI AAVKYGAVVV DPRPYAVGVI KKVYEEYGTG
PVLPSLGYTE EQKRDLEETI RRADADLVLL ATPAKIERVV KIDKPIARVS WRLKVVEGPT
VKELIDRFLE TASLR
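
The protein sequence can be related back to the gene sequence by translating codons one at a time. The sketch below is an illustration, not the database's own pipeline. It assumes that translation table 11 assigns the same amino acids to each codon as the standard genetic code (the tables differ, to my understanding, only in which codons may act as start codons), and it uses only the first 30 and last 12 bases of the gene sequence as input. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order, "*" marking stop codons.
# Assumption: table 11 shares these codon assignments with the standard code.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq):
    """Translate a cleaned DNA sequence in frame 0, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


gene_start = "ATGCGTAGAGTTGCAATTATCGGAGCTGCC"  # first 30 bases of the gene
gene_end = "TCTCTACGCTAG"                      # last 12 bases of the gene

print(translate(gene_start))  # MRRVAIIGAA
print(translate(gene_end))    # SLR
```

The two outputs match the first ten residues (MRRVAIIGAA) and the last three residues (SLR) of the protein sequence in this record, with the final TAG codon read as a stop.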