Gene Pars_1467 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1467 
Symbol 
ID5054851 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1326972 
End bp1329191 
Gene Length2220 bp 
Protein Length739 aa 
Translation table11 
GC content64% 
IMG OID640469007 
Producthypothetical protein 
Protein accessionYP_001153676 
Protein GI145591674 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCCGC AGGTTAGGCG GGCTGTTGAA CAGTTCAGAC GGGAGTACGA ATCTCTACTC 
ACCGAGGAGG AGCTCGAACA GTGCGTTAGG GAAATTGAGA AGACTGAGAC GGCGATAGCG
GGGTACGTAT GCGCCGCCAG AATTCTTATG AACAGACCAC CGCCGCAACC GAGGCAGGCG
CGGGAAGCTC CCAGGCGGGA AGCCCCCAGC CTCCACATCC CCCTCCCCAG CCTCCCAGCC
CCCATCTCCC GCCTCCCCCG GCTCCCAGCC CTCCGGAGCT GGCCCGTGGC GGCCGGCCTC
GGCATGACCG CCGCCGCCCT CCTCGGCGCG GTCCACCCAG CCCTAGCCCT CCTCTCCCTA
CCCGCGTTGT ACATCGCCCG AGCCGCGTTA GGCGCTGTAC ATACGCCCAT CTGGCGGGAG
GGGGACCACG CTGTTGCCCT CATGGGCAGG GAGAAGGTCA AGGCGAGGCT CTACCGGGTG
GCCGCGGTGT TTAGAGACGT ACACGGAATG GGGCCGTATG AGTTCGCCAA CGCCGTGCGC
GCCTTTGTTC CCCTCGTTAA AGGCGTCTAC TACGATGGGA GGGACGTCTA CGTCTTGCTA
GAAGACGGCG CCGAGGAGGC CCGGACGGCG TTGCGCCGCC TGGGCATTGT AGTGGAGGAC
CAGCCCTCGC CCCCTCCCCC GGAGCTCGGC CCGAAACAGA CGGCGCTGAG ATACGCCCCC
CTAGCCGCAT TACCCCTCGC GGCCTCCCTC CTCGCCCCAG CCGCACTGCC CTTCCTCGTA
TTCGCAGTAG TGTTCTACCT GCTAATGCTG GCTAGAGACG TGGGCACCCC CGCGGCGGGG
AGAGGCATTG AGAATAACGA CGCCCTATTT GCAATACTAC GGAAGGAGGA GATTTGGTCG
ATAGCCCGCG TCTCCCAGCT CACGATTAGC AAGGCGTTGT TAGTCTGGGC GCCAAACAAA
GCCTTTCTAT CGCGAATCAC CAAGCGGGCG TTGAGGCAGG AGCACCTGGC TATGTTGCTC
CTCTCCCGCG TGAGAATGAT GCGCGCCGAG GAGGTGGCCG CGGTGAGACA ACGGGTTGTC
CACCAGAGGG AGGAGGCCTT CTCTGTCGCC GGCTTGGTGG AGGGCAAACC GTCGGCGTTC
TCAGTCGGCC GTCCCAACGT CGTAGACGCG CTCACCTTCG ACCTCGCCGA GTTCACCCCG
TACTCCTTCA TGCTGTTGCC GTTTCAATGC GGCTCCGGTA CGTATAAACT GGGCTGGGAC
GACAGAGGGC GGGAGGTCTG CATAGACCCC TACCAGCTGG AGTCCCCACA CGCCGTCGTT
ATTGGCAAGA CGGGGTCCGG CAAAACCACT TGGTCTCTGG CACAAGCTTT ACAGGCGTTG
CGGGCGGGAA GGTTTGTTGT TGCAATCGAC CCCCACGGCC ACTGGGCGAG GTACGCTACC
GCCGTGGTAG ACGCCAGGCG GTACATCCCG CGTATCAAAT TCTCCGTCGA GGGCGGCGGG
GAGGAGTTTT CAGATGTCGA CCTCCTTCTG GACGTCCTCC GCGCGGCTGG GGTGGCGGTG
GCAGACGTAC ACTACACAGT CTTACTAAAT GCCCTCGAGC GGGCGGGGGG CTCTGCCGAC
TTGCCGAGCT TAGTTACAGC GTTATCGAGA ATTAGAGACC CTCTCAACGC CCTCGCCGTG
GATATGATCG CCGGGCGGAT TAAAGCACTG GCCCGGGCCG AGCCGATAGA CCTACCCACG
TCGGGCCTCG TGGTGGTGAC CACCTACGGC GCCGAGTCGC CACACGCCGT CATGCGCCTC
ATCACGTGGC TATTCTCCTA CGCCGTCTGG GCCAAGCAGA CGTGTCCCAG GCCGCCCTGC
AAGCCCCGGC TGGAGATTTA CATCGATGAG GCCCACCTCC TCCTCAGACA CCTCGAGGCG
CTGGCGCTTG CGTGGAGGGG GCTGAGGAAA TACGGCGTTA GGCTGGTGGC GTTGTCGCAA
GACGTCGCCG AGTTCGGAGG GCCTCTCTCC ACAATTATCG CCAACTCGGA CACTAAGGCG
GTCCTGGCCA TAGACCCGAC GCAGTTGCAA AACATCTCCC GGGCCGTGGG GGTAGACCCC
TCGGTGCTCG AACGTGTGGC CACCGAGGCC CTGCCCGAGG AGCGCTACGC CGTTGTGAGA
TTCGGCGGCA GGGCGCCGGT CTTCATACGG CTTATCCGCC CCGAGGACCT CATTTCCTGA
 
Protein sequence
MRPQVRRAVE QFRREYESLL TEEELEQCVR EIEKTETAIA GYVCAARILM NRPPPQPRQA 
REAPRREAPS LHIPLPSLPA PISRLPRLPA LRSWPVAAGL GMTAAALLGA VHPALALLSL
PALYIARAAL GAVHTPIWRE GDHAVALMGR EKVKARLYRV AAVFRDVHGM GPYEFANAVR
AFVPLVKGVY YDGRDVYVLL EDGAEEARTA LRRLGIVVED QPSPPPPELG PKQTALRYAP
LAALPLAASL LAPAALPFLV FAVVFYLLML ARDVGTPAAG RGIENNDALF AILRKEEIWS
IARVSQLTIS KALLVWAPNK AFLSRITKRA LRQEHLAMLL LSRVRMMRAE EVAAVRQRVV
HQREEAFSVA GLVEGKPSAF SVGRPNVVDA LTFDLAEFTP YSFMLLPFQC GSGTYKLGWD
DRGREVCIDP YQLESPHAVV IGKTGSGKTT WSLAQALQAL RAGRFVVAID PHGHWARYAT
AVVDARRYIP RIKFSVEGGG EEFSDVDLLL DVLRAAGVAV ADVHYTVLLN ALERAGGSAD
LPSLVTALSR IRDPLNALAV DMIAGRIKAL ARAEPIDLPT SGLVVVTTYG AESPHAVMRL
ITWLFSYAVW AKQTCPRPPC KPRLEIYIDE AHLLLRHLEA LALAWRGLRK YGVRLVALSQ
DVAEFGGPLS TIIANSDTKA VLAIDPTQLQ NISRAVGVDP SVLERVATEA LPEERYAVVR
FGGRAPVFIR LIRPEDLIS