Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1467 |
Symbol | |
ID | 5054851 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1326972 |
End bp | 1329191 |
Gene Length | 2220 bp |
Protein Length | 739 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640469007 |
Product | hypothetical protein |
Protein accession | YP_001153676 |
Protein GI | 145591674 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGCCGC AGGTTAGGCG GGCTGTTGAA CAGTTCAGAC GGGAGTACGA ATCTCTACTC ACCGAGGAGG AGCTCGAACA GTGCGTTAGG GAAATTGAGA AGACTGAGAC GGCGATAGCG GGGTACGTAT GCGCCGCCAG AATTCTTATG AACAGACCAC CGCCGCAACC GAGGCAGGCG CGGGAAGCTC CCAGGCGGGA AGCCCCCAGC CTCCACATCC CCCTCCCCAG CCTCCCAGCC CCCATCTCCC GCCTCCCCCG GCTCCCAGCC CTCCGGAGCT GGCCCGTGGC GGCCGGCCTC GGCATGACCG CCGCCGCCCT CCTCGGCGCG GTCCACCCAG CCCTAGCCCT CCTCTCCCTA CCCGCGTTGT ACATCGCCCG AGCCGCGTTA GGCGCTGTAC ATACGCCCAT CTGGCGGGAG GGGGACCACG CTGTTGCCCT CATGGGCAGG GAGAAGGTCA AGGCGAGGCT CTACCGGGTG GCCGCGGTGT TTAGAGACGT ACACGGAATG GGGCCGTATG AGTTCGCCAA CGCCGTGCGC GCCTTTGTTC CCCTCGTTAA AGGCGTCTAC TACGATGGGA GGGACGTCTA CGTCTTGCTA GAAGACGGCG CCGAGGAGGC CCGGACGGCG TTGCGCCGCC TGGGCATTGT AGTGGAGGAC CAGCCCTCGC CCCCTCCCCC GGAGCTCGGC CCGAAACAGA CGGCGCTGAG ATACGCCCCC CTAGCCGCAT TACCCCTCGC GGCCTCCCTC CTCGCCCCAG CCGCACTGCC CTTCCTCGTA TTCGCAGTAG TGTTCTACCT GCTAATGCTG GCTAGAGACG TGGGCACCCC CGCGGCGGGG AGAGGCATTG AGAATAACGA CGCCCTATTT GCAATACTAC GGAAGGAGGA GATTTGGTCG ATAGCCCGCG TCTCCCAGCT CACGATTAGC AAGGCGTTGT TAGTCTGGGC GCCAAACAAA GCCTTTCTAT CGCGAATCAC CAAGCGGGCG TTGAGGCAGG AGCACCTGGC TATGTTGCTC CTCTCCCGCG TGAGAATGAT GCGCGCCGAG GAGGTGGCCG CGGTGAGACA ACGGGTTGTC CACCAGAGGG AGGAGGCCTT CTCTGTCGCC GGCTTGGTGG AGGGCAAACC GTCGGCGTTC TCAGTCGGCC GTCCCAACGT CGTAGACGCG CTCACCTTCG ACCTCGCCGA GTTCACCCCG TACTCCTTCA TGCTGTTGCC GTTTCAATGC GGCTCCGGTA CGTATAAACT GGGCTGGGAC GACAGAGGGC GGGAGGTCTG CATAGACCCC TACCAGCTGG AGTCCCCACA CGCCGTCGTT ATTGGCAAGA CGGGGTCCGG CAAAACCACT TGGTCTCTGG CACAAGCTTT ACAGGCGTTG CGGGCGGGAA GGTTTGTTGT TGCAATCGAC CCCCACGGCC ACTGGGCGAG GTACGCTACC GCCGTGGTAG ACGCCAGGCG GTACATCCCG CGTATCAAAT TCTCCGTCGA GGGCGGCGGG GAGGAGTTTT CAGATGTCGA CCTCCTTCTG GACGTCCTCC GCGCGGCTGG GGTGGCGGTG GCAGACGTAC ACTACACAGT CTTACTAAAT GCCCTCGAGC GGGCGGGGGG CTCTGCCGAC TTGCCGAGCT TAGTTACAGC GTTATCGAGA ATTAGAGACC CTCTCAACGC CCTCGCCGTG GATATGATCG CCGGGCGGAT TAAAGCACTG GCCCGGGCCG AGCCGATAGA CCTACCCACG TCGGGCCTCG TGGTGGTGAC CACCTACGGC GCCGAGTCGC CACACGCCGT CATGCGCCTC ATCACGTGGC TATTCTCCTA CGCCGTCTGG GCCAAGCAGA CGTGTCCCAG GCCGCCCTGC AAGCCCCGGC TGGAGATTTA CATCGATGAG GCCCACCTCC TCCTCAGACA CCTCGAGGCG CTGGCGCTTG CGTGGAGGGG GCTGAGGAAA TACGGCGTTA GGCTGGTGGC GTTGTCGCAA GACGTCGCCG AGTTCGGAGG GCCTCTCTCC ACAATTATCG CCAACTCGGA CACTAAGGCG GTCCTGGCCA TAGACCCGAC GCAGTTGCAA AACATCTCCC GGGCCGTGGG GGTAGACCCC TCGGTGCTCG AACGTGTGGC CACCGAGGCC CTGCCCGAGG AGCGCTACGC CGTTGTGAGA TTCGGCGGCA GGGCGCCGGT CTTCATACGG CTTATCCGCC CCGAGGACCT CATTTCCTGA
|
Protein sequence | MRPQVRRAVE QFRREYESLL TEEELEQCVR EIEKTETAIA GYVCAARILM NRPPPQPRQA REAPRREAPS LHIPLPSLPA PISRLPRLPA LRSWPVAAGL GMTAAALLGA VHPALALLSL PALYIARAAL GAVHTPIWRE GDHAVALMGR EKVKARLYRV AAVFRDVHGM GPYEFANAVR AFVPLVKGVY YDGRDVYVLL EDGAEEARTA LRRLGIVVED QPSPPPPELG PKQTALRYAP LAALPLAASL LAPAALPFLV FAVVFYLLML ARDVGTPAAG RGIENNDALF AILRKEEIWS IARVSQLTIS KALLVWAPNK AFLSRITKRA LRQEHLAMLL LSRVRMMRAE EVAAVRQRVV HQREEAFSVA GLVEGKPSAF SVGRPNVVDA LTFDLAEFTP YSFMLLPFQC GSGTYKLGWD DRGREVCIDP YQLESPHAVV IGKTGSGKTT WSLAQALQAL RAGRFVVAID PHGHWARYAT AVVDARRYIP RIKFSVEGGG EEFSDVDLLL DVLRAAGVAV ADVHYTVLLN ALERAGGSAD LPSLVTALSR IRDPLNALAV DMIAGRIKAL ARAEPIDLPT SGLVVVTTYG AESPHAVMRL ITWLFSYAVW AKQTCPRPPC KPRLEIYIDE AHLLLRHLEA LALAWRGLRK YGVRLVALSQ DVAEFGGPLS TIIANSDTKA VLAIDPTQLQ NISRAVGVDP SVLERVATEA LPEERYAVVR FGGRAPVFIR LIRPEDLIS
|
| |