Gene Pars_1255 details


Gene Information

Locus tag: Pars_1255
Symbol:
ID: 5055791
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1134028
End bp: 1135524
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 55%
IMG OID: 640468798
Product: restriction endonuclease
Protein accession: YP_001153471
Protein GI: 145591469
COG category:
COG ID:
TIGRFAM ID:
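
The start and end positions, gene length, and protein length listed above are mutually consistent if the coordinates are 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the check follows; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(1134028, 1135524)
print(length, protein_length(length))  # prints: 1497 498
```

Both values match the Gene Length and Protein Length fields of this record.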


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.185862
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 28
Fosmid unclonability p-value: 0.543832
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCGTCC TGGATATCTT AACGTCTCTC AGCTCTGAAG AGTTTGAAAA ATATGTAGCT 
GACTATGTGC TACCGGTCTT GGGCTTAAGA GTTCACAACG TTGTGGGTGG GCCGTACGAC
AGAGGTTGTG ACATAATTGC TGAGGACACG CGGTTTGGGA GTAGGGTATG CGTCCAGGTT
AAGAGGTACT CGCCGGAGAG GAAAGTGACG GAGAAAGATG TCAGAAACGT TTTGTTCGGC
ATGGAGCAAC ACCGCTGTGA CCGCGGGCTC ATTGTCACCA CCTCTGATCT CAACGGACCT
GCGCTGAGCT TAGCGAGGCA GTACCGGATA GACTACATAA ACGGCGCGAG GCTTGCCAGG
ATGGTGGAGG AGCAGTTAAT TCCCCTGGTG ATGCCCAAGG CCGTGGTCGC AGCGGTGCAT
CAAGAAGAGG GGAGTCATGA GGCCGTGGAG AGGGAGGTGA GGGATGACGG GGTGTTTATC
CCGCTGGGAG TCACCAACGC CGTGGAGGTT GCAAGGGCCT ATCTGAAGTC CAAGGGGGCG
CTACATCCTC AGCTCGGGGG CGTATCCGCA CTTTTGAAAA GGCTCTACGT GTTTAAAGCC
AAGGCGAGCT ACAAGCTGGG GAGGAGGAGG TCGGAGGAGG CCGTAATTTC GGTGGACGCA
GAGGGTGAGG TATACGAGGG CGTGCCGCCT CTGATCAATA CGGTTAACTT CTATGTGGAG
TACGAGACGA GCAGGGAGGA CTACTACTCG GCCCGGGAAA TCGCCATCCG CTATATCACA
AGTAGAATAG TCCCGGAGGG GGCGCAAGAT ATCAAGATCC AGCTCAAGAA CCACGCCCTT
GCGTGGGTTG CGGCGCTGTA CGCCATACGC TTTAAGGTGG GGCTCGTTGA CGTGGTTGTA
CACGTCGATA AGAAGGGAAG AGTTGTGAAA ATGGAGCGAG GCCGCCTTAC CGATGACTTA
GTGAGGGGAG CGTACGGCGG CGAGGTGGTG AGAGGCGATG GTTACAAGGT GAGGCTAGAT
CAGGGCAATT TCGTGGAGGA GCTAAAGCTC AACGAGTTTG GAGAGGTTGT GGCAAGGGCT
CGGGCAGTCA AGGAGAGCTA CGCCGTGGAG GTGGCATCTA AGTTTTTCGG CATTGCCGGG
GAGGACGTGA GGTATAAACG CGAAGGCGGC GCGGTAAAAG TAGACATTTT TCTAAACGGC
CACCACCACC TCGCCAAAGT GGACGAAAAC GGCGAGGTGG TTGACTACGT GGTGGTGCCC
GACGCAGAAA TTTACGAAGG CTTCGAAAAG GGGTATAACA TAAGGATGAG GGCGCTCATT
GTGAAGACAG TGGAAGATGG CGAAGAAGTG GTGCGGGTTG TGACAAGCGA AGGCGTAGTT
GACGAAAAAA GAGCGAAGAG GTCTCTTCTA AGGAAAATCG GGAGCAGTCT AGCGGGGCTT
GTCAAGAAGT CGGAGGAGTA CTCAATAGAT ACGGCTGACC CCCTCAACTT AATCTAG
 
Protein sequence
MGVLDILTSL SSEEFEKYVA DYVLPVLGLR VHNVVGGPYD RGCDIIAEDT RFGSRVCVQV 
KRYSPERKVT EKDVRNVLFG MEQHRCDRGL IVTTSDLNGP ALSLARQYRI DYINGARLAR
MVEEQLIPLV MPKAVVAAVH QEEGSHEAVE REVRDDGVFI PLGVTNAVEV ARAYLKSKGA
LHPQLGGVSA LLKRLYVFKA KASYKLGRRR SEEAVISVDA EGEVYEGVPP LINTVNFYVE
YETSREDYYS AREIAIRYIT SRIVPEGAQD IKIQLKNHAL AWVAALYAIR FKVGLVDVVV
HVDKKGRVVK MERGRLTDDL VRGAYGGEVV RGDGYKVRLD QGNFVEELKL NEFGEVVARA
RAVKESYAVE VASKFFGIAG EDVRYKREGG AVKVDIFLNG HHHLAKVDEN GEVVDYVVVP
DAEIYEGFEK GYNIRMRALI VKTVEDGEEV VRVVTSEGVV DEKRAKRSLL RKIGSSLAGL
VKKSEEYSID TADPLNLI
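
The gene and protein sequences above can be cross-checked by stripping the 10-base grouping, translating codon by codon, and computing the GC percentage. The sketch below is illustrative and all helper names are hypothetical. It uses the standard codon assignments, which translation table 11 shares. It does not handle the alternative start codons that table 11 permits, which is not a problem here because the gene begins with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard assignments, shared by table 11).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate("ATGGGCGTCC TGGATATCTT AACGTCTCTC"))  # prints: MGVLDILTSL
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow comparison against the complete protein sequence and the GC content field.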