Gene Pars_1615 details


Gene Information

Locus tag: Pars_1615
Symbol:
ID: 5055017
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1456582
End bp: 1457625
Gene Length: 1044 bp
Protein Length: 347 aa
Translation table: 11
GC content: 55%
IMG OID: 640469156
Product: hypothetical protein
Protein accession: YP_001153821
Protein GI: 145591819
COG category:
COG ID:
TIGRFAM ID:
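
The length fields in this record can be cross-checked against its coordinates. A minimal sketch, assuming the start and end values are 1-based inclusive positions (which is consistent with the listed gene length) and that the unspliced CDS ends in a single stop codon; the variable names are illustrative only:

```python
# Values copied from the record above.
start_bp = 1456582
end_bp = 1457625

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted in the protein length.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1044 0 347
```

Under those assumptions the result matches the listed 1044 bp gene length and 347 aa protein length.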


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.273235
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 37
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGATAACTA AGATTTGCCT AGAATATTTC AAAGGCGTCA AAAGCGGCTG TGTCCAGCTC 
GGCCGCGGGA CGGTGGTGTT CGGCCCTCCA AACTCTGGGA AGACCACGTA TTTTGAGGCT
GTTGCCCTTC TTGTCCAGAG CAGAGGCGAG CAGTGGCTGG CCCTGGAAGG CCCCCTACTA
ATTGTGCACG AGGCGGAAGA CCTCCACCAC GGCGGCGATT TGGAAACCCC ATTCACGATA
GAGCTGTCGG TTATGTTAGA AGGCGGGGAG GTTGTATACG GCTATCGATA CGCCGCAGGT
CCCAACTATG TCGAGCAGTG GGTCAAGAAA GACGGCGAAC TACTTGTAAG ACTTGTTAAA
AAGGGAGATG GCGGCGTGAT GACGCATCCT ACCGAGGCGA GGCTGTGCGT GGCGCCATTT
GCCGTTATGA ACGAGGATGT TTTGATAGCC TGCGACCCCG TGGAGGATGA GAGGTTTAGA
ATGGCTGAAA GAGCGTTGCT GGAGTTGAGA ATAGGGCTGA AGGACAAGTT CTACCTAATC
AGCGGGAGGA GGCTGGCGGC GTGGAAGTAC ACCTACGAGA CACATGTGGA TTTAATGCCA
CAGACTAGCG TAGGCCCCGA GGGCCAGTTC ACCCCCCACC ACCTCTCGCG GATCTTGACT
CTCCCCTCCT ACGAGGCTGT GAGGGAACAG CTCTATGAGT ACCTGCACGT CGCAGACGTA
GAGGATATCC GCGTCGGCCT CATCAAAAGC GGGCGCATAG CGCTGTATGT AAGAAGAGGC
GGGCTATGGA CAAATATATA CAACGCAGGG AATTACACCA AGGCGGTCCT CCCAGCCCTG
TTACAGCTCC TCTTGGCCAA CGAGGGGTCT ACGGTGTTTA TAGACGATGC AGACCTTGCC
GTGCCCAGCG ACAAGTCTGA AACGTTGTTG TCAGCTATGG CGGAAATAGC ACAGAGGAGG
CATCTGCAGC TGGCGGTCTC GGCCAAAGAG CCGGGCTTCG CCAAGGTAGC AGAAAAATTG
GGGCTTGTAG TTGAGTCTTT GTGA
 
Protein sequence
MITKICLEYF KGVKSGCVQL GRGTVVFGPP NSGKTTYFEA VALLVQSRGE QWLALEGPLL 
IVHEAEDLHH GGDLETPFTI ELSVMLEGGE VVYGYRYAAG PNYVEQWVKK DGELLVRLVK
KGDGGVMTHP TEARLCVAPF AVMNEDVLIA CDPVEDERFR MAERALLELR IGLKDKFYLI
SGRRLAAWKY TYETHVDLMP QTSVGPEGQF TPHHLSRILT LPSYEAVREQ LYEYLHVADV
EDIRVGLIKS GRIALYVRRG GLWTNIYNAG NYTKAVLPAL LQLLLANEGS TVFIDDADLA
VPSDKSETLL SAMAEIAQRR HLQLAVSAKE PGFAKVAEKL GLVVESL
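
The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such text could be joined back into one string and summarized, using a short fragment assembled from three blocks of the gene sequence rather than the full gene; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of any tool used to build this record:

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a non-empty DNA string."""
    return sum(base in "GC" for base in dna) / len(dna)


# Illustrative input only: the first two blocks and the final block
# of the gene sequence above, not the complete gene.
example = "ATGATAACTA AGATTTGCCT\nGTGA"
dna = clean_sequence(example)
print(len(dna), round(gc_fraction(dna), 3))
```

Applying the same two helpers to the full gene sequence would give its total length and GC fraction for comparison with the values listed under Gene Information.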