Gene Pars_1060 details

Gene Information

Locus tag: Pars_1060
Symbol:
ID: 5055925
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 944121
End bp: 946229
Gene Length: 2109 bp
Protein Length: 702 aa
Translation table: 11
GC content: 60%
IMG OID: 640468616
Product: hypothetical protein
Protein accession: YP_001153290
Protein GI: 145591288
COG category:
COG ID:
TIGRFAM ID: [TIGR01451] conserved repeat domain
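
The start, end, gene length and protein length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the coordinates are 1-based and inclusive and that the coding sequence ends in a stop codon that is not counted in the protein length; the function names are hypothetical and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the CDS is a whole number of codons; the terminal stop codon,
    if present, does not contribute a residue.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


length = gene_length(944121, 946229)
print(length)                  # 2109
print(protein_length(length))  # 702
```

Both values match the Gene Length and Protein Length fields listed in this record.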


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.655402
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 33
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTCGGA ACATATCACT GGCGGTCATC GCCTTTGCGG CGGTACTCGC CTACGCCCAA 
GCAGTGGTCG GACTCGCTTC TTCTGTGGAC TACCTCTGGC TTGGCGCCAA GTGGAGCCAA
GTACCTAAGT TCCCCGGAGA CGTTGGGGTT GTGACCCTCT CCTTTTATGT GTCAAGCCAA
TACGTCGACG TTACCATCTC CCTAGACCCT AAGTGCGGCT ACGTGGCCCC TCTTGAGGAC
GTTAGGCTCC CCTCCGCCGG CCCAGGAGTG GTGTCGGCAA ACCTCAAGGT ACTAGCCTAC
GCGCTGAACG TCACGTGCCC CGCAAACGCC ATATTCAACG CCAGGTATAA GGCAGTGGGC
GGCTCTCTGA CAGACGGCGT CACCAATGTT GAGTACGTCT CGCTCTACGT CCCGCCGTAC
CCGACGTACG ACGTCTCAGC AAGGGGCACG GCATACCTGG GCATGCCCAG CAGGATAACG
CTAGTTTTCA AAAGCCCCTA CTCCACGGCC TCCACCGCAA CGGTGCAGGG CCAGGGGGTT
AGGGTTCTTT CCCCCTCCGG CCAGTTCTCC GTAAACGGGA CCTATGCAGA AGTGCCGCTG
GTCGTCATCG CCGACTCGCC CTCTGCCTCG TTGTTGGTGT CTGTGCAGTC CCGGGACTGG
CTGGGCAACC CGGTGGCGCT GACGTACACA GTGCCTATCG CCGCGGCTCC CGCCCCGCCT
TCGGTTATGT ACGTATCGCC CACCGCCTTG TCGCTTAACA AGTACAACAA GGTGAACGTC
ACCATACAGC TCCCCGTTGA GGCTGACGGC ACGGCTGTAA TCGCGGCGGC AGGCGCAGTC
ATGCCGCAGT CAAGCATAAC TATACCCATA AGCCGGGGGA GGGGCTACGC CGTGCTGGAG
GTCTACCCCG TCTTCTCAGT TGTGACCTTC ACTGCACAAG TGACATACCA AGTCACCGGC
GTGGCCAAGA CGGAGCAGAT CTCGGCGTCG GCAGCTACCC AGCAGACCAT AGGCGGCTTG
GCCAAGGTGG AGGTAAGGCC CCCAAGGCTG ATGGCCGGCG TGGCCAATAA CGTAACGCTA
TCCGTCTCGG CGCCGGGCGC CTTCAACGTC TCAGTCGCTG TAGCGAACGC CGCCGTCGAC
AAGCCACAGC CCTACTACTT CGGCGGGGTA GACAAGGCAA CTGCCAGCCT CCTGGTGACG
CCTTTGTCGA GCCAACCTGT CACCTTCACT GTGACGGTGT ATCACAGCGC CGGGACAGAT
CAGTACACCA TCACCCTTCC GGTCACTTCG GCAAGCATAT TCACAGTAAT ACCGAATCCG
TCTTTGGTTA AGTCTGGCGG AAACCGCACA GTTGTCGTTA CCGTGATAAA CAGCGGCGAC
GTGGCTGTGC AGAAGGCGGT GGTCACAATC TCCCCCGCCA CGTCGAACGT GGTGGCCTCC
ACCTACACCT TCCAGCTAGG GAGGGTGGCG CCGCTTGAAA GCGTCCAGCT CCCCATATCC
TTCATAGTCC CCGCCACCTA CAGCGGCGCC ATGGCCTTTA CCTACAACAT AATCTACACC
ACAGAGCTCG GCACCACCGG CTCGTCCCAG GGCACCTTCT ACCTACAAGC CTTACAAAGC
CCAGCGGTCA ACGTCACCTC CGTAACCGTC GTGCCGCAGT TCCCGGAGGC CCGCAGGACT
TTCTACATCT CCGTGACGGT GGTGAACAAG GGCTTCGCCC CAGTGACCAA CCTCCAGGTT
GAGGCCAGCC CGCCGCGGGG CATTAGGCCG GTCACGGCGC CTATATACTT CGCCGGCCAG
CTCGACCCCC AGCAGACGGC CAACATACCC CTGAGCTTCA ACGCCACTGC GCCGGGCCAG
TACCAAATAC CCCTCGTCAT ATCCTACACG GATCAGTACG GTAACTTCTA CACAATCCCG
TACACGGTCA CTGTGACAGT CTCCAACGGG ACGCGACTCT TCGGTACTAT CCGTCAGGGA
ACGCCGATTC AGGGCGGATC AAGTCAGCCA GGGCAAGGCG GCACGATAGT CGGGGCGGGG
GTTGCGATGG TGGTTGCGGC GGCTGTTGTC GCCGTTTTGT ACATGAGGCG GAGAGCCAAG
AGGTCATGA
 
Protein sequence
MIRNISLAVI AFAAVLAYAQ AVVGLASSVD YLWLGAKWSQ VPKFPGDVGV VTLSFYVSSQ 
YVDVTISLDP KCGYVAPLED VRLPSAGPGV VSANLKVLAY ALNVTCPANA IFNARYKAVG
GSLTDGVTNV EYVSLYVPPY PTYDVSARGT AYLGMPSRIT LVFKSPYSTA STATVQGQGV
RVLSPSGQFS VNGTYAEVPL VVIADSPSAS LLVSVQSRDW LGNPVALTYT VPIAAAPAPP
SVMYVSPTAL SLNKYNKVNV TIQLPVEADG TAVIAAAGAV MPQSSITIPI SRGRGYAVLE
VYPVFSVVTF TAQVTYQVTG VAKTEQISAS AATQQTIGGL AKVEVRPPRL MAGVANNVTL
SVSAPGAFNV SVAVANAAVD KPQPYYFGGV DKATASLLVT PLSSQPVTFT VTVYHSAGTD
QYTITLPVTS ASIFTVIPNP SLVKSGGNRT VVVTVINSGD VAVQKAVVTI SPATSNVVAS
TYTFQLGRVA PLESVQLPIS FIVPATYSGA MAFTYNIIYT TELGTTGSSQ GTFYLQALQS
PAVNVTSVTV VPQFPEARRT FYISVTVVNK GFAPVTNLQV EASPPRGIRP VTAPIYFAGQ
LDPQQTANIP LSFNATAPGQ YQIPLVISYT DQYGNFYTIP YTVTVTVSNG TRLFGTIRQG
TPIQGGSSQP GQGGTIVGAG VAMVVAAAVV AVLYMRRRAK RS
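
The sequences in this record are displayed in blocks of 10 characters separated by spaces. A rough sketch of how the gene sequence could be cleaned up and its GC content computed is given below. The helper names are hypothetical, and the example runs only on the first displayed line of the gene sequence, so its result is not expected to equal the GC content reported for the full gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(blocked.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First displayed line of the gene sequence from this record.
first_line = "ATGATTCGGA ACATATCACT GGCGGTCATC GCCTTTGCGG CGGTACTCGC CTACGCCCAA"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_percent(seq), 1))
```

To check the reported GC content, the same two functions would be applied to the complete gene sequence rather than to a single line.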