Gene Pars_1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1204 
Symbol 
ID5054328 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1090341 
End bp1092326 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content62% 
IMG OID640468751 
Producthypothetical protein 
Protein accessionYP_001153424 
Protein GI145591422 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1331] Highly conserved protein containing a thioredoxin domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.139299 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACAGAG AGGCGTGTTT GAGGAGGTCT ACGTCGCCCT TTGTGTTGGA TGGGCTGAGG 
AGCAAGGTGC AGTGGTGGGC GTGGTGCGAG GAGGCTTTCC AGAAGGCGAA GGCGGAGGAC
AAGCCTATAT TAGTGGACGT GGGCGCCGTC TGGTGTCATT GGTGCCACGT TATTGACGAG
ACGACGTACA ACGATGACGA GATTGCCGAT ATCATAAACA AGCATTTCGT GCCGATTAAG
GTAGATCGGG ACGAGAGGCC AGACGTAGAC CGCCGACTGC AAGAATATGC GGTTTTAGTT
AGCGGCCAGT CTGGTTGGCC TCTCACCGTC TTCATGACGC CGGAGGGGGA GGTGATCTGG
GCCGCCACGT ATCTTCCGCC GAGGGACTAC GGCGGCTTGC CGGGGATGGC CAAAGTGCTG
AGGGCTGTTC TGGAGGCGTA CAGGACGAAG AAAGGCGATA TCAAGAAGAT GGCGGAGGAT
CTCTCTAAGG AGATAGCGGC GTGGCACAAC CCTTCGGAGG CCGAGCTTGA CCGCTCCGTC
CAACTTGACA TACTGGCGTC GCTGGCCGCC TCCTTCGACG AGGAGTACGG CGGCTTCGGC
ACGGCGCCTA AGTTCCCGCC GATCACCCAG CTGGATCTGT TGTTGTTGCG GCATTTCTAC
GACGGGAAGT CGGTCTACGG TAAGATGGCC CATGCGACCT TGAGGGCCAT GGCGCGAGGA
GGGGTCTACG ACCAGCTTGG TGGCGGCTTC TTCCGCTACT CCACTGACCG CTTGTGGCTT
ATCCCCCACT ACGAGAAGCT CCTAGTAGAC AACGCAGAGC TGTTGTCGCT CTACGCCAGG
GCATATGCCC ACTTCGGCGA CCAGCTGTAT AGAAAAACGG CGGCGGGGAT CATCAAGTGG
CTCGACGAAT TCATGCGCGA CCCGGGCGGC GGATACTACG CCAGCCAAGA CGCCGACGTA
GACGGGGAGG AGGGCGCCTA CTACCGCTGG ACGGAGGACG AGCTTAAGGA GACCCTGGGC
GATCTCTTCC CCAAAGCGGC TGATATGTTT GGCCTATACG AATTTAAGTG GCCCGAGGGG
CGGGCTACCC TAAGCATAGT TAGGGTTGTG CCGGAAGCCG ACTTGATCCT TGAGAGGCTG
GCGGAGGCCC GCAAGGCGAG GAAGCCACCG AGGGTGGACA CCACGATTTA CGCCGGTTGG
AGTTGCGCCA TGGCTAAGGC GGAGCTGGAG GCGAGCCGCC TGGCGGGGAT AGGGGACAAG
GAGTTCGCCT TGAAGACTCT TGACAAGATC AGGAGGGAGG CGTGGGACGG CTCGAGGCTG
GCCCGCGGGC TTAGGGGCGG GGGGCCCGTG GGGGAGGGAG TTCTGGAGGA CTACGCCTAC
TGCGCCTTAG CCGCGCTGGA GGCCTACTCC CACACCGGCA GATACCTGGA CTGGGGCGTA
GAGGTGGCGG GGGCGATGGT GGATAGGTTC CTAGACCAGG GAGGGTTTAG AGATGTGGAG
AGGCCAGACC CCGTGTTGAA GACGCCGCAC TACCCCGTGG CTGATACACC CAACTACTCG
GGGAACGCCC TGGCCATATT GGCGTGCGAC CTTCTGCACT ACGCCACGGG TATCCGCAAG
TTTAGAGACG CGGCGGAGAG GGCTCTGAAG GCGCTGGCGG GCAAGCTGGC GAGGCTAGGG
CCCTCCGCCG CCGGGTTGGC CATCGCCCTG GACGCCCACT TGGCCGAGCC TCCCCGGACG
GTGGTTGTGG GCTCTGCCGA GGAGCTTCTG AGGGCGGCCC TTGCGGCGTA CCGCCCCCTG
CACGTGGTGA TGCCTGTGGC AAGCGGCTGG GACTACCCCG AGCCCTCCAT AAAGGCCATG
CTGGCGGGGC CGAAGCCGGC GGCCTACGTC TGCGCTTGGG GGGCTTGCTC CATGCCGATA
TCCGACCCGG GGAGGCTGGG GGAGGCCGTT AGGAAATTTA GGAGGGAGGC CTACGGGCTA
GAATAG
 
Protein sequence
MDREACLRRS TSPFVLDGLR SKVQWWAWCE EAFQKAKAED KPILVDVGAV WCHWCHVIDE 
TTYNDDEIAD IINKHFVPIK VDRDERPDVD RRLQEYAVLV SGQSGWPLTV FMTPEGEVIW
AATYLPPRDY GGLPGMAKVL RAVLEAYRTK KGDIKKMAED LSKEIAAWHN PSEAELDRSV
QLDILASLAA SFDEEYGGFG TAPKFPPITQ LDLLLLRHFY DGKSVYGKMA HATLRAMARG
GVYDQLGGGF FRYSTDRLWL IPHYEKLLVD NAELLSLYAR AYAHFGDQLY RKTAAGIIKW
LDEFMRDPGG GYYASQDADV DGEEGAYYRW TEDELKETLG DLFPKAADMF GLYEFKWPEG
RATLSIVRVV PEADLILERL AEARKARKPP RVDTTIYAGW SCAMAKAELE ASRLAGIGDK
EFALKTLDKI RREAWDGSRL ARGLRGGGPV GEGVLEDYAY CALAALEAYS HTGRYLDWGV
EVAGAMVDRF LDQGGFRDVE RPDPVLKTPH YPVADTPNYS GNALAILACD LLHYATGIRK
FRDAAERALK ALAGKLARLG PSAAGLAIAL DAHLAEPPRT VVVGSAEELL RAALAAYRPL
HVVMPVASGW DYPEPSIKAM LAGPKPAAYV CAWGACSMPI SDPGRLGEAV RKFRREAYGL
E