Gene Pars_1811 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1811 
Symbol 
ID5056001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1626100 
End bp1628484 
Gene Length2385 bp 
Protein Length794 aa 
Translation table11 
GC content52% 
IMG OID640469357 
ProductSMC domain-containing protein 
Protein accessionYP_001154014 
Protein GI145592012 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value0.305534 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAGGA TAGAGAGGAT TGAGCTGGAA AACTTCCGCT CCTACAAGGG GAGGCATGTT 
GTGAGCCTAG GCGATGTGAA CATACTGTGG GGGAGAATAG GGGCTGGTAA GACATCTCTC
CTATACGCAG TTGAGTACGC CCTCTACGGG AGGCAATTGG AGGTCAAGGA AAGAGTCGCC
AAGTTGCTAG ACCTAATCAA CGTCGAAGCT CACGAGATGC GAGTCTCCCT CGTCCTAAGC
GATGGTGGGA GATTGCTCGA GATAGAAAGG CGCCTAGGGA GGCGCGGCGA CGAAAAGATA
GTGGTGAGAA TCAACGGGGA AGAGCTCCGG GGGAGAGAGG CGGAAAGGAG GCTGGAGGAG
CTGTTAGGCG CCGACGAGGA TATCTACGAA AGACTCATCT ACATATCGCA TAGGACCCTA
GAGGGCTTTA TCTACGGCAC TTCGCAGAAG AGGTCGCTAA CAGTAGATAG GCTCTTCGGT
ATCGATGTGA TAGATAGCGT TGTCCGGGTT GTTTCGTCTG TTGAGAAGTC CCTTTTGGCC
AAGGCCGAGG AACTAAGGGG GAGGCTTGCG GCGTATGAGA AGTATAAGGA GGTGATCAGA
AGATACGGCG GTTTCGCCAG CGTCAAGAAG CGCCTAGAAG ACTTAACAAA AGAGGTAGAA
ACCCTGAAGG AGAGGGAAGC GGCGCTGTCT AGAGACGCAG AGGACTTAGC AAGAAAAAGA
GCGGAGCACT TAGCCAAGCT GAGGGAGCAC GAGTCTATGC TCCTTGAGTA CTACAAGACA
AGATCGGAAC TAGAGGTTTT AGAAAGCGCT ACAGAGGGGG GGACTTTTGA CCAGTCAACA
GTGGAGAGGC TGAGAGATGC GCTCAGAGAA GCCGTAGAGG AATTTGAACA TGTCGTGGGC
GAGGAGCTTG CCGAGAAGCT CTCAAAAGCC GGCGATGTGG AGTCGCTGTC TATTGCCATG
ATGGAGGCGT ACAACGCCTT GTCGAGGTTG CAAAACGAAC TAGAAGCGCA GATTGCAGAA
GCGAAAAAAC TCTACGAACA ATACTCGGCA AGAGTTAGGA AAATTGATGA GGAGATAGCC
GAGGCACGGG AAAGGCTAAA AAGACTTGAG AAGTACTACG CTCGTTTCAA GGAGTTACAG
AAGACTATCC AGTCAGTGGA TTCGGCTCGC GCCTCGCTGG CGGAGCTTAG AAGACAGATT
CAGACGTTGG AAAGAGAGGT TTCTTACTCC TCCGCGTTGA GAGTAGTGGC GCTCTACGCC
GCGGAGACTG GGGCAGAAAA GTGCCCCATA TGCGGCGCTC CTGTGCGGAA AGAGGATCTC
CTTCGCCGTG TGGAGGAGGT GGAGGCTAGA CACGGAACTC TCATCAAGGA GGTGGAGGAG
CTGAAGGAGA GGGCTTCTCA GCTGGAAAAA GCCGTGGAAG AGGCGGAGGC GCTAAGCGGC
GAGGTCGCGG AGTATCTAGC CGTGAAGACT AGATTAGAGG AGCTTGAAAC CGAGAGGGAA
GAGGCAGCAA AGAAGGCGTT GCAAGCAGAG AAATCCTGGA AACAGCTTGA GAAGAAGACA
GAAAGACTAC GCTTGTTGCT GGCTAGAGTT GATAGAAGAA CTATATCCGA TACCCTCGCC
AAGTATGGAA GGGCTGTAAG GATTAGAGAG TTGCGTAAGC GTTTAAGAGA AATCGAGGAC
TCGTTGAAAA AGATCGGGAT CACGAGCGAT GTCATAAACA TTGATGTAAA CTGGAGAGAG
GTCGTAGAGG AGCTTGAAAA AACCTCGCGC AAACTCGCCG AGGCATATAG AGAGAAAGCC
CTCCTAGAAG AAGTTGTGAG AGAAGTCGGC GAAGACGCCG AGGCGTTAAA AAAGAAACTG
GACAACACCC TGTACGCCTA TGGCAAGTTG CAGGAGGTTA AGGCTAAGCT GGAGCTTGCC
AAGATCAACG CGAGGGCGCG GCTCCTTGAG GTGGCTAGGA GCAGATTCAA CGAGGTATTC
ACCTCTCTGT ACAGATATGG CGACATCGTA AGGGTTGATG CCGACTTAGA ACAGCGCAGA
GGCTTCTACG ACTTCCACGC CATAACTCCC ACCGGGGATA GATATGGAAT CTCTAAGCTC
AGCGACGGTC AGCGACTCTC CATAGCCCTT GCCCTCGCGC TGGCGCTCCG GGACATCGCA
AGAATAAACT TGGGATTTGT AATCTTCGAC GAGCCGATCC CCTACGTGGA TGTGAACATA
AGAAGAGCCT TTGCCGAAGT GGTCAAGGCG CTGGCGAGCC GTTTCCAGAT AGTTGTCGCT
ACCCAGTCGA GGGAATTCGC GGAGATGATA AAAGAAGCGG TTCCCAACGC GCTCCTCTTC
AAGGTGGATA AAAAAGAGAG CTCCGAGGCA GTAGTGGAGT CCTAG
 
Protein sequence
MWRIERIELE NFRSYKGRHV VSLGDVNILW GRIGAGKTSL LYAVEYALYG RQLEVKERVA 
KLLDLINVEA HEMRVSLVLS DGGRLLEIER RLGRRGDEKI VVRINGEELR GREAERRLEE
LLGADEDIYE RLIYISHRTL EGFIYGTSQK RSLTVDRLFG IDVIDSVVRV VSSVEKSLLA
KAEELRGRLA AYEKYKEVIR RYGGFASVKK RLEDLTKEVE TLKEREAALS RDAEDLARKR
AEHLAKLREH ESMLLEYYKT RSELEVLESA TEGGTFDQST VERLRDALRE AVEEFEHVVG
EELAEKLSKA GDVESLSIAM MEAYNALSRL QNELEAQIAE AKKLYEQYSA RVRKIDEEIA
EARERLKRLE KYYARFKELQ KTIQSVDSAR ASLAELRRQI QTLEREVSYS SALRVVALYA
AETGAEKCPI CGAPVRKEDL LRRVEEVEAR HGTLIKEVEE LKERASQLEK AVEEAEALSG
EVAEYLAVKT RLEELETERE EAAKKALQAE KSWKQLEKKT ERLRLLLARV DRRTISDTLA
KYGRAVRIRE LRKRLREIED SLKKIGITSD VINIDVNWRE VVEELEKTSR KLAEAYREKA
LLEEVVREVG EDAEALKKKL DNTLYAYGKL QEVKAKLELA KINARARLLE VARSRFNEVF
TSLYRYGDIV RVDADLEQRR GFYDFHAITP TGDRYGISKL SDGQRLSIAL ALALALRDIA
RINLGFVIFD EPIPYVDVNI RRAFAEVVKA LASRFQIVVA TQSREFAEMI KEAVPNALLF
KVDKKESSEA VVES