Gene Pars_2070 details


Gene Information

Locus tag: Pars_2070
Symbol:
ID: 5055993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1850392
End bp: 1852248
Gene length: 1857 bp
Protein length: 618 aa
Translation table: 11
GC content: 62%
IMG OID: 640469619
Product: molybdopterin dinucleotide-binding region
Protein accession: YP_001154268
Protein GI: 145592266
COG category: [C] Energy production and conversion
COG ID: [COG0243] Anaerobic dehydrogenases, typically selenocysteine-containing
TIGRFAM ID:
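
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length includes the terminal stop codon; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(1850392, 1852248)
print(length)                           # 1857
print(expected_protein_length(length))  # 618
```

Both values agree with the gene length and protein length fields listed for Pars_2070.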


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.360567
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0416579
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGACTTC TAGCCTGCAC CCGGGACTGC TACGACACGT GCATTTTCCA CGTGGTGAGA 
GATGGGGAGA TACGCCTAGT CCCAATCAGC GATTTTCCCA CTCTAGGCTT CACCTGCGCC
CGGGGCATGG CAGATGTGCG GAGGCTAAAC TCGCCGAGGA GGATAAAGAC GCCGTTGCTC
AGAGGAGAGC GGCAAGTCGT GGAGGCGAGC TGGGGCCGCG CCCTTGGGGA GCTGGCGGCG
AGGATCAAGG AGGCGGACCC CCAACGCGTA ATACACATAG ACTACGACGG CAACCAAGGC
CTCCTCACCT GGTACTACCC CGCAAGGCTC TTCAACGCCC TGGGGACAGC CTCTACCGAC
TACTCCATCT GTAGCGCCGA GGGGCACGAG GCTATAAAGC TACACTGGGG CCGCTCCTAC
GGCGCCATGC CGGAGGAGCT GGGGAGGAGG CCCGTGGTGT TCTGGGCGCT TGACGCCTCC
ACGTCGTTTA TCCACGGCTG GGCCCTCGCC AAAAGGGGTA GAAACCCCAC TGCCGCCGTA
GACGTGGTGT GGACCAGGAC CATGAAGGCA GTGGATCTGC CAGTGCTGGT GCGGCCTGGG
ACAGACGTGG TTTTGGCCCT CGGCGTCGCC AGGGAGATAA TTGAGAGGGG GGCCTACGAC
AGGGAGTTCG TGGAGAAATA CACCTACGGC TTCCACCTCT TTAGGGAGTA CGTCCAGAAG
TTCACTCCTC AGTACGTCGA GGATGAGGCC GGCGTCCCCC GGGATATCTT CTACAAGCTG
GTAGACATCT ATCTGAGAAG GCCCGTGACG GTTATAGGCT TTGCCATTGG GAGAACCGAG
AACGGCGGCG ACGCCGCTAG GGCCATATCG CTAATACACG CCCTTCTGGG AGACCCCGCC
GGCTTCTACT ACTCCAACTC GGGGGCCTGG GGCATCGACT TCGCCTACCT GCGCGGGTTG
CACGTGGCTA AGCCGAGCAG GGTAGTCCCT ATGGGGGTTG TGGGCGGCGT CATCGAGGAG
TTTAGGGTGG TGTACGTCTG GAACGCCAAC CCCGTCCTCA CGCTCCCCCA GGGAGACAGA
ATCGCCAAGG CGGCGGAGAG GGGCGACATA ACCCTCGCCG TGCATGCCCC GTTGCTGGAC
GAAACCGCCG AGGCGGCGCA CATCGTGTTG CCGGCGCCGC TGTACTTGGA AAAAGACGAC
GTGATCTACA GCTACTGGCA CAACTACCTC GTCTACAACG CCGCAGTGGC TGAGCCCCCC
GGGGATGCGA GGAGGGAGAC CTGGGTCGTG AAGAAGCTCG CCGAGCTTCT GGGAGTTGGC
GACCACCCCC TCTTGCGGGA AGACCCCTGG GACGCCGTGG ATATTGCCAT AAGGGGAACC
GGCGTTACCT TAAAAGAGCT GAGGGAGCGC CAGCTGGTTA AGCTCAAGGC GCCGGACTAC
TATAAGTTCC CCACGGCTAC GGGCAAGGTG GAGTTCTACA GCGCCACGGC GGAGCGCCGA
GGCTTGCCGC CGCTTCCCCA GTACGCGCCG CCAAGGAGGG GCTACGTCTT GACCTTCCCG
CCCCATACCC TATACACCAA TAGCCAGTTT AGAGACGTCT ACGGGGAGCC TGAGCCCGCC
GTGTTGGTAA ACCCAAGCGA CTACGTGGGC GACTGCATTG TACTGTACAA CGAGGCGGGG
GAGGTGAGGG TAAGAGCTAG GCCCAGCCCA GAAGTGCCCC GCGGCGTAGT CGCCTATCTG
GGCATCGGCA AGGACCTCCG GGGGGAGCCC ATAAACAAGA TAGCAAGAGG CGAGCCGGGG
CCCTACGGAG GCACCCCCAA GCTCTACACT ACCTATGTAC AAATGAGACC ATGTTAA
 
Protein sequence
MGLLACTRDC YDTCIFHVVR DGEIRLVPIS DFPTLGFTCA RGMADVRRLN SPRRIKTPLL 
RGERQVVEAS WGRALGELAA RIKEADPQRV IHIDYDGNQG LLTWYYPARL FNALGTASTD
YSICSAEGHE AIKLHWGRSY GAMPEELGRR PVVFWALDAS TSFIHGWALA KRGRNPTAAV
DVVWTRTMKA VDLPVLVRPG TDVVLALGVA REIIERGAYD REFVEKYTYG FHLFREYVQK
FTPQYVEDEA GVPRDIFYKL VDIYLRRPVT VIGFAIGRTE NGGDAARAIS LIHALLGDPA
GFYYSNSGAW GIDFAYLRGL HVAKPSRVVP MGVVGGVIEE FRVVYVWNAN PVLTLPQGDR
IAKAAERGDI TLAVHAPLLD ETAEAAHIVL PAPLYLEKDD VIYSYWHNYL VYNAAVAEPP
GDARRETWVV KKLAELLGVG DHPLLREDPW DAVDIAIRGT GVTLKELRER QLVKLKAPDY
YKFPTATGKV EFYSATAERR GLPPLPQYAP PRRGYVLTFP PHTLYTNSQF RDVYGEPEPA
VLVNPSDYVG DCIVLYNEAG EVRVRARPSP EVPRGVVAYL GIGKDLRGEP INKIARGEPG
PYGGTPKLYT TYVQMRPC
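
The sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of how a block of the gene sequence could be cleaned, translated, and summarised by GC fraction. The helper names are hypothetical. The codon table is written out in the standard NCBI ordering (bases T, C, A, G); it is assumed here that translation table 11 assigns the same amino acids to codons as the standard code and differs only in which start codons are allowed.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block-formatting whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


first_line = "ATGGGACTTC TAGCCTGCAC CCGGGACTGC TACGACACGT GCATTTTCCA CGTGGTGAGA"
print(translate(first_line))  # MGLLACTRDCYDTCIFHVVR
```

The translated first line matches the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the GC content field.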