Gene Pars_0526 details

Gene Information

Locus tag: Pars_0526
Symbol:
ID: 5055769
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 474260
End bp: 475225
Gene Length: 966 bp
Protein Length: 321 aa
Translation table: 11
GC content: 61%
IMG OID: 640468088
Product: Rieske (2Fe-2S) domain-containing protein
Protein accession: YP_001152773
Protein GI: 145590771
COG category: [P] Inorganic ion transport and metabolism; [R] General function prediction only
COG ID: [COG2146] Ferredoxin subunits of nitrite reductase and ring-hydroxylating dioxygenases
TIGRFAM ID:
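
Note on the length fields: the gene and protein lengths agree with the coordinates if Start bp and End bp are read as 1-based, inclusive positions and if the protein length excludes the terminating stop codon. The record does not state either convention, so both are assumptions here. A minimal Python sketch of that arithmetic (the function names are illustrative and not part of the record):

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the stop codon


length = gene_length_bp(474260, 475225)
print(length)                     # 966
print(protein_length_aa(length))  # 321
```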


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 30
Fosmid unclonability p-value: 0.814974
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCAGA TCCCGAGGGA AAAGATAGAC GAGTTTGAGT TCGCGCTGTA TGAAAGCTGG 
GTGGAGTGGT TCAGGCGGGA ACCTGGCATA AACGGCTATA GGCGGTTCAT AAACCAGAAG
CTTGTCGAGG AGATGACGGT GATACAGCTG GTGCGGGCCG CCATAGGCAG GCCGGAGCTC
ATGGGGGACC CCGACATATA CCACAAGATG GGGTGGATTG TGGATTGGCA CGAGAGCTCG
CTGTATCGCC GCTGGTTCGT GGCCTCCAGG TCGCTGGGGC TCAACGGGTA TAAGCTGGAC
AAGAAGAGGG CGGAGCTTCT GGACTCCGTG GTGGGGACCG CCAAGGGGGA TATAAAGAGG
GGGCTCCTCC TCCTGCGGAC TGCCCTTAAA TACGTCCACA CTGGCTACCG AGCGGCGGCG
TATTGGGCCA ACAACCGGCA CATAGCCCAG CTGGCATACA TGGCGTACAT GAGGGAGGAG
GACGAGATAA AGGTGGTGGA CGACGCCCTG CAGTTCATGG GAGTGGCCAG ACCCGACAGA
GCGGGTATGG ACATGCTGGA CTTCAACACC CTAGTGCCCA TCTTCTTCGA CTTCTTCCAG
CCGCCTGAGG CCGCCGACGA CGGGGGCCCC AAGCTGGAGG CCCCGGAGTG GGAGAGGGTC
GCCACGGTGG ATGAGCTCAG GCAACTGGGT AAGAAGATGG CCGTGGTGGG GCTGTGGCGG
GAGGTCCTGC TGGTGCCCGT CGACGGCGGA GTCGCGGCCT ACGAGAACTG GTGCACCCAC
GAGAGGGACC CCCTGCACTA CGGCTACATC CAGGGGAAGC AACTCATCTG CCTCGGCCAC
CACGCCACCT TCGACGTCAG AACGGGTAGG GTGATTCTGC ATCCCAACCA CGGCGAAGCC
AGGGTGTTGC CCAAGTACCA GGTTAAGGTG GAGGGAGGCG TGGTGTACGT CAGGGTGCCA
TGGTGA
 
Protein sequence
MSQIPREKID EFEFALYESW VEWFRREPGI NGYRRFINQK LVEEMTVIQL VRAAIGRPEL 
MGDPDIYHKM GWIVDWHESS LYRRWFVASR SLGLNGYKLD KKRAELLDSV VGTAKGDIKR
GLLLLRTALK YVHTGYRAAA YWANNRHIAQ LAYMAYMREE DEIKVVDDAL QFMGVARPDR
AGMDMLDFNT LVPIFFDFFQ PPEAADDGGP KLEAPEWERV ATVDELRQLG KKMAVVGLWR
EVLLVPVDGG VAAYENWCTH ERDPLHYGYI QGKQLICLGH HATFDVRTGR VILHPNHGEA
RVLPKYQVKV EGGVVYVRVP W
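
Note on cross-checking the sequences: the gene sequence can be compared with the protein sequence and the GC content field by stripping the block spacing, counting G and C, and translating codon by codon. The following is a minimal Python sketch, not the method used to generate this record. The helper names are illustrative. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code; the alternative start codons that table 11 permits are not handled, which does not matter here because the gene begins with ATG.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Amino acid assignments
# are identical for the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(text: str) -> str:
    """Remove the spaces and line breaks used for block formatting."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a cleaned DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 1 and stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, as formatted in the record.
head = clean("ATGTCCCAGA TCCCGAGGGA AAAGATAGAC")
print(translate(head))  # MSQIPREKID
```

Passing the full gene sequence through `clean` and then `gc_percent` or `translate` gives values to compare with the GC content and protein sequence fields.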