Gene Pars_1213 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1213 
Symbol 
ID5054426 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1098560 
End bp1099810 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content57% 
IMG OID640468760 
Producthydrogensulfite reductase 
Protein accessionYP_001153433 
Protein GI145591431 
COG category[C] Energy production and conversion 
COG ID[COG2221] Dissimilatory sulfite reductase (desulfoviridin), alpha and beta subunits 
TIGRFAM ID[TIGR02064] sulfite reductase, dissimilatory-type alpha subunit 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGGTTA AGCAACAAAG ACCCCTAACT GTAGAGGAGT GGGCCGCCGA GGGGATGGAA 
TTGCTTAAGT ACTACGAAAA AGGGCCTTGG CCTAGCCACG TCTCCGAGCT GAAGAAGACA
AAGTACCCCA TCGAGGCCTA CGCGGCGGGG CTGGCGGCCC GGAAGACCAT GTGGGCGGCG
GGCTCGGCGA AGATTAGGTA CGTCTACACC GGCTTTATAG CTAGGCGCAC CCGCGACGGC
AAGATCTCGG AGCTCCACTT CCGCGTCTGG CAGCCCTCCG GCTATCTCTA CAGCACGGAG
AAGTTGAAAA AGCTCATCGA GTTCACAGAG AAGTACGGCC TTGGCTTGAT AGAGCTCGCC
GGCCAGCAGG GCCAGATGAT AATCAGCATT AGGCCCGAGG TAGCCGACGA GGCGGTGGAC
TACCTCCGCG ACGTGGTAGG CACAGACGTA GGGGCCACCG GCGATACTAT TAGAGAGATA
GCGGCCTGCG TCGGTCCTGC CCTATGCGAA TACTCGCTCT ACGACACGCT CGAGGCGAGG
GACAAGTTCC TCACCCACCC TAAGATATAT GAGTGGATGA GCAACCAGTT ATTCCCCTTT
AAGTTCAAGG CCAAGTTCTC TGGCTGCCCC TTCGACTGCA CTAGGGCCGT CCACCGCGCC
GACTTCGGCT TTATCGGTGT TTGGGAGGGG GCGCCGGAGG TGGATCAAGA GGCTTTTAGG
AGGAAGGTGG AGGCTGGGGA GGTGGATCCG GAGAAGTTGG CCGCCAACTG CCCAAGCGGC
GCAATAACTT GGGATAATGA GAGGAAAGAG CTGAGAATAG ACGGCACTAG ATGCAAAAAG
TCAATGCACT GCATAAGAAC TGCCTTCCCA GCAATAAAGC CGGGGAAGAA CAGAAAAATT
GCAATTGTCG TGGGAGGCCA CGTAAAGGGG AGATTCGGCG GCAAGATGGG CAAGCCTCTA
GCCGTCGTGA ACTCGGTGGA CGAAGCCATG GATTGGGTGG TGAAGACGGT GGAGAGCTGG
ATGGAGCACA TGGAGAAGGG CGTGGTGAAG CACAAGGACC GAATAGGCGA CTTCATCATG
AAGGTGGGCT TCAAGAAGTA CGTCAACGAG ATACTTGGCC TAAAGGAGGT GGGCAAGCCT
TCTCTCCACC CATCGCTTAG GGCCGGCGCT GTGCTAGACG ACGAGGAGCG GAAGATGTAC
GCCGCCTGGG CCTCGAAGAT AGTGGAGGAG GTCTTCGGCC GGAGGGCATG A
 
Protein sequence
MEVKQQRPLT VEEWAAEGME LLKYYEKGPW PSHVSELKKT KYPIEAYAAG LAARKTMWAA 
GSAKIRYVYT GFIARRTRDG KISELHFRVW QPSGYLYSTE KLKKLIEFTE KYGLGLIELA
GQQGQMIISI RPEVADEAVD YLRDVVGTDV GATGDTIREI AACVGPALCE YSLYDTLEAR
DKFLTHPKIY EWMSNQLFPF KFKAKFSGCP FDCTRAVHRA DFGFIGVWEG APEVDQEAFR
RKVEAGEVDP EKLAANCPSG AITWDNERKE LRIDGTRCKK SMHCIRTAFP AIKPGKNRKI
AIVVGGHVKG RFGGKMGKPL AVVNSVDEAM DWVVKTVESW MEHMEKGVVK HKDRIGDFIM
KVGFKKYVNE ILGLKEVGKP SLHPSLRAGA VLDDEERKMY AAWASKIVEE VFGRRA