Gene Pars_2255 details


Gene Information

Locus tag: Pars_2255
Symbol:
ID: 5054274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2021165
End bp: 2022274
Gene length: 1110 bp
Protein length: 369 aa
Translation table: 11
GC content: 59%
IMG OID: 640469808
Product: radical SAM domain-containing protein
Protein accession: YP_001154453
Protein GI: 145592451
COG category: [R] General function prediction only
COG ID: [COG0535] Predicted Fe-S oxidoreductases
TIGRFAM ID:
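
The gene and protein lengths in this record follow from the start and end coordinates. The short sketch below reproduces that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the gene length includes a single terminal stop codon; both assumptions are consistent with the reported values but are not stated in the record. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 2021165
END_BP = 2022274


def gene_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int, stop_codons: int = 1) -> int:
    """Number of residues, assuming the gene length includes the stop codon(s)."""
    if gene_len % 3 != 0:
        raise ValueError("gene length is not a whole number of codons")
    return gene_len // 3 - stop_codons


length = gene_length(START_BP, END_BP)
print(length)                  # 1110
print(protein_length(length))  # 369
```

Both results match the "Gene length" and "Protein length" fields.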


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.209492
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value: 0.0610472
Fosmid hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
GTGAGGAACG TACAGGAATT AATCCGCAGA TTCCACGAGG CGCCGCTACT GGTCTTCTGG 
GAGTCGACAA AGGCCTGTCC CCTGGCCTGC AAGCACTGCC GAGCCGACGC CATACTCAAG
CCGCTTCCGG GCGAACTCAC TACCCAGGAG GGGAAGCGGC TTATAGAGCA GGTGGCCCAA
TTCGGCGACC CCAAGCCCCT TCTGATTATC ACGGGGGGAG ACCCCCTAAT GAGAGCCGAC
CTCTTCGAGC TGGTGGACTA CGCCAACTCG TTGTCAGTAC CTGTATCGCT GGCGCCCGCC
GTGTCGCCCA ACTTGACGCC AGAGGTGATG AAGGAGATGA AGCAGGCTGG GGTTAAGTCC
GTCTCCATCA GCCTAGACGG GGCGTTCCCA GAGACTCACG ACGAGCTTAG AGGCGTCCCC
GGGAGCTACA AGGAGACAGT CACAGCGATA AAAACCGCCG TGGAGATAGG GCTCCCCGTT
CAGGTAAACA CCGTCGTCTG GAAGAAATCG TTAGGCGAGC TCCCAGACGT GGCGTACCTC
CTCAAAAACC TCGGCGTCAA GATCTGGGAG GTTTTCTTCC TGATAGTCAC AGGCCGGGCA
AGGGAGGAGT TGGATATCAC GCCGGAGGAG TACGAGGCGG CGGTGCAGTT CCTAGTAGAC
GTGTCGACAT ACGGCTTTCA AGTCAGGACC GTTGAGGCCC CGTTCTATAG GAGAGCAAAG
CTGGAGAGGT TGGAAGGCAG GATCTACGAC CACCCCCTCT ACCTCCAGCT TGTGGATAAG
CTGAGAAAGC TCCTGGGGCC GCCTACCCGC GGCGTCGACC CCACAATTGT GCCGACTAGG
GACGGGTTCG GCATCATATT CGTCGCCTAC GACGGTACGG TGCACCCCTC CGGCTTTTTG
CCCTATCCCC TCGGCAACGT GCGGAGGCAG AGCCTAGTGG AGATATACCG CAACCACTCA
CTCCTCCAGA AAATGCGCAG GGGGGAGTTC GGTGGGAGAT GCGGCGTATG TAGGTACAAA
GACATCTGCG GCGGGTCCCG CGCCAGGGCC TTCGCATATT ACAAAGACCC CCTAGCCGAG
GACCCCGCAT GCATTTACAA GCCCACATAA
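
The record reports a GC content of 59% for this gene. A minimal sketch of how such a figure could be recomputed from the sequence is shown below. The helper names are hypothetical, and the sketch assumes the sequence is pasted with the same 10-base blocks and line breaks used above, which are stripped before counting.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "GTGAGGAACG TACAGGAATT AATCCGCAGA TTCCACGAGG CGCCGCTACT GGTCTTCTGG"
print(len(clean_sequence(first_line)))  # 60
print(gc_percent(first_line))
```

Passing the full gene sequence to `gc_percent` gives a value that can be compared with the rounded figure in the record.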
 
Protein sequence
MRNVQELIRR FHEAPLLVFW ESTKACPLAC KHCRADAILK PLPGELTTQE GKRLIEQVAQ 
FGDPKPLLII TGGDPLMRAD LFELVDYANS LSVPVSLAPA VSPNLTPEVM KEMKQAGVKS
VSISLDGAFP ETHDELRGVP GSYKETVTAI KTAVEIGLPV QVNTVVWKKS LGELPDVAYL
LKNLGVKIWE VFFLIVTGRA REELDITPEE YEAAVQFLVD VSTYGFQVRT VEAPFYRRAK
LERLEGRIYD HPLYLQLVDK LRKLLGPPTR GVDPTIVPTR DGFGIIFVAY DGTVHPSGFL
PYPLGNVRRQ SLVEIYRNHS LLQKMRRGEF GGRCGVCRYK DICGGSRARA FAYYKDPLAE
DPACIYKPT
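
The protein sequence can be checked against the gene sequence by translating it with translation table 11, the table named in the record. The sketch below is a hand-rolled illustration, not the database's own method. It relies on two facts about table 11: its codon assignments are the same as the standard code, and GTG is one of its permitted initiation codons, so the leading GTG is read as methionine. The function name `translate` is illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiation codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate(raw: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene, then a toy sequence ending in its last four codons.
print(translate("GTGAGGAACG TACAGGAATT AATCCGCAGA"))  # MRNVQELIRR
print(translate("GTG AAG CCC ACA TAA"))               # MKPT
```

The first output matches the first ten residues of the protein sequence above, and the closing codons AAG CCC ACA TAA give the final residues KPT followed by the stop.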