Gene Pars_0383 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0383 
Symbol 
ID5056184 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp332660 
End bp333802 
Gene Length1143 bp 
Protein Length380 aa 
Translation table11 
GC content59% 
IMG OID640467950 
Productradical SAM domain-containing protein 
Protein accessionYP_001152637 
Protein GI145590635 
COG category[R] General function prediction only 
COG ID[COG0535] Predicted Fe-S oxidoreductases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000167403 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGGAGAC GGCGGATTGC CGAACGCCGC GATGGCTTGC CCCTCCGTTG GGCACAGCAT 
AGGGCCGGGG CTGTGTTGTG GTTTGTCTTG ACCACCGGCG CTTGCAACCT TGCGTGTAAG
TACTGCGGGG GGTCTTTCAG CAATAAGCAT TCGCCGTGGC GACCCAGAGC GGCCCCAGGG
GAGGTCGCCA AGTTCATAGC GTCTAGGGAC TCCTCGCCTG TGGTGTTCTT CTACGGCGGT
GAGCCGTTGC TAAACCCCCA ATACATCGCC AAGGTGATGG ACGCGTTGGG CAACGCGAGA
TTTGGCATAC AGACCAACGG CACCCTCGTC AAGCGGCTTC CCACAGAGCT GTGGCGGCGC
TTCTCCACTG TCCTACTCTC TATTGACGGA CCTAAAGAGG TTACTGACTA TTACAGGGGG
GTCGGGGTGT ACGACAAGGT TGTGGACGCT TTGCGCTGGC TTAAAAACGA GGTGGGGTGT
CGGTGCAAGG TAATAGCTAG GATGGCGGTG TCCAAGAGGT CGCAACTGTA CCGCGACGTG
GTCCACCTCC TCGGGCTGGG GTTCGACGCT GTTCACTGGC AGCTGAACGT TATATGGACG
GAGGAGTGGG GGCCGGCCGA GTTTTTGAAG TGGGCTGAGG AGAGCTACCT CCCCGACGTG
GGGAAGCTCA GGGATCTCTT CCTCGCCGAG GCTCGGCGGG GACGTGTCCT GGGGATTATC
CCATTCCTCG GCATATACCG CGCGCTTCTG GTAAGGCCTT ACGACTGGGT GCCCTGCGGC
GCGGGGAAAC ACTCCTTTGC TATAAACACA GACGGCCGCG TACTCCACTG CCCCATAGCC
GTATCTGAGA AGTGGGCCAC CGCCGGCCAC ATCAAGATCG GCATTAAAAA CGGTGTCAAG
CTCAAGGACA AGTGCCTCAC CTGCGAGTAC CGCCATGTCT GCGGCGGCCG TTGCCTCTAC
ACCCACTACG AGGATTACTG GGGCGAGGAG GGGTTTGACG CTGTGTGCCA GGTGACAAAA
CGGACCATAC GGATACTAGA AGAGGGGGCC CCTATTCTGG CAGAGCTGAT GCGAGGTGGG
GTGGTTAGGA AGGAGGACAT CGACTACGAC CCGCTTCTGG ACTCCACGGA GGTTATACCA
TAA
 
Protein sequence
MRRRRIAERR DGLPLRWAQH RAGAVLWFVL TTGACNLACK YCGGSFSNKH SPWRPRAAPG 
EVAKFIASRD SSPVVFFYGG EPLLNPQYIA KVMDALGNAR FGIQTNGTLV KRLPTELWRR
FSTVLLSIDG PKEVTDYYRG VGVYDKVVDA LRWLKNEVGC RCKVIARMAV SKRSQLYRDV
VHLLGLGFDA VHWQLNVIWT EEWGPAEFLK WAEESYLPDV GKLRDLFLAE ARRGRVLGII
PFLGIYRALL VRPYDWVPCG AGKHSFAINT DGRVLHCPIA VSEKWATAGH IKIGIKNGVK
LKDKCLTCEY RHVCGGRCLY THYEDYWGEE GFDAVCQVTK RTIRILEEGA PILAELMRGG
VVRKEDIDYD PLLDSTEVIP