Gene Pars_1947 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1947 
Symbol 
ID5054777 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1747171 
End bp1748289 
Gene Length1119 bp 
Protein Length372 aa 
Translation table11 
GC content57% 
IMG OID640469493 
Productradical SAM domain-containing protein 
Protein accessionYP_001154146 
Protein GI145592144 
COG category[R] General function prediction only 
COG ID[COG1313] Uncharacterized Fe-S protein PflX, homolog of pyruvate formate lyase activating proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones39 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTGGGCTT TGTATAGGCC AGATGCCTTG GCCGTGTGGC AGAATCCCGT GGTAAGGGAG 
CGGTTGAAGT GGTATTACTT AGTCATGCGC GATGAGGCGC CGGCCAAGTA CCACATAGCC
GCAAGGATTG AGGCGCCGGC CGACTATAGA TTACTGGGGG ATAGCGAATT GTGGAAGTTG
CACGACAAGC TGGGAAAAGC GTTTGATGAT GAGTGGAGCC GCCAGAAGGA GAGACCTGAC
CCGTCTCTGG CGAAAAAGGA GCTACCACAA GCGTCTTTTC TCGACGTCAA GACAGAGCTG
GCGAGGAGGC AATTAAAGCG CTGTATTTTG TGCGAGCGGC GTTGCGGCGT GGACAGGACC
TCCCGGCGGG GCGCGTGCCT CCTCGACGCC AAGCCCCGCG TGGCGAGCTT CTTCCACCAC
CTAGGGGAGG AGGCCCCTCT AGTCCCGTCT GGGACTATCT TCTTTGCGGG TTGTAACTTC
AGGTGCGTCT ACTGCCAGAA CTGGGATATC TCACAAGATC CCGAGGCCGG CGCAGAGGCC
TCGCCGGAGG CCCTAGCCGC AATTCAGGTT AGGCTCAGGG AAGAGGGGGC GCGCAATATA
AACTGGGTGG GCGGCGAGCC GACGCCGAAT ATACCATTTA TCCTTGAGTC GTTGAGAATA
TTGGCAAGAC GCGGAGTAAA TGTCCCCCAG CTGTGGAACT CCAACATGTA CCTAACGCCG
GAGGGTTTGG CCCTGATACT CCACGTTATG GACATATGGC TCCCCGACTT TAAGTACGGC
AACGACGCAT GCGCCTTGAG GTACTCGGTG GCCCCCCGCT ACTGGGAGGT GACGACGAGG
AATTTTTCCG TTATATGCAG TAGGGGAGAG GATATCATAG TCCGCCACTT GGTGCTCCCG
GGGCACGTCG ATTGTTGCAC AAAGCCAGTG CTGAGGTGGC TTGCAGAGAA CTGTAAACAC
GCCTTAGTTA ATATCATGGA TCAGTACAGG CCCGAGCACC TCGTGGTTAA GCTAGATCGC
TATAGGGAGA TTAGGCGTAG GGTTTCCCAA AAGGAGATGG ACGAGGCATA TATGTACGCG
GACTCTCTTG GCTTGGCGTG GCGCGAGGTT AGCCGGTAG
 
Protein sequence
MWALYRPDAL AVWQNPVVRE RLKWYYLVMR DEAPAKYHIA ARIEAPADYR LLGDSELWKL 
HDKLGKAFDD EWSRQKERPD PSLAKKELPQ ASFLDVKTEL ARRQLKRCIL CERRCGVDRT
SRRGACLLDA KPRVASFFHH LGEEAPLVPS GTIFFAGCNF RCVYCQNWDI SQDPEAGAEA
SPEALAAIQV RLREEGARNI NWVGGEPTPN IPFILESLRI LARRGVNVPQ LWNSNMYLTP
EGLALILHVM DIWLPDFKYG NDACALRYSV APRYWEVTTR NFSVICSRGE DIIVRHLVLP
GHVDCCTKPV LRWLAENCKH ALVNIMDQYR PEHLVVKLDR YREIRRRVSQ KEMDEAYMYA
DSLGLAWREV SR