Gene Pars_1848 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1848 
Symbol 
ID5055317 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1652989 
End bp1654803 
Gene Length1815 bp 
Protein Length604 aa 
Translation table11 
GC content52% 
IMG OID640469394 
Productthiamine pyrophosphate binding domain-containing protein 
Protein accessionYP_001154051 
Protein GI145592049 
COG category[C] Energy production and conversion 
COG ID[COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits 
TIGRFAM ID[TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.154193 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAACC TTATATTCTA TATACATCTA AATACTGTGA AAATCCTCCT TCTAGGCAAC 
GAGGCAATTG CTTACGGTGC GCTGGCTGGA GAAGTAGCCG TTGCGACGGC GTATCCCGGT
ACTCCCTCAA CAGAAATATT AGAAACGCTG GAGGAATTTA GGGACAGGTT TGTTCACTGG
GCTACCAATG AGAAAACAGC CTTGGAGATA GCATATGGTG CTGCGGTGGC TGGCGCAAGG
GCGCTTGTAG CAATGAAACA CGTGGGGCTA AACGTAGCCG CAGACCCGCT CCACAGCGCC
GCCTATACCG GTGTTGAGGG CGGCTTAGTT GTGGTATCCG CTGACGACCC ATGGATGCAT
TCATCTCAAA ACGAACAAGA CACGAGGTGG TACGGCCTCC AGGCATACGT GCCTGTCTTA
GAGCCTTCCG ATCCCGCCGA GGCGTATAGA TATGCTAAGA CCGCCTTTGA ACTAAGTGAG
AAGTTGAAAC ACCCTATAAT TCTGAGGAGT GTCACCCGGG TGAGCCACGT AAGAGCCCCA
GTGGAGGTGG AGCCTCCTTC TCCTCCCAAG TGGGGTCGCT TCTCGAAAGA TATCAGTAGA
TTTAACTTAG TGCCTGCATA TGCAAGAGAG AGAAGAAAGG CCCTCGTTGA GAAGTGGGGA
ACTATAGGCG AAGTCAGCGC AGAGCTTATG CGTGTAGAGC CAGGCGGCCA TGTGACTATT
GTAACGTCGG GCGTTGCATA TAACTACGTC AAGGAGGCCG TAAGGCTACT GAATATCGAT
GCTACAATAA TAAAGCTGGG GATGCCAGTG CCAATACCGC CTAAAATAAG AGAACTAGTT
AAAGGCACAG TTGTAGTGGT AGAAGAAGGC GACCCAATTG TCGAGACCCA GTTAAGAGCC
CTTCGCCTAG AGACAAAGGG CAAAATTGAC GGCTATTTCC CAAAATACGG CGAGCTCAAT
ACTAGGAAAG TGGCGGAAGG TATCGCCAAG GCGTTAGATA TTCCCTACAA TCCGCCCCAG
CCGCCAAAGT CCCCGATTGA GGCGCCGCCG AGGCCGCCGG TGTTATGCCC AGGTTGTCCT
CACATGGGCA CGTTCTACAT ACTCAGGCTG GCGACGGCTG GGCTAAACCC AGTGTGGTCG
GGAGATATAG GATGCTACTC CCTAGGTATA AACACAGGAC AGCAAGACTT AATCACGCAC
ATGGGATCTT CAGTAGGGCT TGGGATGGGC GTCGCCGTAG CGTCGAAACA GTTCGTTGTC
GCTACGGTGG GCGACTCAAC TTTCTACCAC GCAGTTCTCC CCCAGCTGAT AGACCTAGCC
ACAAAGAAGG TACCCCTCCT TGTCGTCGTA ATGGACAACG CCTACACGGC CATGACAGGA
GGCCAGCCCA GCCCCAGTAG GCTAATACCG CCTGAGAAAA TCGCGGAGAC GTTTGGAATA
CCCGCCTTTG TCATAGATCC TGCTGATATC AAAACCTCTA TAGAAGTGAC CAAGAGAGCT
GTTGAAATTG TTAAAAGCGG GAGGCCGGTC CTCGTAGTCT CAAAGAGGCC TTGTGTGCTT
GTTGCGACGA GAAAGGCCAG GAAGGCGGGT GTGGCAATTC CGAAGTACAA GGTTGAGCCG
GAGAAGTGCA TAGGATGCGG GATATGTTAT AATCTCTTGA AGTGTAGCGC CATCCAGGCT
AGGCCTGATC GCAAGGCATA TATCGATCCT GCCCTGTGTG TTGGGTGCGG TATGTGCGCC
GAGGTGTGTC CTGTCGACGC CATCAAGGGT GACGGTGCAC GAGTGAAATG GCTAGAGGTA
TGGCAACAAG CCTAG
 
Protein sequence
MKNLIFYIHL NTVKILLLGN EAIAYGALAG EVAVATAYPG TPSTEILETL EEFRDRFVHW 
ATNEKTALEI AYGAAVAGAR ALVAMKHVGL NVAADPLHSA AYTGVEGGLV VVSADDPWMH
SSQNEQDTRW YGLQAYVPVL EPSDPAEAYR YAKTAFELSE KLKHPIILRS VTRVSHVRAP
VEVEPPSPPK WGRFSKDISR FNLVPAYARE RRKALVEKWG TIGEVSAELM RVEPGGHVTI
VTSGVAYNYV KEAVRLLNID ATIIKLGMPV PIPPKIRELV KGTVVVVEEG DPIVETQLRA
LRLETKGKID GYFPKYGELN TRKVAEGIAK ALDIPYNPPQ PPKSPIEAPP RPPVLCPGCP
HMGTFYILRL ATAGLNPVWS GDIGCYSLGI NTGQQDLITH MGSSVGLGMG VAVASKQFVV
ATVGDSTFYH AVLPQLIDLA TKKVPLLVVV MDNAYTAMTG GQPSPSRLIP PEKIAETFGI
PAFVIDPADI KTSIEVTKRA VEIVKSGRPV LVVSKRPCVL VATRKARKAG VAIPKYKVEP
EKCIGCGICY NLLKCSAIQA RPDRKAYIDP ALCVGCGMCA EVCPVDAIKG DGARVKWLEV
WQQA