Gene Information |
Locus tag | Pars_1848 |
Symbol | |
ID | 5055317 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1652989 |
End bp | 1654803 |
Gene Length | 1815 bp |
Protein Length | 604 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640469394 |
Product | thiamine pyrophosphate binding domain-containing protein |
Protein accession | YP_001154051 |
Protein GI | 145592049 |
COG category | [C] Energy production and conversion |
COG ID | [COG4231] Indolepyruvate ferredoxin oxidoreductase, alpha and beta subunits |
TIGRFAM ID | [TIGR03336] indolepyruvate ferredoxin oxidoreductase, alpha subunit |
|
|
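The coordinate and length fields above can be cross-checked against each other with simple arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (the record does not say so explicitly, but its Gene Length field is consistent with that reading). The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1652989
end_bp = 1654803
protein_length_aa = 604

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length_bp = end_bp - start_bp + 1

# A CDS should be a whole number of codons; the final codon is the stop codon
# and does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
coding_codons = codons - 1

print(gene_length_bp)  # 1815
print(remainder)       # 0
print(coding_codons)   # 604
```

Under that assumption the three fields agree: 1815 bp is 605 codons, and 604 of them encode amino acids.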
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.154193 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAAACC TTATATTCTA TATACATCTA AATACTGTGA AAATCCTCCT TCTAGGCAAC GAGGCAATTG CTTACGGTGC GCTGGCTGGA GAAGTAGCCG TTGCGACGGC GTATCCCGGT ACTCCCTCAA CAGAAATATT AGAAACGCTG GAGGAATTTA GGGACAGGTT TGTTCACTGG GCTACCAATG AGAAAACAGC CTTGGAGATA GCATATGGTG CTGCGGTGGC TGGCGCAAGG GCGCTTGTAG CAATGAAACA CGTGGGGCTA AACGTAGCCG CAGACCCGCT CCACAGCGCC GCCTATACCG GTGTTGAGGG CGGCTTAGTT GTGGTATCCG CTGACGACCC ATGGATGCAT TCATCTCAAA ACGAACAAGA CACGAGGTGG TACGGCCTCC AGGCATACGT GCCTGTCTTA GAGCCTTCCG ATCCCGCCGA GGCGTATAGA TATGCTAAGA CCGCCTTTGA ACTAAGTGAG AAGTTGAAAC ACCCTATAAT TCTGAGGAGT GTCACCCGGG TGAGCCACGT AAGAGCCCCA GTGGAGGTGG AGCCTCCTTC TCCTCCCAAG TGGGGTCGCT TCTCGAAAGA TATCAGTAGA TTTAACTTAG TGCCTGCATA TGCAAGAGAG AGAAGAAAGG CCCTCGTTGA GAAGTGGGGA ACTATAGGCG AAGTCAGCGC AGAGCTTATG CGTGTAGAGC CAGGCGGCCA TGTGACTATT GTAACGTCGG GCGTTGCATA TAACTACGTC AAGGAGGCCG TAAGGCTACT GAATATCGAT GCTACAATAA TAAAGCTGGG GATGCCAGTG CCAATACCGC CTAAAATAAG AGAACTAGTT AAAGGCACAG TTGTAGTGGT AGAAGAAGGC GACCCAATTG TCGAGACCCA GTTAAGAGCC CTTCGCCTAG AGACAAAGGG CAAAATTGAC GGCTATTTCC CAAAATACGG CGAGCTCAAT ACTAGGAAAG TGGCGGAAGG TATCGCCAAG GCGTTAGATA TTCCCTACAA TCCGCCCCAG CCGCCAAAGT CCCCGATTGA GGCGCCGCCG AGGCCGCCGG TGTTATGCCC AGGTTGTCCT CACATGGGCA CGTTCTACAT ACTCAGGCTG GCGACGGCTG GGCTAAACCC AGTGTGGTCG GGAGATATAG GATGCTACTC CCTAGGTATA AACACAGGAC AGCAAGACTT AATCACGCAC ATGGGATCTT CAGTAGGGCT TGGGATGGGC GTCGCCGTAG CGTCGAAACA GTTCGTTGTC GCTACGGTGG GCGACTCAAC TTTCTACCAC GCAGTTCTCC CCCAGCTGAT AGACCTAGCC ACAAAGAAGG TACCCCTCCT TGTCGTCGTA ATGGACAACG CCTACACGGC CATGACAGGA GGCCAGCCCA GCCCCAGTAG GCTAATACCG CCTGAGAAAA TCGCGGAGAC GTTTGGAATA CCCGCCTTTG TCATAGATCC TGCTGATATC AAAACCTCTA TAGAAGTGAC CAAGAGAGCT GTTGAAATTG TTAAAAGCGG GAGGCCGGTC CTCGTAGTCT CAAAGAGGCC TTGTGTGCTT GTTGCGACGA GAAAGGCCAG GAAGGCGGGT GTGGCAATTC CGAAGTACAA GGTTGAGCCG GAGAAGTGCA TAGGATGCGG GATATGTTAT AATCTCTTGA AGTGTAGCGC CATCCAGGCT AGGCCTGATC GCAAGGCATA TATCGATCCT GCCCTGTGTG TTGGGTGCGG TATGTGCGCC GAGGTGTGTC CTGTCGACGC CATCAAGGGT GACGGTGCAC GAGTGAAATG GCTAGAGGTA 
TGGCAACAAG CCTAG
|
Protein sequence | MKNLIFYIHL NTVKILLLGN EAIAYGALAG EVAVATAYPG TPSTEILETL EEFRDRFVHW ATNEKTALEI AYGAAVAGAR ALVAMKHVGL NVAADPLHSA AYTGVEGGLV VVSADDPWMH SSQNEQDTRW YGLQAYVPVL EPSDPAEAYR YAKTAFELSE KLKHPIILRS VTRVSHVRAP VEVEPPSPPK WGRFSKDISR FNLVPAYARE RRKALVEKWG TIGEVSAELM RVEPGGHVTI VTSGVAYNYV KEAVRLLNID ATIIKLGMPV PIPPKIRELV KGTVVVVEEG DPIVETQLRA LRLETKGKID GYFPKYGELN TRKVAEGIAK ALDIPYNPPQ PPKSPIEAPP RPPVLCPGCP HMGTFYILRL ATAGLNPVWS GDIGCYSLGI NTGQQDLITH MGSSVGLGMG VAVASKQFVV ATVGDSTFYH AVLPQLIDLA TKKVPLLVVV MDNAYTAMTG GQPSPSRLIP PEKIAETFGI PAFVIDPADI KTSIEVTKRA VEIVKSGRPV LVVSKRPCVL VATRKARKAG VAIPKYKVEP EKCIGCGICY NLLKCSAIQA RPDRKAYIDP ALCVGCGMCA EVCPVDAIKG DGARVKWLEV WQQA
|
| |
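The sequences above are printed in space-separated blocks of ten characters. The sketch below shows one way to strip that formatting, compute a GC fraction, and translate a nucleotide fragment. It is a hedged illustration and not the database's own method. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and the amino acid string is the standard 64-codon row in TCAG order, which translation tables 1 and 11 share (table 11 differs in its permitted start codons, which this sketch does not model). Only the first three and the last two blocks of the gene sequence are used, so the GC value printed is for that short fragment and not for the whole gene.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, ordered by first, second, third base in TCAG order.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks, and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons from the first base; '*' marks a stop codon."""
    seq = clean(seq)
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        first, second, third = (BASES.index(base) for base in seq[i:i + 3])
        protein.append(AMINO_ACIDS[16 * first + 4 * second + third])
    return "".join(protein)


# First three blocks and last two blocks of the gene sequence in this record.
gene_start = "ATGAAAAACC TTATATTCTA TATACATCTA"
gene_end = "TGGCAACAAG CCTAG"

print(translate(gene_start))    # MKNLIFYIHL
print(translate(gene_end))      # WQQA*
print(gc_fraction(gene_start))  # 0.2
```

Both translated fragments match the start (MKNLIFYIHL) and the end (WQQA) of the protein sequence listed in the record, with the final TAG read as the stop codon.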