Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0540 |
Symbol | |
ID | 5054789 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 484388 |
End bp | 486157 |
Gene Length | 1770 bp |
Protein Length | 589 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640468102 |
Product | electron transfer flavoprotein, alpha subunit |
Protein accession | YP_001152787 |
Protein GI | 145590785 |
COG category | [C] Energy production and conversion |
COG ID | [COG2025] Electron transfer flavoprotein, alpha subunit [COG2086] Electron transfer flavoprotein, beta subunit |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.448343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.108603 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATTTTTA TCGCCCTTTT TAAACAAATT CCTGACATAG GCCACGTCAA GATCGACCAA TCCACTAAGC GCCTTATACG CGAGGGCGTT CCCAACATCT TAAACCCATT TGATTACCAC GCGGTTGAGG CCGCCTTGGC GTTGAGAGAT AAACTCGGGG GAAAGGCCAT TGCCATCACC ATGGGCCCGC CGCATTTTAA ACAAAGCGCC GATGAGGTGT TGGCCATGGG CGTAGACGCC GTTATACACC TATCCGATAG GGCTTTCGCC GGATCTGACA CGTTGGCCAC TTCAAGAGCT TTGGCGTTAG CCGTACGGAA ATTCGCCGGC AAGGAGCTGG GCGCTATTTT CGCCGGCAAG TACTCATGGG ATGGCGAGAC AGGCCATGTG GGTCCTCAAG TGGCCGAAAT GCTGGGCTTG GCGCACGTAT CTGGAGTCGC GTCAATTGAG ATGGAGGGTT TAACGGCTGT GGTAGACAGA GAGGCGGAGG ACGGGGTTGA GAAAATACGC GTTGACCTCC CCGCGGTTTT CACCGTAACC GACAGGACGA ATAGCCCGAG GCCCCCCGGG AGGGCGAGGG GCGAGTATAT AGTCATTAGC GCCTCTGAAT TAACAGACAA CACAAGCCTC TTTGGATCTG AGGGCTCCCC CACCTATGTA GCTGATTTGA GAGAGGAGCC TTTAGAGAGG GAGAATAGAG TTTTAATAGA CGCCAGGGAA AGGCCCGAGC TCGGCGTTGA GGCCATCCTT GAGTATATAA AAAAGGCGTT GGCGGAGGGC TCTGGCGAAT CCCTACGCCA GGCCCCTCCA TCGCCGTCAA AAGGCGGGCC TGAGATCTAC GTCTTGGCCG AGGAGGGGCT CAGCGGCATA AAGAGGGTTT CCTACGAGCT CTTAGGTAAA GCGGTGGAAC TCGCCGAAAT GCTAGGAGGC TCTGTGACGG CGATTTACGG AGGGGAGGAG AAGGCGGAGG AGCTTATAGC CCGGGGGGCG GATAAAGTGA TACTCCTGAG AGGCGCCGAT CCAAGAGACT ATATAGCTCA CGCAGAGTCC TTGAGCCGTC TTGTGCTAAA TAGAAGGCCT TGGGCAGTTG TAGCGCCCTC CACTTCTTAT GGAAAAGACG TCTTAGCTAG AGTGGCCGCG AGACTGGGCC TTGGCTTAAC TGCAGATTGT ATAGATCTAA AGGTGGAGAA CGGCAAATTG GCGCAATTTA AACCTGCCTT CGGCGGCTCG ATAGTCTCGA TAATATATTC TAAGACGTAT CCTCAAATGG CGACTATCCG CCCCGGGATA TTCCAGCCGC TAGAGCCTAA TTATAACCGA AGTGGGGCTG TGGAAGAGGT AAGAATCTCC CCAAGGCTCA CGATTTTAGA AAAACGGGGA ATAGAATTCG AGTTGCCCGA TCCGCAACAT GCGAGGATTG TAGTGGGAGT CGGCATGGGT TTTAAAAAGA AGGAGAACGT CCAAATGGCC ATTGATCTGG CCAAGGCATT AGGCGGCGCC GTGGCGGCCA CTAGGAACGT AGTTCTTAGA GGCTGGTTGC CGTATTATGT ACAAGTGGGC GTGTCGGGCA AGGCCGTTGC CCCCCATTTA TATATAGCTC TTGGGATACG AGGTGATATA AATCATCTAG TCGGCATCCG CAAGGCAAGA CATATTATTG CAGTAAATAT AAATAAAAAT GCTGATATTT TTAAAATAGC TAATTTGGGC GTCATTGGAG ATATATTTAA AATTGTTCCT CTACTTATAG AAAGAATAAA GAAGATGTAA
|
Protein sequence | MIFIALFKQI PDIGHVKIDQ STKRLIREGV PNILNPFDYH AVEAALALRD KLGGKAIAIT MGPPHFKQSA DEVLAMGVDA VIHLSDRAFA GSDTLATSRA LALAVRKFAG KELGAIFAGK YSWDGETGHV GPQVAEMLGL AHVSGVASIE MEGLTAVVDR EAEDGVEKIR VDLPAVFTVT DRTNSPRPPG RARGEYIVIS ASELTDNTSL FGSEGSPTYV ADLREEPLER ENRVLIDARE RPELGVEAIL EYIKKALAEG SGESLRQAPP SPSKGGPEIY VLAEEGLSGI KRVSYELLGK AVELAEMLGG SVTAIYGGEE KAEELIARGA DKVILLRGAD PRDYIAHAES LSRLVLNRRP WAVVAPSTSY GKDVLARVAA RLGLGLTADC IDLKVENGKL AQFKPAFGGS IVSIIYSKTY PQMATIRPGI FQPLEPNYNR SGAVEEVRIS PRLTILEKRG IEFELPDPQH ARIVVGVGMG FKKKENVQMA IDLAKALGGA VAATRNVVLR GWLPYYVQVG VSGKAVAPHL YIALGIRGDI NHLVGIRKAR HIIAVNINKN ADIFKIANLG VIGDIFKIVP LLIERIKKM
|
| |