Gene Pars_0540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0540 
Symbol 
ID5054789 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp484388 
End bp486157 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content52% 
IMG OID640468102 
Productelectron transfer flavoprotein, alpha subunit 
Protein accessionYP_001152787 
Protein GI145590785 
COG category[C] Energy production and conversion 
COG ID[COG2025] Electron transfer flavoprotein, alpha subunit
[COG2086] Electron transfer flavoprotein, beta subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.448343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.108603 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATTTTTA TCGCCCTTTT TAAACAAATT CCTGACATAG GCCACGTCAA GATCGACCAA 
TCCACTAAGC GCCTTATACG CGAGGGCGTT CCCAACATCT TAAACCCATT TGATTACCAC
GCGGTTGAGG CCGCCTTGGC GTTGAGAGAT AAACTCGGGG GAAAGGCCAT TGCCATCACC
ATGGGCCCGC CGCATTTTAA ACAAAGCGCC GATGAGGTGT TGGCCATGGG CGTAGACGCC
GTTATACACC TATCCGATAG GGCTTTCGCC GGATCTGACA CGTTGGCCAC TTCAAGAGCT
TTGGCGTTAG CCGTACGGAA ATTCGCCGGC AAGGAGCTGG GCGCTATTTT CGCCGGCAAG
TACTCATGGG ATGGCGAGAC AGGCCATGTG GGTCCTCAAG TGGCCGAAAT GCTGGGCTTG
GCGCACGTAT CTGGAGTCGC GTCAATTGAG ATGGAGGGTT TAACGGCTGT GGTAGACAGA
GAGGCGGAGG ACGGGGTTGA GAAAATACGC GTTGACCTCC CCGCGGTTTT CACCGTAACC
GACAGGACGA ATAGCCCGAG GCCCCCCGGG AGGGCGAGGG GCGAGTATAT AGTCATTAGC
GCCTCTGAAT TAACAGACAA CACAAGCCTC TTTGGATCTG AGGGCTCCCC CACCTATGTA
GCTGATTTGA GAGAGGAGCC TTTAGAGAGG GAGAATAGAG TTTTAATAGA CGCCAGGGAA
AGGCCCGAGC TCGGCGTTGA GGCCATCCTT GAGTATATAA AAAAGGCGTT GGCGGAGGGC
TCTGGCGAAT CCCTACGCCA GGCCCCTCCA TCGCCGTCAA AAGGCGGGCC TGAGATCTAC
GTCTTGGCCG AGGAGGGGCT CAGCGGCATA AAGAGGGTTT CCTACGAGCT CTTAGGTAAA
GCGGTGGAAC TCGCCGAAAT GCTAGGAGGC TCTGTGACGG CGATTTACGG AGGGGAGGAG
AAGGCGGAGG AGCTTATAGC CCGGGGGGCG GATAAAGTGA TACTCCTGAG AGGCGCCGAT
CCAAGAGACT ATATAGCTCA CGCAGAGTCC TTGAGCCGTC TTGTGCTAAA TAGAAGGCCT
TGGGCAGTTG TAGCGCCCTC CACTTCTTAT GGAAAAGACG TCTTAGCTAG AGTGGCCGCG
AGACTGGGCC TTGGCTTAAC TGCAGATTGT ATAGATCTAA AGGTGGAGAA CGGCAAATTG
GCGCAATTTA AACCTGCCTT CGGCGGCTCG ATAGTCTCGA TAATATATTC TAAGACGTAT
CCTCAAATGG CGACTATCCG CCCCGGGATA TTCCAGCCGC TAGAGCCTAA TTATAACCGA
AGTGGGGCTG TGGAAGAGGT AAGAATCTCC CCAAGGCTCA CGATTTTAGA AAAACGGGGA
ATAGAATTCG AGTTGCCCGA TCCGCAACAT GCGAGGATTG TAGTGGGAGT CGGCATGGGT
TTTAAAAAGA AGGAGAACGT CCAAATGGCC ATTGATCTGG CCAAGGCATT AGGCGGCGCC
GTGGCGGCCA CTAGGAACGT AGTTCTTAGA GGCTGGTTGC CGTATTATGT ACAAGTGGGC
GTGTCGGGCA AGGCCGTTGC CCCCCATTTA TATATAGCTC TTGGGATACG AGGTGATATA
AATCATCTAG TCGGCATCCG CAAGGCAAGA CATATTATTG CAGTAAATAT AAATAAAAAT
GCTGATATTT TTAAAATAGC TAATTTGGGC GTCATTGGAG ATATATTTAA AATTGTTCCT
CTACTTATAG AAAGAATAAA GAAGATGTAA
 
Protein sequence
MIFIALFKQI PDIGHVKIDQ STKRLIREGV PNILNPFDYH AVEAALALRD KLGGKAIAIT 
MGPPHFKQSA DEVLAMGVDA VIHLSDRAFA GSDTLATSRA LALAVRKFAG KELGAIFAGK
YSWDGETGHV GPQVAEMLGL AHVSGVASIE MEGLTAVVDR EAEDGVEKIR VDLPAVFTVT
DRTNSPRPPG RARGEYIVIS ASELTDNTSL FGSEGSPTYV ADLREEPLER ENRVLIDARE
RPELGVEAIL EYIKKALAEG SGESLRQAPP SPSKGGPEIY VLAEEGLSGI KRVSYELLGK
AVELAEMLGG SVTAIYGGEE KAEELIARGA DKVILLRGAD PRDYIAHAES LSRLVLNRRP
WAVVAPSTSY GKDVLARVAA RLGLGLTADC IDLKVENGKL AQFKPAFGGS IVSIIYSKTY
PQMATIRPGI FQPLEPNYNR SGAVEEVRIS PRLTILEKRG IEFELPDPQH ARIVVGVGMG
FKKKENVQMA IDLAKALGGA VAATRNVVLR GWLPYYVQVG VSGKAVAPHL YIALGIRGDI
NHLVGIRKAR HIIAVNINKN ADIFKIANLG VIGDIFKIVP LLIERIKKM