Gene Pars_2277 details

Gene Information

Locus tag: Pars_2277
Symbol:
ID: 5055214
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 2038038
End bp: 2039039
Gene Length: 1002 bp
Protein Length: 333 aa
Translation table: 11
GC content: 59%
IMG OID: 640469829
Product: thiamine pyrophosphate binding domain-containing protein
Protein accession: YP_001154473
Protein GI: 145592471
COG category: [C] Energy production and conversion
COG ID: [COG1013] Pyruvate:ferredoxin oxidoreductase and related 2-oxoacid:ferredoxin oxidoreductases, beta subunit
TIGRFAM ID:
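
The coordinate and length fields in this record agree with each other, which a little arithmetic can confirm. The sketch below assumes that the start and end positions are 1-based inclusive coordinates and that the gene length counts the stop codon; the record does not state either convention, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming its length includes one stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length = gene_length(2038038, 2039039)
print(length)                  # 1002
print(protein_length(length))  # 333
```

Both results match the Gene Length and Protein Length values listed in the record.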


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00284034
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.307973
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGGTGG CGGAGGAGTA CCCCGGCTTG TACGAATACG CCAACTTCGA GCTACCGCAG 
GAGGAGCTAT TTTTGCCAGG CCATGGCCTA TGCGCTAGCT GTACAATAGG AGTAATCGCT
AGGCATATGT TAAAGGTGCT GGGGCCTGAC ACCATTGTCG TAAACCCCAC GGGGTGCGCC
GAAGTGTCCA CAGTGGTCTA CCCCCGCACC AACTGGGCGG TGCCTTGGAT TCATGTCGCC
TTCGGCAACG GCGGCTCTGT AGCCTCCGGC ATAGAGGCGG CGATTAAGGT CTTGAAGAGA
AGGGGGGTGA TAGATCCCAA CAGGAAAATA AACATAGTGG TATTCGCAGG CGACGGCGGC
ACCGCCGACA TCGGCTTCCA AGCCCTCAGC GGCATGTTAG AGAGGGGCCA CAAGGTGATA
TACGTAATGT ACGATAACGA AGGCTACATG AATACGGGGA TTCAGCGCTC AGGTACGACC
CCCTTTGGCG CCTCCACCAC CACGGCCCCT GCGGGCAAGA AGGTGCCGGG AAACGTGACG
CACAAGAAGC CGATGGTGGC AATCGCGGCC GCCCACGGCA TCCCCTACGC CGCCACGGCT
AACCCTGCCT ATGTCCACGA TATGGTGTAC AAGTTCAAGA AGGCGGCGGA GGCAGACGGA
CCCGCCTTCC TCCACATCCT CCAGTCGTGT ACCCCGGGCT GGCGCTTCGA GCCGAAGTAC
GCAATTAGGG TGCTGGAGCT GGCCACCGAG ACGGGCTACT GGGTCAACTA TGAGATCGAC
CACGGCGAGT TCAGAGTCAC CGTTCCTGTT CCCAAGAGAA AGCCGGTGAA GTGCTTCCTT
CAGCTTCAGG GGAGGTTTAG GCATCTGAAG CCGGAGGAGA TAGACACCAT CCAGGCGCTG
ATTGACAAAG ACGTAGCGGA GATTAACCGG ATTGTGGGCA GGGAGGTGAT TGGGCCGGTG
GACCCCGGCC TAGAGTGCCT AACGCCTAGG GGGGCCCGGT AA
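
The gene sequence is printed in space-separated blocks of ten bases, so the blocks have to be joined before any computation. As a minimal sketch, the helpers below (hypothetical names, not part of any database tool) strip the whitespace and compute the fraction of G and C bases. The record reports a GC content of 59% for the full gene; the example runs on the first printed line only.

```python
def clean_sequence(text: str) -> str:
    """Join whitespace-separated sequence blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_line = "ATGAAGGTGG CGGAGGAGTA CCCCGGCTTG TACGAATACG CCAACTTCGA GCTACCGCAG"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.1%}")
```

Passing all seventeen printed lines through `clean_sequence` and then `gc_fraction` would give the value for the whole gene.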
 
Protein sequence
MKVAEEYPGL YEYANFELPQ EELFLPGHGL CASCTIGVIA RHMLKVLGPD TIVVNPTGCA 
EVSTVVYPRT NWAVPWIHVA FGNGGSVASG IEAAIKVLKR RGVIDPNRKI NIVVFAGDGG
TADIGFQALS GMLERGHKVI YVMYDNEGYM NTGIQRSGTT PFGASTTTAP AGKKVPGNVT
HKKPMVAIAA AHGIPYAATA NPAYVHDMVY KFKKAAEADG PAFLHILQSC TPGWRFEPKY
AIRVLELATE TGYWVNYEID HGEFRVTVPV PKRKPVKCFL QLQGRFRHLK PEEIDTIQAL
IDKDVAEINR IVGREVIGPV DPGLECLTPR GAR
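
The protein sequence can be cross-checked against the gene sequence by translating it codon by codon. The sketch below builds the codon table from the compact NCBI-style amino acid string; translation table 11 assigns the same amino acids as the standard code and differs only in which start codons are permitted, which does not matter here because the gene begins with ATG. The function name and the decision to drop a single trailing stop codon are choices made for this illustration.

```python
from itertools import product

# Codons in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, dropping one trailing stop codon."""
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, len(cds), 3))
    if protein.endswith("*"):
        protein = protein[:-1]
    return protein


# First 30 bases and last 42 bases of the gene sequence listed above.
print(translate("ATGAAGGTGG CGGAGGAGTA CCCCGGCTTG"))                 # MKVAEEYPGL
print(translate("GACCCCGGCC TAGAGTGCCT AACGCCTAGG GGGGCCCGGT AA"))  # DPGLECLTPRGAR
```

The two outputs match the first and last blocks of the listed protein sequence. The last line of the gene can be translated on its own because the sixteen preceding lines total 960 bases, a multiple of three.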