Gene Pars_0242 details


Gene Information

Locus tag: Pars_0242
Symbol:
ID: 5055703
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 216130
End bp: 217317
Gene Length: 1188 bp
Protein Length: 395 aa
Translation table: 11
GC content: 60%
IMG OID: 640467821
Product: FAD-dependent pyridine nucleotide-disulphide oxidoreductase
Protein accession: YP_001152509
Protein GI: 145590507
COG category: [C] Energy production and conversion
COG ID: [COG1252] NADH dehydrogenase, FAD-containing subunit
TIGRFAM ID:
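
The coordinates and lengths listed above are related by simple arithmetic, which the following minimal Python sketch checks. It assumes the start and end positions are 1-based and inclusive and that the final codon is a stop codon; the function name cds_lengths is hypothetical and not part of any database tool.

```python
def cds_lengths(start_bp, end_bp):
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumes 1-based, inclusive coordinates and a terminal stop codon.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # the last codon is the stop codon
    return gene_len, protein_len


# Start bp and End bp as listed for Pars_0242
print(cds_lengths(216130, 217317))  # (1188, 395)
```

The result agrees with the Gene Length and Protein Length fields of this record.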


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.225215
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAATTCG AAGCAGCCTC TATGAAAAGG GTGGTGATAG TCGGTGGGGG CATCGCTGGC 
ATGACTGTGG CAAAGACGTT GCTAGAGGGC AAGATGCAGG CCGAGATAAC GGTGGTAAAC
TCGGCGCCCC ACTACTTCGC GGGGCCCAGC AGGCCGCTCC TGCTGACGGG GGAGCAGTCA
CTGGACCGCA TTGTGAGGAG TTATGAAGAG GTCGCTAGGA GAGGCGTCAA GGTGATGGTC
GGCACAGTCT ACTCCGTGGA CCCGGCGAAT AGGAAGGTGC GGCTGGTCGG CGGCTACGCC
TCGGACGGCG GCGTCAAGGA GCTTCAGTAC GACTACCTCG TATTGGCCCC GGGCGTGGTG
CTAGACGGCT CTGGCATAAC GGGCTACGAC AAGTACAGGG GCAACGTGCT GAACGTCTAC
GACCCCGGCA GGGTGCGCAC TTTGAAAGAG AGGGTGTGGA AAGCCGAGAA GGGCACCGTG
GTCGTCTACG CCCCCAAGGC GCCGTACAGG TGCGCCCCAG CGCCTACCGA GACCGCCACC
TTAATAGACG CCGTGCTTAG GCACAGGGGG GTAAGGGACA AGTTCAGGAT TATACACATA
GACGCCAACG ACAAGACCCA GCCTCCAGTA ATCGCCGACG TGGTGGCCGA GCTGTACAGG
AAGCTGGGCA TAGAGCTTGT CGTCAACCAG GAAATCGCGG AGATCGGGGA GAACTACGTC
GTGACTAAGA GCGGCGAGAG GTACAACTAC GACATCTTGG CAATGCTGGA GCCCAACAGG
GCGCCTAAGT TCATAGCCGA GGCCGGCTTG GGGGAGAACT GGATCAGCGT GCGCGGCCCC
CAGGACCTCC GCCACCCCAA GTTCGACGAC GTACTCGCCG CAGGCGACGC CGCAAGTCTG
CCGTTCCCCA AGAACCAAGA AATCGCCTTC GAGAGCGCCT TATTCGCCGC CAACAAAATA
CTCGAGATGG AGGGGCTGAG CCACAGAGCC AGCGTCCAGT ACGCCTTCTT GGGCTGGGCC
TATGTCGGCA ACCCTGAGGG GAGGCTGGAG ACTCTGTCCG TCATGTTCGG CCTAGACTTC
ACCACACAGC CGCCTAAGCC GACCAAAGAC CCGCAGCCAA AGAGAGAGTA CACACAGCGA
AAAGACAGCT GGGAGCAGAG CTACCTCGCC AACCTATTTG GGTACTAG
 
Protein sequence
MKFEAASMKR VVIVGGGIAG MTVAKTLLEG KMQAEITVVN SAPHYFAGPS RPLLLTGEQS 
LDRIVRSYEE VARRGVKVMV GTVYSVDPAN RKVRLVGGYA SDGGVKELQY DYLVLAPGVV
LDGSGITGYD KYRGNVLNVY DPGRVRTLKE RVWKAEKGTV VVYAPKAPYR CAPAPTETAT
LIDAVLRHRG VRDKFRIIHI DANDKTQPPV IADVVAELYR KLGIELVVNQ EIAEIGENYV
VTKSGERYNY DILAMLEPNR APKFIAEAGL GENWISVRGP QDLRHPKFDD VLAAGDAASL
PFPKNQEIAF ESALFAANKI LEMEGLSHRA SVQYAFLGWA YVGNPEGRLE TLSVMFGLDF
TTQPPKPTKD PQPKREYTQR KDSWEQSYLA NLFGY
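
The protein sequence is the translation of the gene sequence under translation table 11. As a rough illustration, the Python sketch below translates a nucleotide string and computes its GC fraction. The helper names (clean, translate, gc_content) are hypothetical. The codon table is written out by hand: table 11 uses the same codon-to-amino-acid assignments as the standard genetic code and differs only in which start codons it permits, which this sketch does not model.

```python
from itertools import product

# Codon assignments in NCBI order (bases T, C, A, G; first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing and upper-case."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_content(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First 30 bases of the gene sequence listed above
first_30 = "ATGAAATTCG AAGCAGCCTC TATGAAAAGG"
print(translate(first_30))  # MKFEAASMKR
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence to translate and gc_content allows the whole protein and the reported GC content to be checked in the same way.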