Gene Pars_2291 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_2291 
Symbol 
ID5054128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp2051416 
End bp2052492 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content61% 
IMG OID640469843 
Producthypothetical protein 
Protein accessionYP_001154487 
Protein GI145592485 
COG category[C] Energy production and conversion 
COG ID[COG0644] Dehydrogenases (flavoproteins) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones22 
Fosmid unclonability p-value0.0761399 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGTCG CCGTAGTAGG CGCTGGCCCG GCGGGGCTGG CCTTTGCTTC AGAGTTCGGG 
GACGCAGACG TATTCGAAGA GCACCTCGAG GTGGGATTGC CTAGACACTG CACCAGTTTA
GTGAGCGCCT CTTCCGCCAA GGCGGTGGGA ATCCCCCAGT CGCTTGTATT GGCAAAATAC
AGCGACTTAA CAGTGGCGGA TCTGGAGGGA AGGAGTATAT ACTTCCGGAT AAGGCACGGC
ATCTACCTAA TCGACCGCCC AGGCCTCGAG CAGTGGCTCG CCGGCGGCGT GGGGAGGATT
TTCACTAGGC GGAAGGTGGT GGCGACGCGG GGCGGCTACG TCTATACCGC AGATGGTAGC
AGCCACGGCC CGTACGACTA CGTTGTGCTC GCCGAGGGGG CCGCGAGGAG GCTCTCCGGC
AGATACGGCC ACGTGGTGAG GCTCCCGGGG CTCCAGGTAG ATGTGAAAAG CGGCATAGGC
CTCCCGGGCA TCACCGTGGT CTACAACCAG AAGCTGTCTA AGTCCTACTT CGCTTGGATA
GTAGAAGTGG ACAAGGGGCT CTACCGGGTC GGCTTGGCGG ATCACTGTTG TACCGTTCAG
AAGCTCTTTA AGCTGGTAAA GCTCGTGCGC GGCGAGCCCG TCGGCAAGCC CTTCGGCGGA
GGCGTGCTGG CGGGTCCCCC GCTGAGACGG CTGGTCTGGG GGCGGGAGAT ACTGGTAGGC
GACGCGGGTG GCCTCGTTAA GCCGCTCAGC GGCGGGGGGA TAATACTGGC GGTGAGGAGC
GGACGCCTCG CCGCCGAGGC CTTAGCTCGA GAGGAGATAG CCCAGTACGA GGAGGCGACG
AGGTGGGTTA GGCTTAGGCT GAGGCTTGCC TTCACAGCCT TTAGGCTACT CTACGGCATG
AGGCTCGTGG ATAAGGCGCT TCAACTCCTC AATGGCGGCG AGTACGTCGC CGTGGACTAC
GACGACCATG TAAAAACCCT CGCGTTCGCC GCGTTGACAG ATTTAAGATC CCTTGCCGTT
TTGAAAGAGG CAACGCGGTA TTTAGCGAGT AATCGTAATG TTCTTCATTT CCTCTAG
 
Protein sequence
MRVAVVGAGP AGLAFASEFG DADVFEEHLE VGLPRHCTSL VSASSAKAVG IPQSLVLAKY 
SDLTVADLEG RSIYFRIRHG IYLIDRPGLE QWLAGGVGRI FTRRKVVATR GGYVYTADGS
SHGPYDYVVL AEGAARRLSG RYGHVVRLPG LQVDVKSGIG LPGITVVYNQ KLSKSYFAWI
VEVDKGLYRV GLADHCCTVQ KLFKLVKLVR GEPVGKPFGG GVLAGPPLRR LVWGREILVG
DAGGLVKPLS GGGIILAVRS GRLAAEALAR EEIAQYEEAT RWVRLRLRLA FTAFRLLYGM
RLVDKALQLL NGGEYVAVDY DDHVKTLAFA ALTDLRSLAV LKEATRYLAS NRNVLHFL