Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2291 |
Symbol | |
ID | 5054128 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 2051416 |
End bp | 2052492 |
Gene Length | 1077 bp |
Protein Length | 358 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640469843 |
Product | hypothetical protein |
Protein accession | YP_001154487 |
Protein GI | 145592485 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 22 |
Fosmid unclonability p-value | 0.0761399 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCGTCG CCGTAGTAGG CGCTGGCCCG GCGGGGCTGG CCTTTGCTTC AGAGTTCGGG GACGCAGACG TATTCGAAGA GCACCTCGAG GTGGGATTGC CTAGACACTG CACCAGTTTA GTGAGCGCCT CTTCCGCCAA GGCGGTGGGA ATCCCCCAGT CGCTTGTATT GGCAAAATAC AGCGACTTAA CAGTGGCGGA TCTGGAGGGA AGGAGTATAT ACTTCCGGAT AAGGCACGGC ATCTACCTAA TCGACCGCCC AGGCCTCGAG CAGTGGCTCG CCGGCGGCGT GGGGAGGATT TTCACTAGGC GGAAGGTGGT GGCGACGCGG GGCGGCTACG TCTATACCGC AGATGGTAGC AGCCACGGCC CGTACGACTA CGTTGTGCTC GCCGAGGGGG CCGCGAGGAG GCTCTCCGGC AGATACGGCC ACGTGGTGAG GCTCCCGGGG CTCCAGGTAG ATGTGAAAAG CGGCATAGGC CTCCCGGGCA TCACCGTGGT CTACAACCAG AAGCTGTCTA AGTCCTACTT CGCTTGGATA GTAGAAGTGG ACAAGGGGCT CTACCGGGTC GGCTTGGCGG ATCACTGTTG TACCGTTCAG AAGCTCTTTA AGCTGGTAAA GCTCGTGCGC GGCGAGCCCG TCGGCAAGCC CTTCGGCGGA GGCGTGCTGG CGGGTCCCCC GCTGAGACGG CTGGTCTGGG GGCGGGAGAT ACTGGTAGGC GACGCGGGTG GCCTCGTTAA GCCGCTCAGC GGCGGGGGGA TAATACTGGC GGTGAGGAGC GGACGCCTCG CCGCCGAGGC CTTAGCTCGA GAGGAGATAG CCCAGTACGA GGAGGCGACG AGGTGGGTTA GGCTTAGGCT GAGGCTTGCC TTCACAGCCT TTAGGCTACT CTACGGCATG AGGCTCGTGG ATAAGGCGCT TCAACTCCTC AATGGCGGCG AGTACGTCGC CGTGGACTAC GACGACCATG TAAAAACCCT CGCGTTCGCC GCGTTGACAG ATTTAAGATC CCTTGCCGTT TTGAAAGAGG CAACGCGGTA TTTAGCGAGT AATCGTAATG TTCTTCATTT CCTCTAG
|
Protein sequence | MRVAVVGAGP AGLAFASEFG DADVFEEHLE VGLPRHCTSL VSASSAKAVG IPQSLVLAKY SDLTVADLEG RSIYFRIRHG IYLIDRPGLE QWLAGGVGRI FTRRKVVATR GGYVYTADGS SHGPYDYVVL AEGAARRLSG RYGHVVRLPG LQVDVKSGIG LPGITVVYNQ KLSKSYFAWI VEVDKGLYRV GLADHCCTVQ KLFKLVKLVR GEPVGKPFGG GVLAGPPLRR LVWGREILVG DAGGLVKPLS GGGIILAVRS GRLAAEALAR EEIAQYEEAT RWVRLRLRLA FTAFRLLYGM RLVDKALQLL NGGEYVAVDY DDHVKTLAFA ALTDLRSLAV LKEATRYLAS NRNVLHFL
|
| |