Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0634 |
Symbol | |
ID | 5056108 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 562541 |
End bp | 563509 |
Gene Length | 969 bp |
Protein Length | 322 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 640468193 |
Product | hypothetical protein |
Protein accession | YP_001152877 |
Protein GI | 145590875 |
COG category | [C] Energy production and conversion |
COG ID | [COG0644] Dehydrogenases (flavoproteins) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.621704 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGGGCCA CAGCTAGGGT TGTGGGCCTC GGCCCCTCGG GATCCGCCTT CCTCCACTTT TACGGCGCCG CCCGCGGCGT GGAGAGGTCG CCGCGGTATT TCAAGGCCTG CGGCGAGGCC GTACCCGTAG AGACCCCGCT GGTGGGGAAA GAACACGTGG TCGACAAGGT TAGGCTCTTC CGCTTTTACT ATTGGAAGAG GGAGGTGGGG GAGGTGGCCT ATCAGAAGCC CAGGTGGTAC ATAATCGACA AGGCCAAGTG GGTGGAGCAA CTGAGGGCCG CGGCTACGGG GAGCGGGGCG GTGGACGGGG AGGTGGTGGT CAAGGCGGGC GGCCCGTACC AGAGCGAGGG GGGTAAGATA ACGGTGGTGC GGGCGTACGT GGAGGGGGTC AAGCTGGAGG ACGAGGCGGT CTACTTCGTC TTCCCGCCGG ACTCGGTCGG CTTTTACTGG GCCTTCCCCC ACGGCGGGGT TTACAATGTC GGAGGCGGGT TCATCGGCGT GGAGAACCCA GTGCCCCTGG TAAGGGCCTT TGTGCAGAAG TGGCTGGGTG GGGGCCGCGT GGTTGACGTC CGGGGGGCGC CGCTAACCGT GGAGCCTAAG ATAGTCCTCC ACGACGGGGA GGCCTTCCGC ATAGGCGAGG CCGCCGGGCT GGTGTACCCT CTGACGGGCG AGGGCATTAG GCCGGGGGTC CTCTCGGCAA AGGCCCTGGC GGAGGCCCTT ACTACGAAAA AGCCGCTGGA AACGTATAGA AGGGCCGTTG CCGACATCGC CAAGCAGGTG GAATTCCAGA AAAGGCTGTT AAAGGCGGCG CGGCGGTTGA TAGAGCGGGG GGCTTCGATT ATGGAGCTCG CCAACGACGG CGTATTGCGG GACTACATCG AGGAGAACCT CTCCGCAAGG GCCCTCTTCG CGGCGCTCGC CAAGAGGCCG GCCGTCGGCG TGAGGCTTGT GGCCGCGTTG ATTAAATAA
|
Protein sequence | MRATARVVGL GPSGSAFLHF YGAARGVERS PRYFKACGEA VPVETPLVGK EHVVDKVRLF RFYYWKREVG EVAYQKPRWY IIDKAKWVEQ LRAAATGSGA VDGEVVVKAG GPYQSEGGKI TVVRAYVEGV KLEDEAVYFV FPPDSVGFYW AFPHGGVYNV GGGFIGVENP VPLVRAFVQK WLGGGRVVDV RGAPLTVEPK IVLHDGEAFR IGEAAGLVYP LTGEGIRPGV LSAKALAEAL TTKKPLETYR RAVADIAKQV EFQKRLLKAA RRLIERGASI MELANDGVLR DYIEENLSAR ALFAALAKRP AVGVRLVAAL IK
|
| |