Gene Information |
Locus tag | Pars_2282 |
Symbol | |
ID | 5055210 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 2044084 |
End bp | 2045361 |
Gene Length | 1278 bp |
Protein Length | 425 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640469834 |
Product | NADH/ubiquinone/plastoquinone (complex I) |
Protein accession | YP_001154478 |
Protein GI | 145592476 |
COG category | [C] Energy production and conversion; [P] Inorganic ion transport and metabolism |
COG ID | [COG0651] Formate hydrogenlyase subunit 3/Multisubunit Na+/H+ antiporter, MnhD subunit |
TIGRFAM ID | |
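The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only and assumes two things the record does not state explicitly: that Start bp and End bp are 1-based inclusive coordinates, and that the gene length includes a terminal stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 2044084
END_BP = 2045361


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1278, matching "Gene Length"
    print(protein_length(length))  # 425, matching "Protein Length"
```

Under these assumptions both computed values agree with the Gene Length and Protein Length fields of the record.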
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 0.335323 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGTCTGAGC CCTTCTACCT CCCCACCGTC TTGGCCACCC TACTTTCGCT GGTAGCTGTG AAGGCGGGGA GGTGGGCCGC TGTTGCTGCT GTACTCTCCA TGTTCCTCTT CGTGGCGCAG GCCAACTACT CCGCCACGCT GGCGTCCCTC CCCTACCTTG GCGAGGTACG TGTCTCCGTA ACGCCGGACA AGCTGCCGTT TGTCCTCACG TCGGTTGTCC TAGGGCTAAT CGTCACGGTG TACGCTGAGA GGTATTTACA ACATCTCAAG GCCGAGAGGT GGTACTACGC CGTCCAGTCG CTCTACGTCC TCTCGTTCGT CTACATAATC ATATTTGAAA ACCTCATATT CGTCTTCCTG GCACTGGAGC TGTCTATCAT CACAAGCTTC CTCCTCATCT GGTATTTTGG ATACGGCGAC AGGCGGAGGG TTGGGTTACT GTACTTCATA TGGGCCCAGA TTGGCTCTAT CCTTTTCCTG ATCGGGATCG CCATGTCCGG CACTTACCTC GCCGCTGACT TTAAGGCGGC TGGGACGGCG GCTTTGATGG TGCTGGTCGG GTTGCTGGTC AAGATGGGCA CGGCGGGGGT CCACTTCTGG TTGCCCTACG CCCACGCCGA GGCGCCGACT CCCCTCTCAG CCCTTCTGAG CCCCGTCCAC GTGGGGCTGA TGGCCTACTG GCTCTGGCGA CTCAGGGATG GGGCAGGCTG GCCGCTGGAG ACGCTCTACC TCTACGGCTT GGCCACAGCC GTCTACGGCT CCCTCCTCGT CTTCCGGGAG TCCGACATAA AGAGGGCCTT GGCTGATTCG ACCATCGCCA ACATGGGCCT CCTCGTGGCG GCGGCGGCTG TACCGAAGCC GGAGCTTAGT TACCTCGCCA CGGCCATGTT ATTCGTAGGC CACGCCTTTG CCAAGGCCGC CGGCTTCATG CTGGCGGGTA TCTACATCGT GGGACTGCAC ACTAGAGATC TCGACCAGCT CAGGTGGGAC ACTCGGGTCT TGGCGCTAGG CGTCCTCTCC TTCGTTGCAC TATCCGGCGT CTTTGGCATT AACCTGCTGG GTAAGGCCAT GGTAGCGGTC GGCGTGCCCC AGGCCCTGGC GGCGGCTGTC CTCCTTATAA CGGCGCTGTT CTCCACGGCC CTCTACAGCT TCTACCTCCT CCACAAGATC TACAAAGGCG GCCAGGCTGA GGTGCCCACT GACGGCGGGA TGTACCTCTC GGCGTTGGCC GCCGCGGCGG CTCCCTTCAT CCTCCTCGTC GCCGTGTTAG CGATATGA
Protein sequence | MSEPFYLPTV LATLLSLVAV KAGRWAAVAA VLSMFLFVAQ ANYSATLASL PYLGEVRVSV TPDKLPFVLT SVVLGLIVTV YAERYLQHLK AERWYYAVQS LYVLSFVYII IFENLIFVFL ALELSIITSF LLIWYFGYGD RRRVGLLYFI WAQIGSILFL IGIAMSGTYL AADFKAAGTA ALMVLVGLLV KMGTAGVHFW LPYAHAEAPT PLSALLSPVH VGLMAYWLWR LRDGAGWPLE TLYLYGLATA VYGSLLVFRE SDIKRALADS TIANMGLLVA AAAVPKPELS YLATAMLFVG HAFAKAAGFM LAGIYIVGLH TRDLDQLRWD TRVLALGVLS FVALSGVFGI NLLGKAMVAV GVPQALAAAV LLITALFSTA LYSFYLLHKI YKGGQAEVPT DGGMYLSALA AAAAPFILLV AVLAI
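The relationship between the gene sequence and the protein sequence above can be checked by translating the nucleotides codon by codon. The following is a minimal sketch, not the database's own method. It assumes the gene sequence is listed 5' to 3' in the coding sense (it begins with ATG and ends with TGA, even though the gene lies on the minus strand), and it relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code, differing only in which start codons are permitted. The helper names `clean`, `translate` and `gc_fraction` are hypothetical.

```python
# Codon table built from the NCBI base order T, C, A, G.
# The amino-acid assignments are those shared by tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        # Codons with ambiguous bases are reported as "X".
        residue = CODON_TABLE.get(dna[pos:pos + 3], "X")
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First three and last two blocks of the gene sequence in the record.
    print(translate("ATGTCTGAGC CCTTCTACCT CCCCACCGTC"))  # MSEPFYLPTV
    print(translate("GCCGTGTTAG CGATATGA"))               # AVLAI
```

The two fragments reproduce the first ten and last five residues of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the complete protein sequence and the GC content field.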