Gene Information |
Locus tag | Pars_2281 |
Symbol | |
ID | 5054691 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 2042459 |
End bp | 2044087 |
Gene Length | 1629 bp |
Protein Length | 542 aa |
Translation table | 11 |
GC content | 64% |
IMG OID | 640469833 |
Product | NADH/ubiquinone/plastoquinone (complex I) |
Protein accession | YP_001154477 |
Protein GI | 145592475 |
COG category | [C] Energy production and conversion [P] Inorganic ion transport and metabolism |
COG ID | [COG1009] NADH:ubiquinone oxidoreductase subunit 5 (chain L)/Multisubunit Na+/H+ antiporter, MnhA subunit |
TIGRFAM ID | |
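The coordinate, gene length, and protein length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive and that the unspliced CDS ends in a single stop codon. Both assumptions are inferred from the values in this record, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


# Values taken from the record above (Start bp, End bp).
length = gene_length(2042459, 2044087)
print(length, protein_length(length))  # prints "1629 542"
```

Both results match the Gene Length (1629 bp) and Protein Length (542 aa) fields.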
| |
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.454821 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 27 |
Fosmid unclonability p-value | 0.480352 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGCTTG AGCTCTGGTT GCTGGTCCTC GCCCTACTCG CCTTAGCCGA TCTCTTCACC AAGAAGGGCA TAGGCTCGCT GGTAGGCGGC GCCGCTACCC TCTACCTATC CCTCACAAGG TCGCTGATGC CGGCGACGCA CCTCTTCCAC ATGGGGGATC TCGCCCACCC ACTATATATC TTCATATCTG GGATATATAC CGCCATCGCG GCGTACTCCA TCTGGTACGC CAGCCACTTG GAGAGGAGGG GGTGGTTCTG GCTGTGGATG GGGGTGTTCT ACACCTCCAT GCTCACCTTC GTCGCCGCCG ACCACTGGCT TGTTTTGATA ACGGGGTGGG GGGGTCTCGA CATAGCTAGC TGGGCCCTAA TCCTCACCTA TCACGACGGT GAGAAGTACG GCCGCGTGGG GCCGGGGGGG AGGGCATGGG GGGTGGCATG GGAGTGGGCG CCAAGCGCCT CGGCGCTGAG GGCTATCTTG ACCGTGGAGA TAGGCACCGC CTCTCTCGCC GCGGGCCTTG CCCCGGCCGC CGCCGCGCAG GGCCCGCACA TAAGCTCGCT GTCCGCCATG TCTGACCTCT CAGCGGCGCT TGTTCTGACG GCGGCTTTCG TGAAGGCGGC CCAGCTACCC TTCACAGACT GGCTTATGAC CGCGATGTCG GCACCTACGC CAGTCAGCGC CCTGCTCCAC AGCTCGACGA TGGTGAAGGC AGGGCCGATA CTCCTCCTCA AGCTGGGACA CGCCATGCCC ACATGGGCTG CGGGGACGGC CTTCGCCTTC GGCATCGCCA CTGCCTTGTA CGGAGGCGTC GTGGCGCTTG GGCAGAGAGA GCCGAAGGTC CTCCTCGCCG CCTCCACTGC CTCATATCTA GGCCTCATAA CAGCCTTCGC CCTTGCGAAG CCGGAGGAGG CGCTGTGGCT CACCTACTCC CATGGAGTGG CCAAGGCAAC TCTCTTCATG GCGGTTGGCC ACGCCATACA TATAGAGCAC ACAACCACGC CCACCCGGTT CCCCGTGGCG GCCAAGGCGG CTATGGGCCT TGCCCTCCTG ACGCTGGTGG GGCTGACCCC CCTAGGTGCA GTCGCCAAGA GCAACGCCGA GCCTTGGTTC CTTCTGTTCT CCTTCCTGAC CGCGGGGTAC GTGGGGAAGT TGATGCTAAA GACAGCCACC ACGCCGGGCG GCTGGGCGGT GGCGGCGCCG TATACAGCGC TGGCGGCGGC CAGCTTGGCT TTCCCCGTCT TGCCTAACCC CTTCTGGGCC CTCGCGCTTG CCGGCCTGGC ATTGGCGAAG ACGCCTGAGC CCACCGTTTT GCTCAGGCGG CTGGGTCTAC CCGTTCTCTA TGACGCGGTG GCCCCCGCCG TGTTTAAGGC AGTGAGGCAG GCCGCGGCGG TGGGAGACGG CTTTGTGGAC AAGCGCCTCT TATCACTGGA GGGGCTGTGG CGGGGCTTGG CGTCCCTCGT CGCCGTCGTG GACTTAATCT TCGACATGTT GCTCCACGAC TTCGTTCCCG CCCTAGTGCA GTCCGCCTCT GCCCAGCTAT CTAGGCGGAG TTTCGACTAC TACCTATACG TGGCCGGCGT CGGCGCGGGG ATAATCTTGG CCCTCGCGGT GTTGCTATGG ATCCACTAA
|
Protein sequence | MMLELWLLVL ALLALADLFT KKGIGSLVGG AATLYLSLTR SLMPATHLFH MGDLAHPLYI FISGIYTAIA AYSIWYASHL ERRGWFWLWM GVFYTSMLTF VAADHWLVLI TGWGGLDIAS WALILTYHDG EKYGRVGPGG RAWGVAWEWA PSASALRAIL TVEIGTASLA AGLAPAAAAQ GPHISSLSAM SDLSAALVLT AAFVKAAQLP FTDWLMTAMS APTPVSALLH SSTMVKAGPI LLLKLGHAMP TWAAGTAFAF GIATALYGGV VALGQREPKV LLAASTASYL GLITAFALAK PEEALWLTYS HGVAKATLFM AVGHAIHIEH TTTPTRFPVA AKAAMGLALL TLVGLTPLGA VAKSNAEPWF LLFSFLTAGY VGKLMLKTAT TPGGWAVAAP YTALAAASLA FPVLPNPFWA LALAGLALAK TPEPTVLLRR LGLPVLYDAV APAVFKAVRQ AAAVGDGFVD KRLLSLEGLW RGLASLVAVV DLIFDMLLHD FVPALVQSAS AQLSRRSFDY YLYVAGVGAG IILALAVLLW IH
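The gene sequence is displayed in space-separated blocks of ten bases and reads in the coding direction: its first codons (ATG ATG CTT GAG CTC TGG) correspond to the first residues of the protein (MMLELW), and it ends in the stop codon TAA. The following is a minimal sketch for stripping the block spacing, computing GC content, and translating such a sequence. The helper names are hypothetical. The codon table is the standard one, whose amino acid assignments are the same as in translation table 11 (the two tables differ only in the permitted start codons).

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def normalize(seq: str) -> str:
    """Remove the block spacing and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = normalize(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100 * sum(base in "GC" for base in s) / len(s)


def translate(seq: str) -> str:
    """Translate complete codons from position 0 up to the first stop codon."""
    s = normalize(seq)
    residues = []
    for i in range(0, len(s) - len(s) % 3, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First two blocks and the last nine bases of the gene sequence above.
print(translate("ATGATGCTTG AGCTCTGGTT"))  # prints "MMLELW"
print(translate("ATCCACTAA"))              # prints "IH"
```

Passing the full gene sequence to `gc_percent` and `translate` allows the reported GC content and protein sequence to be checked against the record.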