Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2272 |
Symbol | |
ID | 5054384 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 2034506 |
End bp | 2035726 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640469824 |
Product | NADH dehydrogenase (ubiquinone) |
Protein accession | YP_001154468 |
Protein GI | 145592466 |
COG category | [C] Energy production and conversion |
COG ID | [COG0649] NADH:ubiquinone oxidoreductase 49 kD subunit 7 |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 0.242862 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTGACG AATGGCCTTT TAACTTGAGG TTCGGCGACA CGTACTACGT GGAGAGGGAG GAGGAGCTAG TTGGCGGCCG CCGGGGTCTG ACGCTTGTCG TCGGGCCGCA ACATCCCGGA TCGGGCCACA TGCGGATCTT CCTCGTGCTG GACGGCGACG TCATCGTAGA CGCCTTTCCT GACCCGGGCT TTGTCCACCG CGGAATAGAA AAGCTGGCTG AGAACAGGCC CTACTGGACA TTGATACCGT TGGTGGAGAA GGCCTCCATC ATGGACAGCG CCAACATCAT TTATCCCCTC GTGCTGGCGC TTGAAAAGAG TCTCGCCCTG GAGCCGCCGC CGCGGGCGAA GTATCTGCGG CTGATCATGG CGGAGCTTAC GCGTATTAGG ACCCACCTCT ACGACTTGGC GCTTCTCGGC ATCTTCCTAG GCCACTCCAC CGCCTTTATG TGGGGGTTCG CCCTGCAGGA CCTTATTGCC GAGGTCTTTG CCAAGATCGC CGGCGCCAGG ACCACCACGG CGTACCTCGT GCCGGGGGGT GTGCGGAGGG ATTTCAAGCC CGACCACGTA GAGCTTGTGG AGAGGCTGTT GAGGAAGGTC GAGGCCAAGC TCAACGATTT CAAGAATCTA TTTTTAGACA ACCCCGTGAC AAGGGCGAGG CTGGAGGGGG TGGGCGTCTT GGACGCGAAG AGGGTGGCCG AGCTGGGGGT GGTGGGCCCC TTTGCCAGGG CCTCCGGTGT TGATTTTGAC GTGAGGAGGG CCTATCCGTA CGATGCCTAC GCGGAGCTGG GCTACGAGCC GGTTGTGGAC AAGGCCGGCG ATGCTTGGGC GAGGACGTGG GTTAGGTGGG AGGAGGTGAG GAGGTCGATT GAGCTTGTGC GCAGGGCGCT TAGGGAGCTA CCACAAGGCG ACGTGATTGA CAGCGCCCTC CTCTTCAAAA ACCCCGAGTA CAGGCGGGAG GGGATTTCAG GTGTTATGGG CGTGTATACA TACATGTACC CCGAGCCGGG CGAGTGGCTG GGCGTGGCCG AGGCTACGCG TGGACTCGCC CTAGCTCAGC TCTGGGCCTC AGGGTTGCAG CGGGTATATC GCATGAGGTT CGTCACGCCG TCCTGGCGCA ACCTCCGTGC CATGATAGAG GCTATGAAGG GAGAGAGGCT GGCGGATATG CCTGCTGTGT ACATGAGCTT CGGGTACTTC CCCCCCGAGG CAGATAGGTG A
|
Protein sequence | MLDEWPFNLR FGDTYYVERE EELVGGRRGL TLVVGPQHPG SGHMRIFLVL DGDVIVDAFP DPGFVHRGIE KLAENRPYWT LIPLVEKASI MDSANIIYPL VLALEKSLAL EPPPRAKYLR LIMAELTRIR THLYDLALLG IFLGHSTAFM WGFALQDLIA EVFAKIAGAR TTTAYLVPGG VRRDFKPDHV ELVERLLRKV EAKLNDFKNL FLDNPVTRAR LEGVGVLDAK RVAELGVVGP FARASGVDFD VRRAYPYDAY AELGYEPVVD KAGDAWARTW VRWEEVRRSI ELVRRALREL PQGDVIDSAL LFKNPEYRRE GISGVMGVYT YMYPEPGEWL GVAEATRGLA LAQLWASGLQ RVYRMRFVTP SWRNLRAMIE AMKGERLADM PAVYMSFGYF PPEADR
|
| |