Gene Information |
Locus tag | Pars_2266 |
Symbol | |
ID | 5055862 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2028062 |
End bp | 2029066 |
Gene Length | 1005 bp |
Protein Length | 334 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 640469818 |
Product | D-isomer specific 2-hydroxyacid dehydrogenase, NAD-binding |
Protein accession | YP_001154462 |
Protein GI | 145592460 |
COG category | [C] Energy production and conversion; [H] Coenzyme transport and metabolism; [R] General function prediction only |
COG ID | [COG1052] Lactate dehydrogenase and related dehydrogenases |
TIGRFAM ID | |
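The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the single terminating stop codon; the function names `gene_length` and `protein_length` are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Start bp, End bp).
length_bp = gene_length(2028062, 2029066)
print(length_bp)                  # 1005
print(protein_length(length_bp))  # 334
```

Both results match the listed Gene Length (1005 bp) and Protein Length (334 aa).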
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.098436 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0903762 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTAAAT ACAAAGTTGT TGTCTTATCG CCTGTACCTG AGGCATTGAT AAAAATGTGG GCTACGCCCA TTGCTCAAAA ATATGGCATA CCCATCGAAG AGATTGAAGT TGTCACCCTC TTCGAGCCCA ATTACGAGGA GGTAGCTCGC CAGGTTGCCG ACGCAGACGT TGTGGTTGGC GACTACACCT TCAGAATTAA AATCGACGCC GACTTGTGCC AAAAAATGTC TAAGGTGAAA CTAATTCAAC AGCCAAGCAC CGGCTACGAC CACATAGACG TCGTAGCTTG CGCAAAGAGG GGCATCCCAG TGGCGAATAT CGGTGGGGCT AACTCCATCT CTGTGGCCGA GCACACCATA ATGCTGGCCT TGATGTTGTT GAAGAGAGCT GTGTACGCTC ACCAGAAGCT AGTTAACGGC CAGTGGACGC AGGGGGAGCT CATGAACACT GTTGGGGAGC TCTATGGCAA AACATGGGGG ATACTCGGCA TGGGCAGAAT TGGAAAGGAG GTCGCCATAA GGGTCCTAGC TTTTGGTGCC AAAGTTATTT ACTACGACGT CGTGAGGAGG GAAGATGTAG AAAAACTGGG AGTCGAGTAT AGGCCTTTCA ACAGATTGCT GGCGGAGAGC GATGTGCTTA GCATCCACGT GCCTCTTACA GAGAAGACAA GGGGCATGAT CGGGGAGCGG GAGCTTAGGA TGATGAAGCC CACCGCGGTG CTTATCAACG TCTCGCGCGG CGAAATCACC GACGAAGAGG CACTCGCTAA AGCTGTGCGC GAGGGCTGGA TTGCCGGGGT GGGGGTAGAC GTATTCTCCG TCGAGCCCCC GCCTCCGGAT CATCCATTGT TACAAGTCGC AAGGGAGGGC TTCAACGTCA TCGTCACGCC GCATATCGCC GGCGCCACCA ATGAGGCTAG GATGAGAATT ATCAACGTGA CGCTAGATAA CGTGTTGAGA GTGCTTGCAG GTCTAAAGCC TGAAAATGTG GTGAATATGC CATGA
|
Protein sequence | MAKYKVVVLS PVPEALIKMW ATPIAQKYGI PIEEIEVVTL FEPNYEEVAR QVADADVVVG DYTFRIKIDA DLCQKMSKVK LIQQPSTGYD HIDVVACAKR GIPVANIGGA NSISVAEHTI MLALMLLKRA VYAHQKLVNG QWTQGELMNT VGELYGKTWG ILGMGRIGKE VAIRVLAFGA KVIYYDVVRR EDVEKLGVEY RPFNRLLAES DVLSIHVPLT EKTRGMIGER ELRMMKPTAV LINVSRGEIT DEEALAKAVR EGWIAGVGVD VFSVEPPPPD HPLLQVAREG FNVIVTPHIA GATNEARMRI INVTLDNVLR VLAGLKPENV VNMP
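The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, assuming plain Python without external libraries, of how the blocks could be joined, translated, and checked for GC content. The helper names (`clean_sequence`, `gc_fraction`, `translate`) are hypothetical. The codon assignments used here are shared by NCBI translation tables 1 and 11, which differ in their permitted start codons rather than in amino acid assignments. Only the first three and last two blocks of the gene sequence are used, to keep the example short.

```python
# Codon-to-amino-acid map shared by NCBI translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna: str) -> str:
    """Translate in frame 0, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three and last two blocks of the gene sequence shown above.
first_blocks = clean_sequence("ATGGCTAAAT ACAAAGTTGT TGTCTTATCG")
last_blocks = clean_sequence("GTGAATATGC CATGA")

print(translate(first_blocks))  # MAKYKVVVLS
print(translate(last_blocks))   # VNMP
print(f"GC fraction of the first 30 bases: {gc_fraction(first_blocks):.3f}")
```

The two translated fragments agree with the start (MAKYKVVVLS) and end (VNMP) of the listed protein sequence. Passing the full cleaned gene sequence to `gc_fraction` would give the gene-wide value to compare against the GC content field.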