Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2089 |
Symbol | |
ID | 5056299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1865120 |
End bp | 1866454 |
Gene Length | 1335 bp |
Protein Length | 444 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 640469639 |
Product | D-lactate dehydrogenase (cytochrome) |
Protein accession | YP_001154287 |
Protein GI | 145592285 |
COG category | [C] Energy production and conversion |
COG ID | [COG0277] FAD/FMN-containing dehydrogenases |
TIGRFAM ID | [TIGR00387] glycolate oxidase, subunit GlcD |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.491336 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATGTGG GTTTTCTCAG GAAGACGTTT GGCGATAGGT TTGTTGAAGA TTCTTCCGTT GCGGCGTTGT ACGTCCACGA CGCCTCTTTT GTGGAGGGGG AAAGCAACGT GCTGGGGGTG GTCTTCCCCG AGACGGAGGG GGAGGTGGTT GAGCTGGTTA GGTGGGCTAT AAAGCATAAG GTGCCGTTGT TCCCACAAGG GAGTGCCACC AGCCTCTCGG GCAACGCCGC GGCTACTGCT AGAGGGCTCG TGGTGAGTTT TGAGAGGATG ACCAAAGTGG AGATAGACCC AGGGGACGGC GTGGCTGTGG TCGGCCCCGG GGTGAGGATC GAGGAGCTGA ACGTCGAGCT GGCCCGGTAC GGCTTCTTCT TCCCCGTCGA TCCCGGCTCT GTGAGGAGCG CCACGATTGG CGGGGCTATC GCCAACGGAG CCGGCGGGAT GAGGGGGGCG AAGTACGGCA CAATAAAGGA CTGGGTGTTG GGACTGAGAG TGGTGACGGG AAGAGGCGAC GTGTTGAAGG TGGGTTGCAA GACGTTCAAG TGCCGGAACG GCTATGATCT TGTGAGGCTA TTTGTCGGTA GCGAGGGGAC GCTGGGCCTT ATTACGGAGG CTGTTTTGAA GCTGGCTCCT GTGCCGGAGT CCGCCGTGGC CGTCTTGGCG TATTATGACG ACGTGGAGCC GCTTGTAGAG GACGTGGTTA GGGTTAGGGC AAGCAGAATT TGGCCGCTAT TTGCAGAGTT TTTAGACGCG CCGACTGCCG CCGTGGTGGG GCTTGAGGAG AGAGACACCC TCTTTCTTGG CGTCGACGTC AATACAGGTG CAGAGGAGAG AGTTTTGAAG AGACTCCAGT CTATCGTCAG GGGGAGAGTG GCCAGTGTGG CAGTGGGCTG GTCTGAAGCC ATGAAGCTAC TGGAGCCGCG CAGGAGGCTA TACTCGGCGC AGGTTCACCT CGCTCAGAGA GGCGGCGGCG TGTTGGTAAT TGAGGACGTT GCGGTGCCCA TTTCGAAGCT CCCAGACGCC GTGAGGGGGC TTAAAAAGCT GGCGGAGAAA TACGGCGTAC CGCTGTTGCT AGGCGGCCAT GTAGGCGACG GCAACTTACA CCCAGCTACT TGGTTTAGAA AAGAGGAGGG GCCCGGAAAG GCGGAAAAGT TTATCAGAGA AATGGCGGAG CTCGTGGTTG GGCTAGGCGG TACAGTGTCG GCAGAGCACG GGGTCGGGAC CTTGAAGAAA GATCTCATAG CGCTTGAGCT CGGCGATGCG GTGCTTACAT ATATGAGGGA GCTTAAGAAG GTCTTCGACC CCTACAATAT CCTCAACCCC GGCAAGATAG CCTAG
|
Protein sequence | MDVGFLRKTF GDRFVEDSSV AALYVHDASF VEGESNVLGV VFPETEGEVV ELVRWAIKHK VPLFPQGSAT SLSGNAAATA RGLVVSFERM TKVEIDPGDG VAVVGPGVRI EELNVELARY GFFFPVDPGS VRSATIGGAI ANGAGGMRGA KYGTIKDWVL GLRVVTGRGD VLKVGCKTFK CRNGYDLVRL FVGSEGTLGL ITEAVLKLAP VPESAVAVLA YYDDVEPLVE DVVRVRASRI WPLFAEFLDA PTAAVVGLEE RDTLFLGVDV NTGAEERVLK RLQSIVRGRV ASVAVGWSEA MKLLEPRRRL YSAQVHLAQR GGGVLVIEDV AVPISKLPDA VRGLKKLAEK YGVPLLLGGH VGDGNLHPAT WFRKEEGPGK AEKFIREMAE LVVGLGGTVS AEHGVGTLKK DLIALELGDA VLTYMRELKK VFDPYNILNP GKIA
|
| |