Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1571 |
Symbol | |
ID | 5056212 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1421413 |
End bp | 1422342 |
Gene Length | 930 bp |
Protein Length | 309 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640469112 |
Product | malate dehydrogenase |
Protein accession | YP_001153777 |
Protein GI | 145591775 |
COG category | [C] Energy production and conversion |
COG ID | [COG0039] Malate/lactate dehydrogenases |
TIGRFAM ID | [TIGR01763] malate dehydrogenase, NAD-dependent |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 0 |
Fosmid unclonability p-value | 0.0000000000000156495 |
Fosmid Hitchhiker | No |
Fosmid clonability | unclonable |
| |
Sequence |
Gene sequence | ATGATAACTG TAATAGGTAG CGGGAGAGTC GGCACGGCGG CCGCCGTTAT TATGGGAATT CTGAAAGTAG ATACGAAAAT ACTTCTAATC GATATAGTCA AGGGGCTACC CCAAGGCGAG GCCCTCGACA TGAACCACAT GAGCTCTATC CTCGGCCTCG ACGTTGAGTA TGTGGGGTCA AACGAATACA AAGACATGGA GGGTAGCGAC TTGGTAATCG TCACTGCCGG ACTGCCAAGG AAGCCCGGCA TGACGAGGGA GCAGCTACTT GAGGCTAACG CTAAAATAGT CAGCGAGATA GGCAGAGAGA TAAGGAAGTA CGCGCCCGAG TCTGTGGTTA TCCTAACCAC AAACCCGCTT GACGCCATGA CCTACGTTAT GTGGAAAGCC ACCGGGTTCC CAAGAGAGCG CGTAATCGGG TTCAGCGGCG TCCTCGACGC TGGAAGACTT GCCTACTACG CATCCAAGAA GCTGGGCGTT TCGCCGGCGT CGATACTGCC CATAGTCCTG GGGCAACACG GCGAGAGTAT GTTCCCCGTC CCGAGTAAAA GTTTTGTACA TGGAGTGCCT CTGAGCAGGC TTCTTACAGA AGACCAGCTG AGAGAGGTGG TTGAGGAGAC GGTGAAGGCG GGGGCAAGGA TCACGGAGCT CAGAGGCTTC TCCTCTAACT GGGGCCCAGG ATCCGGCTTG GCGATAATGG CAGAGGCCGT GAAGCGTGAT TCCAAGCGTT CCCTAATCGC CTCTGTGGTT TTAGAAGGAG AGTACGGAGT GCGCGACGTC CCTGTGGAGG TCCCCATAGT GCTGGGAAGA CGCGGAGTGT TGAAAGTCCT AGAAGTGGAA CTAACAGAAG AGGAGAGGCA GAAATTTATG CAAAGCGTAG AAGCCGTGAA GAAGCTTATT TCTTCCCTAC CGCCTGCGTA CTTATCGTAA
|
Protein sequence | MITVIGSGRV GTAAAVIMGI LKVDTKILLI DIVKGLPQGE ALDMNHMSSI LGLDVEYVGS NEYKDMEGSD LVIVTAGLPR KPGMTREQLL EANAKIVSEI GREIRKYAPE SVVILTTNPL DAMTYVMWKA TGFPRERVIG FSGVLDAGRL AYYASKKLGV SPASILPIVL GQHGESMFPV PSKSFVHGVP LSRLLTEDQL REVVEETVKA GARITELRGF SSNWGPGSGL AIMAEAVKRD SKRSLIASVV LEGEYGVRDV PVEVPIVLGR RGVLKVLEVE LTEEERQKFM QSVEAVKKLI SSLPPAYLS
|
| |