Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_2233 |
Symbol | |
ID | 5056394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2000370 |
End bp | 2001674 |
Gene Length | 1305 bp |
Protein Length | 434 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640469786 |
Product | malate dehydrogenase |
Protein accession | YP_001154431 |
Protein GI | 145592429 |
COG category | [C] Energy production and conversion |
COG ID | [COG0281] Malic enzyme |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.70342 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.683923 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACAGAAA AATGGTACCA ACTCTCGGTT GAGACGCACC GGAGATACGG CGGCAAAATC TCTGTAATAC CAAAGGTGCC GGTTAGGTCT ATAGAAGATT TCGCAATATA CTACACACCT GGTATAGCTG AGGTGTCGCG CCAGATTCAC AAAAACCCAG AAATGGCGTT TGAGCTTACC TCTAGGTGGA ATATTATTGG CGTATTGACA GACGGCACAA GAGTCCTAGG TCTAGGCAAC ATAGGCCCAG AGGCGGCGTA TCCTGTAATG GAAGGCAAAG CACTTATTTT CAAGTACTTA GGAGGAGTAG ACGCTATTCC CATTCCTATT AGGGTGCGGA CGCCTGAGGA GTTCATATTT GTAGCAAAGG CCCTCGAACC GGCGCTGGGA GGTATAAACC TCGAAGATAT AGAGTCCCCC AAGTGCTTCT ACCTGCTAGA CAAGTTGCGA GAAGAGTTGA AAATCCCGGT GTGGCACGAC GATCAGCAAG GCACAGCCAC CGCAACGCTC GCGGGACTTA TAAACGCGCT TAAGCTCGTG GGTAAGAAGT TCAGCGATGT CGTGATCGCC CTTATAGGCG CAGGCGCCTC GAATATATAC ACTGCCCGCA TCCTTATCAA ATACGGCGCT AAGCCGGGAA ACCTCATCTT GGTAGACAGC AAGGGGATTC TCCACCCCGA GCGCGACGAC ATAGACAAAA TGATGCTCGA AAACCCGTGG AAGTATAAAT ACGCCATTGA GACCAACGCA GAGCGGCGTA AAGGCGGCAT TCCCGAAGCT ATGAAAGGCG CAGATGTAGT TATTGGAGCG TCAAGGCCGG GTCCCGGCGT CATAAAGAAG GAGTGGGTAG CATCAATGAA CAAAGACGCC ATCGTATTTG CCTTGGCCAA CCCCGTCCCC GAGATCTGGC CCTGGGAGGC AAAGGAGGCT GGGGCCAAGA TTGTGGCTAC TGGGAGGAGT GACTTCCCCA ACCAGATAAA CAACTCGTTG ATATTCCCCG CCGTGTTCAG AGGCGCCCTA GACGTCAGAG CTACTACCAT AACTGATGAA ATGCTCATAG CCGCGGCAGA AGAGGTGGCG AAATTCGCCG AGGAAAAAGG AATCCACGAA GAGTATATAG TGCCAAAGAT TACAGAGTGG GAAGTTTATG TAAGAGAGGC GGCAGCCGTC GCGGCGATGG CCTCTTCACA AAGAGTGGCG AGGATCCCGA GATCTTACAA CGAGGAGCTT GAGATTGCGA GGAGTATAAT ATCAAAAAGC ATAAAGACTC TTGAAATCTT GATGAGGGAG AAAATAATTG AATAA
|
Protein sequence | MTEKWYQLSV ETHRRYGGKI SVIPKVPVRS IEDFAIYYTP GIAEVSRQIH KNPEMAFELT SRWNIIGVLT DGTRVLGLGN IGPEAAYPVM EGKALIFKYL GGVDAIPIPI RVRTPEEFIF VAKALEPALG GINLEDIESP KCFYLLDKLR EELKIPVWHD DQQGTATATL AGLINALKLV GKKFSDVVIA LIGAGASNIY TARILIKYGA KPGNLILVDS KGILHPERDD IDKMMLENPW KYKYAIETNA ERRKGGIPEA MKGADVVIGA SRPGPGVIKK EWVASMNKDA IVFALANPVP EIWPWEAKEA GAKIVATGRS DFPNQINNSL IFPAVFRGAL DVRATTITDE MLIAAAEEVA KFAEEKGIHE EYIVPKITEW EVYVREAAAV AAMASSQRVA RIPRSYNEEL EIARSIISKS IKTLEILMRE KIIE
|
| |