Gene Information
Locus tag | Pars_1031 |
Symbol | |
ID | 5056272 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 918236 |
End bp | 919156 |
Gene Length | 921 bp |
Protein Length | 306 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640468587 |
Product | NAD-dependent epimerase/dehydratase |
Protein accession | YP_001153261 |
Protein GI | 145591259 |
COG category | [G] Carbohydrate transport and metabolism [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG0451] Nucleoside-diphosphate-sugar epimerases |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.64326 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 29 |
Fosmid unclonability p-value | 0.538375 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGGTATC TGCTGTATGG TGGGCTCGGG TTTATAGGTG CGAACATCGT AGAGGAGCTC GCCGGGGAGG AGGTCTATGT GGCGCACAGG CCGGGGTCGC CGCAGAGAAG GCCCCGCATC GCCTCCTTTG TGTCGCAGTA CGCGAGGCTG GTGGAGTACG TAGACCCCGC CTCGCCGTTC GAGCAGGTCA AGCCAGACGT GGTCGTGAAC CTCGTTGGGG AGTACTTCGG GCCGCCGGAG GCTATTAGGG AGGCGAACGC CGAGTTTCCG AAAAAGCTGT GCGACGCGGC GAGGCGGGCG GGGTGGAGCG GGAAGATCGT TCACATCTCA GCGGCGACGG TAAGGGGGCC CGCGGGGGAG GTCATCACGG AGGAGGGCCG CCACCTAGAG GGGATAACCC CGGTCTCGGA CTTCGACAAG TGGAAGGCAG AGGGGGAGCG GGTCGTGGCC CAGTGCTTCG CCGACTGGGT CATCGTAAGG CCAGTCTTGG TGTACGGCCG CTTCAACGAC CACCCCGAGT GGGTAGCCCT CACCGGCATG GTGAAAAGGG GGATAGCGCC GATGATCAAC GCCGCCGTCT CCTCAATATC GGCAAGGGAG CTGGCCAAGG TGGTCAAGAT ATCCACCGCG CTGTCCAGGG AGTACTTCTT CGCCACGGAG TGCGAGGCGA GGAGGCTATC TGACTTCGTC TTAGCCATAG AAAAGGCGCT TGGGAAAAGG GCCCTCCACC TGCCGATACC TACGGCCCTA CTCAAGATCG CGGCGCCGCG CGACTTGAAG AAACACATCC CCTTCCTAGG CCGCCGGTTT AGTTGTGAAA AGATGAGCAA GCTTCTAAAA TACACCCCCA GCCCCGACTT CTACCGCGAG GTGGCGGAGA TGGTGGCATT TATTACCGGG AGAACCCAGG CTAGGGGATG A
|
Protein sequence | MRYLLYGGLG FIGANIVEEL AGEEVYVAHR PGSPQRRPRI ASFVSQYARL VEYVDPASPF EQVKPDVVVN LVGEYFGPPE AIREANAEFP KKLCDAARRA GWSGKIVHIS AATVRGPAGE VITEEGRHLE GITPVSDFDK WKAEGERVVA QCFADWVIVR PVLVYGRFND HPEWVALTGM VKRGIAPMIN AAVSSISARE LAKVVKISTA LSREYFFATE CEARRLSDFV LAIEKALGKR ALHLPIPTAL LKIAAPRDLK KHIPFLGRRF SCEKMSKLLK YTPSPDFYRE VAEMVAFITG RTQARG
|
| |
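The coordinate, length, and sequence fields above can be cross-checked against each other. The following is a minimal sketch, not part of the original record: the helper name `translate_cds` is hypothetical, and it assumes inclusive 1-based coordinates and that the initiator codon (here GTG, an accepted start codon in translation table 11) is read as methionine. Only the first 30 nucleotides of the gene sequence are used, to keep the example short.

```python
from itertools import product

# Codon table for NCBI translation table 11, built in TCAG order.
# Its amino acid assignments are the same as the standard code.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, treating the first codon as Met.

    Spaces (as used in the record's 10-base grouping) are ignored.
    Translation stops at the first stop codon.
    """
    dna = dna.replace(" ", "").upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    protein = ["M"]  # assumption: initiator codon is translated as Met
    for codon in codons[1:]:
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Values copied from the record above.
start_bp, end_bp = 918236, 919156

# Inclusive coordinates, so add 1.
gene_length = end_bp - start_bp + 1

# One codon per residue, minus the untranslated stop codon.
protein_length = gene_length // 3 - 1

# First three 10-base groups of the "Gene sequence" field.
first_30 = "GTGAGGTATC TGCTGTATGG TGGGCTCGGG"

print(gene_length, protein_length, translate_cds(first_30))
```

Under these assumptions the computed lengths agree with the Gene Length and Protein Length fields, and the translated fragment matches the first ten residues of the Protein sequence field.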