Gene Information |
Locus tag | Pars_0396 |
Symbol | |
ID | 5054388 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 346613 |
End bp | 347671 |
Gene Length | 1059 bp |
Protein Length | 352 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 640467963 |
Product | alcohol dehydrogenase |
Protein accession | YP_001152650 |
Protein GI | 145590648 |
COG category | [R] General function prediction only |
COG ID | [COG1064] Zn-dependent alcohol dehydrogenases |
TIGRFAM ID | |
|
|
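The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon (neither is stated in the record, but both are consistent with its values); the function names are illustrative only:

```python
# Coordinates copied from the record; assumed to be 1-based and inclusive.
START_BP = 346613
END_BP = 347671


def gene_length(start_bp: int, end_bp: int) -> int:
    """Number of bases covered by an inclusive [start, end] interval."""
    return end_bp - start_bp + 1


def protein_length(n_bases: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if n_bases % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return n_bases // 3 - 1  # drop the terminal stop codon


GENE_LEN = gene_length(START_BP, END_BP)
PROTEIN_LEN = protein_length(GENE_LEN)

print(f"Gene length: {GENE_LEN} bp")        # matches the record: 1059 bp
print(f"Protein length: {PROTEIN_LEN} aa")  # matches the record: 352 aa
```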
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAAGATTC CCGTCAAAAT AAAAGCGGCG GTATACAAAT CACCCGGCCA GCCGCTTGAG CTCTCCGAGA TACCGACCCC AACTCCGGGT GAAGGCGAGG TTTTGATAAA AGTCGCAGCA ACAGGGGTAT GCCACTCCGA CCTACATGTA TTAGACGGCG AGATGATCCC CCCGCCAGAG GGCTTCATAC TAGGACACGA GGTATCGGGG TGGGTAGTTG AATTTGGGCC GAGGTGCGAA AACCCCCACG GCCTCTCCCC AGGCGACCCC GTTGTTGTCT CTTGGATCAT ACCATGCGGC AAGTGCTACT GGTGCGTCAG AGGACAGGAA AACTACTGCC CATACGCCGC CGCGAGGATG CCAGGCCTTG TTGGGATCAA CGGAGGACAC GCGGAGTACA TGACAGTGCC TGAGACGGCG ATATACCCAA TACCAAAAGG ACTCGACGTA CACAACGCCG CGGTCATCTC GTGCGCCTAC GGCACTGCAT ACAGAGCTTT GAAAGAAGCC GGCGTCGGCC CCGGCACTTC GCTTGTGGTT GTAGGCGCCG GAGGCGTGGG GTTGGCGGCG GTTGAGCTTG CGGTTGCACT CGGAGCCTAC CCCATTGTAG CTGTAGACGT GAGAGAGGCT GCGCTGAAAA AAGCCCAGGA GGTCGGCGCC AGCCATGTGA TAAACGCCGC CGAGAGAAAC GCAATTGGGG CCATAAGAGA AGCGTTGCCC CAAGGCACTG ACGTGGTCTA CGAGACGAAG CCAAACCCAG ATCTCAAAAT TGCCCTAGAA GTTGTTAGAA GAGGGGGGAC AATAGTAGTC ACGGGCCTCG GCGCGTCAAC AATTGAGATA CCGGCAATGC ACCTCGTAAT GAATGGAATA AGAATTGTGG GGAGCCTAGG CTACAAGCCA CGCACCGACA TACCAGAACT CCTAGCCCTA GCCGCCGTGG GGAAAATAAA GCCTGAAAAA ATAATCTCAC ATAGGTATAA GCCGGAAAAC ATCAACGAGG CCTACAACAA CCTACGACAA GGAAAACACC ACCGCGCATT AATCATCTGG AATCCTTAA
|
Protein sequence | MKIPVKIKAA VYKSPGQPLE LSEIPTPTPG EGEVLIKVAA TGVCHSDLHV LDGEMIPPPE GFILGHEVSG WVVEFGPRCE NPHGLSPGDP VVVSWIIPCG KCYWCVRGQE NYCPYAAARM PGLVGINGGH AEYMTVPETA IYPIPKGLDV HNAAVISCAY GTAYRALKEA GVGPGTSLVV VGAGGVGLAA VELAVALGAY PIVAVDVREA ALKKAQEVGA SHVINAAERN AIGAIREALP QGTDVVYETK PNPDLKIALE VVRRGGTIVV TGLGASTIEI PAMHLVMNGI RIVGSLGYKP RTDIPELLAL AAVGKIKPEK IISHRYKPEN INEAYNNLRQ GKHHRALIIW NP
|
| |
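The gene and protein sequences above are printed in space-separated blocks of ten, and the gene starts with GTG while the protein starts with M. The sketch below shows one way to strip the block spacing, translate the gene under translation table 11, and compute a GC fraction. It is an illustration, not the database's own pipeline: the helper names are hypothetical, the codon table is the standard amino-acid assignment used by table 11, and the start-codon set is the one listed for table 11 by NCBI. Only the first 30 bases of the gene are used here to keep the example short.

```python
# Standard codon assignments (shared by translation table 11), in NCBI order:
# bases enumerated as T, C, A, G for each of the three codon positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}

# Alternative initiation codons of table 11; each is read as Met at position 0.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def clean_sequence(text: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if pos == 0 and codon in START_CODONS:
            residue = "M"  # initiator codon is always read as methionine
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean_sequence(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three ten-base blocks of the gene sequence from the record.
GENE_PREFIX = "GTGAAGATTC CCGTCAAAAT AAAAGCGGCG"

print(translate(GENE_PREFIX))  # MKIPVKIKAA, the first block of the protein
```

Passing the full gene sequence to `translate` and comparing the result with `clean_sequence` of the protein sequence would check the two fields against each other; `gc_fraction` on the full gene can be compared with the GC content field in the same way.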