Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0552 |
Symbol | |
ID | 5054519 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 494019 |
End bp | 495101 |
Gene Length | 1083 bp |
Protein Length | 360 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640468114 |
Product | glucose sorbosone dehydrogenase |
Protein accession | YP_001152799 |
Protein GI | 145590797 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2133] Glucose/sorbosone dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.0000617979 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCCCTCC TGGCTGGGTT AGGCCTTGCT ATAGGGAGGG TGATAAATTT GTTAAGAAAA GAGGGGGTTG ATGCGGTTGA GTTTAAAACA TCTATCGTGG CCTCCGATTT AGAGGTTCCC TGGTCTATTA CCCCTCTTGG AGGTAGGCGT TATCTAGTTA CAGAGCGCCC TGGTCGCTTA GTGTTGATAA GCCCCAGCGG AAAAAAGCTC GTGGCTTCAT TTGACGTGGC AAGCGTCGGC GAGGCAGGCC TGCTGGGTTT GGCGCTACAC CCTGATTTCC CTAAGAAAAC CTGGGTTTAT CTCTACGCCT CCTACTTCGA CAGTGCGGGG CAGATAAAGA ATAAGTTAAT TAAAGGACGT CTAGATCCAC TCACCTTTAG GCTTAGTGAA GTGAAGACTT TAATTGAGGA TATTCCGGGC GCCTATATTC ATAATGGAGG GCGCATTAGG TTCGGTCCTG ACGGCATGTT ATACATAACT ACAGGGGATG CGGCCAAGCC GCTACTTTCC CAAGACTTAT CCAGTCTAGG TGGTAAAATC CTCCGCGTAG ATGACGATGG AAAACCTTCC CCTGATAACC CCTTCCCTAA CAGTCCCATC TGGTCTTACG GCCACAGAAA TCCTCAAGGC ATTGACTGGC ACCCCGACAG TGGTGTGATG GTAACAACTG AGCATGGCCC AGTAGGCCAC GACGAAGTAA ACGTAATAGT GAAAGGGGGC AACTACGGGT GGCCGTTGGC AGTGGGGAAG GCCGATAGAG GCGAATTCAT AGATCCAATA ATCGAATCGG GCGGAGATAC TTGGGCGCCT TCGGGGGCCT CCTTTGTGCA CGGAGATGCG TTCCCAGAGC TTCGCAGTTG GTTGTTAATC GCATGTCTCA GAGGGAGTAT GATACTGGGA GTTGAGTTTG TCAACCAAAT GAAAGTGTTT GGAATTCACA TGTTTTTTAA AAATGTCTTT GGGAGACTCC GCGATGTTGT TATTGACGAA GACGGAGGTA TACTAATAAG TACCAGTAAT AGAGATGGTA GAGGTAACCC GAGAGACGGA GATGATAAGA TTTTAAAAAT TGTCCCCGCC TAA
|
Protein sequence | MALLAGLGLA IGRVINLLRK EGVDAVEFKT SIVASDLEVP WSITPLGGRR YLVTERPGRL VLISPSGKKL VASFDVASVG EAGLLGLALH PDFPKKTWVY LYASYFDSAG QIKNKLIKGR LDPLTFRLSE VKTLIEDIPG AYIHNGGRIR FGPDGMLYIT TGDAAKPLLS QDLSSLGGKI LRVDDDGKPS PDNPFPNSPI WSYGHRNPQG IDWHPDSGVM VTTEHGPVGH DEVNVIVKGG NYGWPLAVGK ADRGEFIDPI IESGGDTWAP SGASFVHGDA FPELRSWLLI ACLRGSMILG VEFVNQMKVF GIHMFFKNVF GRLRDVVIDE DGGILISTSN RDGRGNPRDG DDKILKIVPA
|
| |