Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1401 |
Symbol | |
ID | 5056367 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 1263622 |
End bp | 1265145 |
Gene Length | 1524 bp |
Protein Length | 507 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640468944 |
Product | carbohydrate kinase, YjeF related protein |
Protein accession | YP_001153613 |
Protein GI | 145591611 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.324731 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 30 |
Fosmid unclonability p-value | 0.9044 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGTAA TAACATCACT TGAGATGTAC GTCGCTGATA GAAACGCCGA GTGGCTCGGC GTGCCCCGGC TGGTTCTCAT GGAAAACGCG GGGGCCGCTG TGGCGCGTAA TATTTTGAAG AAGTATCCCC ACGCTTCTAG GGTGTTGGCT ATATGCGGGA CGGGAGATAA CGGGGGGGAC GGCTACGTGG CTGTGAGGCA CCTCCACGCC GCTGGGAAGG AGGTGCGGGT GATCGCGCTG GGCGAGCCAA GGGAGGAGCT AGCGGCGAGG AACTACCATG CTGTTAGGAG GCTGTGGGGG GTCGAGGTTG CTGTGGTTCA GTCCCCTCTT GAGCTCTTGG CGTTGCAAGA CTGGCTTATG TGGGCAGATG TTATAATAGA CGCGGTCCTA GGCACGGGGA TTAGGGGCGC ATTGAGGGAG CCGCACGCAA CGGCGATTGA GCTCATGAAC ATCGCCCCGG CGCCTAAGGT GGCGGTGGAT ATCCCAAGCG GCTTAGACCC CGACACGGGC GAGGTGAGAG ACAAGGCAGT GAAGGCGGCT CTCACCGTGA CTTTCCACAA GGCGAAGAAG GGACTCCTCG CCCCCAGCGC GGCGCGGTAC GTGGGGGAGC TGGTGGTGGA GCCGATTGGC ATTCCGCCTG AGGCAGAGGT CATAGTCGGC CCCGGCGACT TTGCCTACCT GAACTTCTCC CGGAGAGCCG ACTCGAAAAA GGGCGACCAC GGTCGGGTTC TAGTGGTGGG AGGCTCCTTG GAGTACTCCG GCGCTCCGGT ATTTGTGGCT AAAGCCGCCT TGAGGGCTGG GGTGGATCTC GCAGTGATCG CCGCGCCGGA GCCGGCGGCT TATGCGGCAA AGGCCATGGG CCCCGACGTG ATAGCAGTGC CCCTAGAAGG CCCCCGGCTA TCGCTGAGAC ACGTTGAAAA GATCGCCTCT TTGGCGGAGA GATTTGACGT AGTGGCTATT GGCCCCGGCC TCGGCACAGA GGGGGAGACC CCAGACGCCG TTAGGGAAAT CTTCAAGAGG CTCGCCGGCA GAAAACCGCT GGTGGTAGAC GCAGACGCCT TGAAGGCGCT AAGGGGCGAA AAGGCGGCGG GGGTTACTAT CTATACGCCC CACGCCGGGG AGTTCAAGGC GCTTACGGGA ATTGAGCCGC CTGAAGCCCT TAGGGAGAGG GCAGAGGTCG TGAAGCAACA AGCCGCATCA ATAGGCGCAG TCATTTTGCT AAAGGGCAGA TACGACGTCA TATCCGACGG GGTCAAGGTG AAGATAAACG CCACAGGCAC CCCCGCCATG ACTGTCGGCG GCACCGGCGA CGTGTTGACG GGGCTAGTCG CGGCGTTTTT GACAAAAACT ACAAACCCTC TAGAGGCGGC AGCGGTGGCC GCCTTCGTCA ATGGGCTGGC AGGAGAGGAG GCGGCCGCCC AGTTATGCTT CCACATCACC GCAAGCGACC TCCTGGACAA GATACCGGGC GTAATTAGGA AATTTGCAAG AGAGGAGGTC ACCCACGCCT CCTCGAGAGC TTAA
|
Protein sequence | MDVITSLEMY VADRNAEWLG VPRLVLMENA GAAVARNILK KYPHASRVLA ICGTGDNGGD GYVAVRHLHA AGKEVRVIAL GEPREELAAR NYHAVRRLWG VEVAVVQSPL ELLALQDWLM WADVIIDAVL GTGIRGALRE PHATAIELMN IAPAPKVAVD IPSGLDPDTG EVRDKAVKAA LTVTFHKAKK GLLAPSAARY VGELVVEPIG IPPEAEVIVG PGDFAYLNFS RRADSKKGDH GRVLVVGGSL EYSGAPVFVA KAALRAGVDL AVIAAPEPAA YAAKAMGPDV IAVPLEGPRL SLRHVEKIAS LAERFDVVAI GPGLGTEGET PDAVREIFKR LAGRKPLVVD ADALKALRGE KAAGVTIYTP HAGEFKALTG IEPPEALRER AEVVKQQAAS IGAVILLKGR YDVISDGVKV KINATGTPAM TVGGTGDVLT GLVAAFLTKT TNPLEAAAVA AFVNGLAGEE AAAQLCFHIT ASDLLDKIPG VIRKFAREEV THASSRA
|
| |