Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0316 |
Symbol | |
ID | 5054394 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 274709 |
End bp | 275788 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640467893 |
Product | GHMP kinase |
Protein accession | YP_001152580 |
Protein GI | 145590578 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0153] Galactokinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGAGGCTT TGGTGAAGGC CTCGGCTCCG GGTAGGCTAG ACTTCCTCAA CACACACCAG GACTACAAAG GCCTCCCAGT AGTCTCAGTG GCGGTGGATT TGCGGACAAC GGTGACCCTG CGGAGGGGGG AAGAGTTTGA AATAACGTCT CTTAACACGG GGGAGAGGTG CCATTTCAGC AAGCCCCTTA TAGAGGGGAG GTCTTTCTGC GATTATGTTA AGGCGGCGGT GATCTCCGTC GAGAGGGAGG GGGTCGTGCT GAGGGGCTTC TCGGGCGAGC TTTACTCCGA TATTCCGATA GGCGCGGGCA TGGCTAGCAG CGCGGCGATG CTCGTGGCGC TGGTGGGGGC CATGTTGAGG CTCGCCGGGA GAGGCGCCGA TTTATACACC GTCGCAGAGC TGGCGTACAG GGCTGAGAGG GAGGTTTTGG GAGTGCCCTG CGGCAGGTTG GACCAATACG GCTCGGCCTT CGGGAAGGTG GCGGTAATTT ACCCCAAGCC GCCGGTGAGG GTAGAGAGGC TTGAGATGCG GGGCGGCGTC TTCGTGGTGC TAGACAGCGG TATTAGGCAC AGCACTGCTG AGATTCACCC AAAGCGCCAG GCGGAGCTCC AGGAGGCCGT GGAGATCCTA AGAGAGGCAC TAGGCGTAGA GAGCGAGGGC TACTGGGACT TCCCGTGGGA GGTGCTAGAA GCCAGGAGGG GTGTTGTTGA GACGCTTCCG CAGCCGCTTA GGGATAGGGT GCTCTTTACC TTGGAAATGC AGAAGTCGAC CGAGCGGGCG CTTGCCTACC TGAGAGGGGC AGACGTGGAT AAGGCGCTGA GGGAGGTGGG GCGCGAGATG CTTTACCAAC ACCACCTCCT TTCCCGGCTT TACGAGGTGT CTCTGCCCAA GCTAGACCAG CTGGTGGAGG AGGCCGTCGC GGCCGGGGCC TACGGGGCTA AGCTCTCGGG CGCAGGCCTG GGCGGGGTTG TAATTGCCCT TGCCCCAAAT AGAGAAGTCG CAGAGAGGGT AGGACAACTC TCAAGCGCGG AGAGGTGGTG GGTAGTGGAG ATAGACGAGG GGCTGAGGTA TGGAGATTAG
|
Protein sequence | MEALVKASAP GRLDFLNTHQ DYKGLPVVSV AVDLRTTVTL RRGEEFEITS LNTGERCHFS KPLIEGRSFC DYVKAAVISV EREGVVLRGF SGELYSDIPI GAGMASSAAM LVALVGAMLR LAGRGADLYT VAELAYRAER EVLGVPCGRL DQYGSAFGKV AVIYPKPPVR VERLEMRGGV FVVLDSGIRH STAEIHPKRQ AELQEAVEIL REALGVESEG YWDFPWEVLE ARRGVVETLP QPLRDRVLFT LEMQKSTERA LAYLRGADVD KALREVGREM LYQHHLLSRL YEVSLPKLDQ LVEEAVAAGA YGAKLSGAGL GGVVIALAPN REVAERVGQL SSAERWWVVE IDEGLRYGD
|
| |