Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_0159 |
Symbol | |
ID | 5055661 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 145119 |
End bp | 146132 |
Gene Length | 1014 bp |
Protein Length | 337 aa |
Translation table | 11 |
GC content | 61% |
IMG OID | 640467738 |
Product | aminotransferase, class I and II |
Protein accession | YP_001152426 |
Protein GI | 145590424 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCGCTAC CGGAGCTGGA CTTCGCCTAC GACGAACCCG ACGTCGGTTA TAAAAAAGTC AGGCTTCACT TTAACGAGAA CTTATTCCTC CCCGACGAGT ACTATAAGGC AGTTGCGACG TCTCTTGAGC CGTGGGAGCT CAGATATTAT ACAGATCCGA ACAATAAGAG GCTGGCCTCT GCTATTGAGT CGCACCACGG CCTCCCCCCG GGGACTGTGG TGATCACGGC GGGCGCTGAC GAGGGGCTGA GGATGGCGAT GCAGCTGGCG GCCCACATGG GGAGGGGCCT CGCCATTGTG GAGCCCACAT ACGGAATGGC GCGGGTCGTG GCGAGGCAGG TAGGCCTGAG ACCGGCGGCG GCCGCATATC GGGAGGACCT CTCCCTAGAC GTGGAGGAGG TTGCGAGGTC GGGGGTCGGC GCAGTCTATG TCTGCTCACC TAACAACCCT ACGGCCCACG TGGTTAAGGA GGTGGAGGAG CTGGCGGCGA GGTTCAACGG CCTTATAATA CTCGACGCGG CCTACGCCGA GTTCGCGGGC TACTGGAGGC CGAGGCTGTA CGAGTACGGC AACGTGGTTG AGGTTAGGAC CTTCTCCAAG GCATGGGGCC TCGCAGGGCT TAGGGTGGGC TACGTCGTGA CCAACAAGCG AGTCGCAGAC GCGCTTAGGG CTCTCTCCCT CCCCCACCCC ATATCGGCCT ACTCGGCAAA GGTGGTGGAG AAGGCACTGG AGGTAGGCAA GCCATATGTG GAGAGGTCAA TAGAGGAGCT GAAAGAGGTG AGGAGCTGGG TCCTCTCGCA GTTGAAAGCA GACGGCTACC ACGGGCCGAC TAACTTCGTA ACGCTGAAGG TGGACGACGC AGAGGCCGCG GCGGCGGAGC TAGACAAAAA GGGATACGTC GTGAGGGTTC TCGGCGGGAA GCCCCTCTGC CGTTCGTGTA TCCGCTTCAC GTTGGCCCCG CGACCCGTCA TGGAGGGCTT CCTCAAAGCG CTCGGCGCCG CGCTTAAAAT TTAA
|
Protein sequence | MSLPELDFAY DEPDVGYKKV RLHFNENLFL PDEYYKAVAT SLEPWELRYY TDPNNKRLAS AIESHHGLPP GTVVITAGAD EGLRMAMQLA AHMGRGLAIV EPTYGMARVV ARQVGLRPAA AAYREDLSLD VEEVARSGVG AVYVCSPNNP TAHVVKEVEE LAARFNGLII LDAAYAEFAG YWRPRLYEYG NVVEVRTFSK AWGLAGLRVG YVVTNKRVAD ALRALSLPHP ISAYSAKVVE KALEVGKPYV ERSIEELKEV RSWVLSQLKA DGYHGPTNFV TLKVDDAEAA AAELDKKGYV VRVLGGKPLC RSCIRFTLAP RPVMEGFLKA LGAALKI
|
| |