Gene Information |
Locus tag | Pars_0120 |
Symbol | |
ID | 5055245 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | - |
Start bp | 107883 |
End bp | 108899 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 62% |
IMG OID | 640467699 |
Product | aminotransferase, class I and II |
Protein accession | YP_001152387 |
Protein GI | 145590385 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase |
TIGRFAM ID | [TIGR01141] histidinol-phosphate aminotransferase |
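
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is a minimal illustration, assuming that Start bp and End bp are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are hypothetical and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 107883
end_bp = 108899

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codon_count, remainder = divmod(gene_length, 3)

# Assumption: the last codon is a stop codon and is not translated.
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Codons: {codon_count} (remainder {remainder})")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the computed values agree with the Gene Length (1017 bp) and Protein Length (338 aa) fields.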
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTCCCTC CTCCAGCCGG TGTCCTGAAG CTGGATCAAA ACGAGGTGCC TATTCCCCCT CCCAGCCACG TTGTGGAGGC GGCGGCTCAG GCCTTGGCCG CGGCCAACTG GTACCAGCCC GCCGCCCTCT ACGACGAGGT TAGGCAACTC TACGCCGAAT ACGCCGGCGT AGATCCCCAA CGGGTGTGGC TCTTCCCCGG CGCCGACGAC TTTTTCGAGC CGCTTCTGAG AAGCGCAAAG ACGGTGGCCG CGCCAAGCCC CACGTACTTC CTCTTCGAGG ACCAGGCCCA GTTCCACGGC GCCGCGCTGA TCAAGACGCC GCTGAGGGGC GAGGACTTCC GCCTAGACCT CGCCGAGTTT CTAAACGCCG CCAAGAAGGC AGACGCCGTC TATATAGACA ACCCCAACAA CCCCACCGGC CAGTTGTTGG TAAAGCCAAG CGAGGTAGAG GAGGTCCTCG CCATCGGAAA GCCAACGGTG GTGGACGAGG CGTACTTCGA GTTCTCCGGC GTCACCGCCG CCGCACTTGT GGAGGCGTGG CCCAACCTCG CCGTTGTGAG GACTCTGTCG AAGGCCTTCC TCCTGGCCGG CTTCAGGGTA ACGCCAGTAA TAGCCGGGCG GGGGTGGAGG ATCCACTACA ACACAGTGAG GTTCAGGGTT TCCCTCCCCT CCCTCGCCGC GGCTAGGGCG GCGCTTTACA AGAGGGACTA CGTCGAGGAG GTGGTAAGGC AGATAAGGGA AGGCGCCCGG TACTTGGCAG AGGAGTTGCC AAAACTAGGA GTGAGGGTCT GGCCTACCCA CGCAAACTTC GTACTAGTCC GGGGCCCGCC GGGCTTCGCC CAAACGCTGA GAAAATACGG CGTGTGGGTC AGAGACGAGG AGCAAAGGCT TGGACCTGGC TACGCCAGAA TTACAGTGGG GACAAGAGAG ATAAATGCCC AGCTCGTGAG GACGATAAAA ACGGCGTTTG CTACTCGGCA ATCTCAGCCA TCATCATCCT CAGCCTCCTC CTCCTGA
|
Protein sequence | MLPPPAGVLK LDQNEVPIPP PSHVVEAAAQ ALAAANWYQP AALYDEVRQL YAEYAGVDPQ RVWLFPGADD FFEPLLRSAK TVAAPSPTYF LFEDQAQFHG AALIKTPLRG EDFRLDLAEF LNAAKKADAV YIDNPNNPTG QLLVKPSEVE EVLAIGKPTV VDEAYFEFSG VTAAALVEAW PNLAVVRTLS KAFLLAGFRV TPVIAGRGWR IHYNTVRFRV SLPSLAAARA ALYKRDYVEE VVRQIREGAR YLAEELPKLG VRVWPTHANF VLVRGPPGFA QTLRKYGVWV RDEEQRLGPG YARITVGTRE INAQLVRTIK TAFATRQSQP SSSSASSS
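
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that the gene sequence shown is already the coding strand (it begins with ATG even though the gene lies on the minus strand) and that the amino acid assignments of translation table 11 match those of the standard genetic code, with table 11 differing only in its permitted start codons. The function name `translate` and the compact codon table construction are illustrative choices, not part of the record. Only the first 30 bases are translated here.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G at each position).
# Assumption: amino acid assignments for table 11 equal the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    # Remove the spaces that separate the 10-base groups in the record.
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon
            break
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence, copied from the record.
first_30_bases = "ATGCTCCCTC CTCCAGCCGG TGTCCTGAAG"
print(translate(first_30_bases))
```

The result for these 30 bases corresponds to the first 10-residue group of the protein sequence, MLPPPAGVLK.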