Gene Information |
Locus tag | Pars_2254 |
Symbol | |
ID | 5054286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 2019890 |
End bp | 2021161 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 60% |
IMG OID | 640469807 |
Product | glutamate-1-semialdehyde aminotransferase |
Protein accession | YP_001154452 |
Protein GI | 145592450 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0001] Glutamate-1-semialdehyde aminotransferase |
TIGRFAM ID | [TIGR00713] glutamate-1-semialdehyde-2,1-aminomutase |
|
|
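The length fields above can be cross-checked against the coordinates. The sketch below assumes that Start bp and End bp are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. Neither convention is stated in the record, although its own numbers are consistent with both. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues encoded by an unspliced CDS.

    Assumes the final codon is a stop codon, so it is subtracted.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Pars_2254.
length = gene_length_bp(2019890, 2021161)
print(length)                     # 1272
print(protein_length_aa(length))  # 423
```

Both results agree with the Gene Length (1272 bp) and Protein Length (423 aa) fields.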
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.100829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 0.0981223 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTTTG AAAGGGCTAG GCAAGTCTTC CCCGGCGGGG TTAATTCCCC TGCCAGGGCT CTCAAACACC TCCCGTCGTC GCTCGTCGCA AGGGCCGCCT CTGGGCCCTA CCTATACACC GACCGCGGGA GGCTTGTGGA CTACTGCATG GCGTTTGGCG CCATAATCCT CGGCCACGCC CACCCCCGGG TGAAAAGGGC CGTGGAGGAG CAGCTGGAGA GGGGCTGGAT ATACGCCCTG CTCACCGAGC AGGAGGTGGA ATTCGCCGAG GCCATAAGGC GGCACATGCC CTCTGTGGAG AAGATGCGGA TAGTGAATAC TGGAACCGAG GCCACGATGA ACGCCATAAG GCTCGCCCGG GGCTACACGA AGCGCGACGT GATAATTAAA TTCGACGGAA ACTTTCACGG CTCCCACGAC TATGTTTTGG TCAAGGCCGG CTCCGGGGCG GCGACTTGGG GCATACCCAC AAGCGCCGGC GTGCCGCAAG ACGTAGTCAA GCTGACGGTA GTGGCGCCTT ACAACGACGT AGACGCATTC CTCAAGGCAG TAAAGGAAGT GGGGGACAGA CTAGCGGCGG TGATTGCGGA GCCGGTGGCG GGGAACTACG GCCTCATAAT ACCCGACGCG GAGTTTCTTA AGGCGCTGAG GGAGGAGACC AAACGCGTAG GGGCCCTCCT GATATTTGAC GAAGTAATTA CGGGCTTTAG GCTGGGCCTC GGCGGTGCCC AAGGCCGCTT CGGCATAAGG CCAGACCTCA CCACCCTGGG CAAGGCCGTG GGCGGAGGCT TCCCCATCGG TATATTCGGT GGAAGGGCAG AGGTGATGGA CTTGGTCGCG CCCAGCGGCC CCGTGTACAA CGCAGGCACG TACAACGCCC ATCCTGTCTC GGTGACTGCC GGCCTCGCCG TGTTGAAAGA GCTGGAAACC GGTGAGCCCT TCCGCACAGC AGACGAGGCG GCGGAGAGGC TTGCCAAGGG CATAGAGGAC ATCGCCGGGA GGCTCGGCTT TGACGTGGTC GTGAAGAAGA TAGCCTCCAT GTTCCAGTTC TACTTCAAGA AAGGCGACGT GAAGACCCCC CAAGACGTCA GGGAGAGCAA CGAGAAAATG TACCTAAAAC TCCACGAGAT CGCGCTTAGA CACGGCGTCT ACCTAACCCC CTCCCAGTTC GAGGTGAACT TCACATCGGC AGCTCACACC AGAGAGGTGG TCGAGGAGAC CCTCGCCGCG CTGGAGAAGG CCTTTCAACA ATTAAAGACG GAAATCGGGT AG
|
Protein sequence | MLFERARQVF PGGVNSPARA LKHLPSSLVA RAASGPYLYT DRGRLVDYCM AFGAIILGHA HPRVKRAVEE QLERGWIYAL LTEQEVEFAE AIRRHMPSVE KMRIVNTGTE ATMNAIRLAR GYTKRDVIIK FDGNFHGSHD YVLVKAGSGA ATWGIPTSAG VPQDVVKLTV VAPYNDVDAF LKAVKEVGDR LAAVIAEPVA GNYGLIIPDA EFLKALREET KRVGALLIFD EVITGFRLGL GGAQGRFGIR PDLTTLGKAV GGGFPIGIFG GRAEVMDLVA PSGPVYNAGT YNAHPVSVTA GLAVLKELET GEPFRTADEA AERLAKGIED IAGRLGFDVV VKKIASMFQF YFKKGDVKTP QDVRESNEKM YLKLHEIALR HGVYLTPSQF EVNFTSAAHT REVVEETLAA LEKAFQQLKT EIG
|
| |
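The sequence fields are printed in space-separated blocks of ten characters, so they need cleaning before any computation. The following is a minimal sketch, using only the Python standard library, of how the gene sequence could be stripped of spaces, checked for GC fraction, and translated. It assumes that the amino acid assignments of translation table 11 match the standard genetic code (the two tables differ in permitted start codons, which this sketch does not model). The helper names are hypothetical, and the examples use only short fragments copied from the record.

```python
from itertools import product

# Codon table built in the conventional TCAG order. Assumption: the
# codon-to-amino-acid assignments of table 11 equal the standard code.
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(_BASES, repeat=3), _AMINO)}


def clean_sequence(raw: str) -> str:
    """Remove the whitespace used to group the sequence into blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a cleaned DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        amino = CODON_TABLE[dna[i:i + 3]]
        if amino == "*":
            break
        protein.append(amino)
    return "".join(protein)


# First three blocks of the gene sequence, as printed in the record.
head = clean_sequence("ATGCTTTTTG AAAGGGCTAG GCAAGTCTTC")
print(translate(head))              # MLFERARQVF
# Last four codons of the gene sequence, including the stop codon.
print(translate("GAAATCGGGTAG"))    # EIG
```

The two translated fragments match the first block (MLFERARQVF) and the last three residues (EIG) of the protein sequence above. Passing the full cleaned gene sequence to `gc_fraction` gives a value to compare with the reported GC content of 60%.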