Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Pars_1772 |
Symbol | guaA |
ID | 5055380 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Pyrobaculum arsenaticum DSM 13514 |
Kingdom | Archaea |
Replicon accession | NC_009376 |
Strand | + |
Start bp | 1592247 |
End bp | 1593764 |
Gene Length | 1518 bp |
Protein Length | 505 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640469317 |
Product | GMP synthase |
Protein accession | YP_001153975 |
Protein GI | 145591973 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0518] GMP synthase - Glutamine amidotransferase domain [COG0519] GMP synthase, PP-ATPase domain/subunit |
TIGRFAM ID | [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.564322 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.00463456 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGGAGAAGA TCCTTGTCGT AAATCTAGGC GGACAATATG CTCACCTAAT AGCGAGAAGA ATTAGAGAAA TCGGGGCATA CGCGGAGATT GCTTCATACG ACTCCGTTCT CGAAGTGGCG AAAAGCGATG AGGTAAAAGC CATAGTTCTC TCGGGAGGCC CGGCTTCGGT GTACGAACCA AACTCGCCAG ATCTGCCGGT GGACCTCCTT TACCTCGGAA AACCTGTGCT TGGCATATGC TTCGGACACC AGTGGATAGC CAAAAAACTA GGAGGAGTCG TAGAACGCGG CAAGGGCGAG TACGGAAAGA CTATGGTTAG AATCTTGTCT AGCGACCCCC TCTTTGAGGG GTGGGAAGCA GAGGAGGTGG TTTGGATGAG CCACAGCGAC TACGTAAAGG AGGTCCCAAG CGGCTTCCAA GTCTTGGCAG TAAGCGAAAA CGGCTACGTT GCGGCTATGA GGAGCGGCCA CATCTACGGA GTTCAGTTCC ACCCTGAGGT TAGACACACG GTAAAGGGGA TTCGACTTTT AGAAAACTTT GTGAGAAAGG TTGCCGGCAT CAAATCGGTT TGGGTGCCTG AAGAGCAAGT TGGTAAAATA GTGGAAGAGA TAAAAGCAAT GGTCAAGGAA GGCATTGTAG TCGTAGGCGT TAGCGGTGGC GTGGACAGCA CCGTCACCGC GGTTCTCCTC CACATGGCGT TAGGGAGCCG CGTCAAGGCA GTCTTCATCG ACCACGGCTT ATTCAGAGAG GGGGAGCCCG AGAAGGCTGT ACAGCTCCTT AGATCAGTAG GAATAGACGT GTTGTATATC GACGCGAGAG AGCGTTTCCT TAAAAAGCTT GATGGGGTGT CCGACTGCGA AGAGAAGAGA AGAATTGTCG GAGAGACCTT CGCCGAGGTG TTCACAGAAG TGGTGGCTGG GATACCTAAT GCCAAATACC TCGCACAAGG TACGCTGTAT CCAGACGTCA TAGAAAGCGG AGCAGTAAAG GGCGCAGATA GAATAAAAAG CCACCACAAC GTCGGAGGGA TCCCACCCTG GTTTACCCTA GAACTAATCG AACCGCTTAG AGACTTTTAC AAAGACGAGG TAAGAAGAAT AGCAAAGTCC CTTGGTTTGC CAGATGAGGT AGTTTACCGA CATCCCTTTC CCGGTCCAGG TCTTGCAGTA AGGATAATAG GGCCCTTTAC CCTTGAGAAG CTTGAAATAG TGAGGAGAGC TACAAAGATT GTAGAAGAAG AGCTGGAAAG GGCGGGTCTT CTGAGAAAGG TATGGCAAGC CTTCGCCACA GTGGGTGAGG ATAAATGGGT GGGAGTAAAG GGAGATAGAC GCGCCGAGGG CTACATAGTT ACAGTTAGGG TTGTGGAAAG CGAAGATGCT ATGACAGCCG ACTGGGCCAA GATCCCACAC GACGTCTTGG ACAAGATCTC CTCGCGTATT GCATCAGAAA TTCCACACGT AACAATGGTT ACATATGCAA TTACCTCCAA ACCCCCCTCA ACAATTGAGC CGTGTTAA
|
Protein sequence | MEKILVVNLG GQYAHLIARR IREIGAYAEI ASYDSVLEVA KSDEVKAIVL SGGPASVYEP NSPDLPVDLL YLGKPVLGIC FGHQWIAKKL GGVVERGKGE YGKTMVRILS SDPLFEGWEA EEVVWMSHSD YVKEVPSGFQ VLAVSENGYV AAMRSGHIYG VQFHPEVRHT VKGIRLLENF VRKVAGIKSV WVPEEQVGKI VEEIKAMVKE GIVVVGVSGG VDSTVTAVLL HMALGSRVKA VFIDHGLFRE GEPEKAVQLL RSVGIDVLYI DARERFLKKL DGVSDCEEKR RIVGETFAEV FTEVVAGIPN AKYLAQGTLY PDVIESGAVK GADRIKSHHN VGGIPPWFTL ELIEPLRDFY KDEVRRIAKS LGLPDEVVYR HPFPGPGLAV RIIGPFTLEK LEIVRRATKI VEEELERAGL LRKVWQAFAT VGEDKWVGVK GDRRAEGYIV TVRVVESEDA MTADWAKIPH DVLDKISSRI ASEIPHVTMV TYAITSKPPS TIEPC
|
| |