Gene Pars_1772 details


Gene Information

Locus tag: Pars_1772
Symbol: guaA
ID: 5055380
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1592247
End bp: 1593764
Gene Length: 1518 bp
Protein Length: 505 aa
Translation table: 11
GC content: 51%
IMG OID: 640469317
Product: GMP synthase
Protein accession: YP_001153975
Protein GI: 145591973
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0518] GMP synthase - Glutamine amidotransferase domain
        [COG0519] GMP synthase, PP-ATPase domain/subunit
TIGRFAM ID: [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
            [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit
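
The coordinate and length fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions and the gene length is taken to include the stop codon. Both readings are assumptions, not something the record states. A minimal Python sketch of that arithmetic, with illustrative function names that are not part of the record:

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS ends with exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


if __name__ == "__main__":
    length = gene_length_bp(1592247, 1593764)
    print(length, "bp")                      # compare with the Gene Length field
    print(protein_length_aa(length), "aa")   # compare with the Protein Length field
```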


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.564322
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.00463456
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
GTGGAGAAGA TCCTTGTCGT AAATCTAGGC GGACAATATG CTCACCTAAT AGCGAGAAGA 
ATTAGAGAAA TCGGGGCATA CGCGGAGATT GCTTCATACG ACTCCGTTCT CGAAGTGGCG
AAAAGCGATG AGGTAAAAGC CATAGTTCTC TCGGGAGGCC CGGCTTCGGT GTACGAACCA
AACTCGCCAG ATCTGCCGGT GGACCTCCTT TACCTCGGAA AACCTGTGCT TGGCATATGC
TTCGGACACC AGTGGATAGC CAAAAAACTA GGAGGAGTCG TAGAACGCGG CAAGGGCGAG
TACGGAAAGA CTATGGTTAG AATCTTGTCT AGCGACCCCC TCTTTGAGGG GTGGGAAGCA
GAGGAGGTGG TTTGGATGAG CCACAGCGAC TACGTAAAGG AGGTCCCAAG CGGCTTCCAA
GTCTTGGCAG TAAGCGAAAA CGGCTACGTT GCGGCTATGA GGAGCGGCCA CATCTACGGA
GTTCAGTTCC ACCCTGAGGT TAGACACACG GTAAAGGGGA TTCGACTTTT AGAAAACTTT
GTGAGAAAGG TTGCCGGCAT CAAATCGGTT TGGGTGCCTG AAGAGCAAGT TGGTAAAATA
GTGGAAGAGA TAAAAGCAAT GGTCAAGGAA GGCATTGTAG TCGTAGGCGT TAGCGGTGGC
GTGGACAGCA CCGTCACCGC GGTTCTCCTC CACATGGCGT TAGGGAGCCG CGTCAAGGCA
GTCTTCATCG ACCACGGCTT ATTCAGAGAG GGGGAGCCCG AGAAGGCTGT ACAGCTCCTT
AGATCAGTAG GAATAGACGT GTTGTATATC GACGCGAGAG AGCGTTTCCT TAAAAAGCTT
GATGGGGTGT CCGACTGCGA AGAGAAGAGA AGAATTGTCG GAGAGACCTT CGCCGAGGTG
TTCACAGAAG TGGTGGCTGG GATACCTAAT GCCAAATACC TCGCACAAGG TACGCTGTAT
CCAGACGTCA TAGAAAGCGG AGCAGTAAAG GGCGCAGATA GAATAAAAAG CCACCACAAC
GTCGGAGGGA TCCCACCCTG GTTTACCCTA GAACTAATCG AACCGCTTAG AGACTTTTAC
AAAGACGAGG TAAGAAGAAT AGCAAAGTCC CTTGGTTTGC CAGATGAGGT AGTTTACCGA
CATCCCTTTC CCGGTCCAGG TCTTGCAGTA AGGATAATAG GGCCCTTTAC CCTTGAGAAG
CTTGAAATAG TGAGGAGAGC TACAAAGATT GTAGAAGAAG AGCTGGAAAG GGCGGGTCTT
CTGAGAAAGG TATGGCAAGC CTTCGCCACA GTGGGTGAGG ATAAATGGGT GGGAGTAAAG
GGAGATAGAC GCGCCGAGGG CTACATAGTT ACAGTTAGGG TTGTGGAAAG CGAAGATGCT
ATGACAGCCG ACTGGGCCAA GATCCCACAC GACGTCTTGG ACAAGATCTC CTCGCGTATT
GCATCAGAAA TTCCACACGT AACAATGGTT ACATATGCAA TTACCTCCAA ACCCCCCTCA
ACAATTGAGC CGTGTTAA
 
Protein sequence
MEKILVVNLG GQYAHLIARR IREIGAYAEI ASYDSVLEVA KSDEVKAIVL SGGPASVYEP 
NSPDLPVDLL YLGKPVLGIC FGHQWIAKKL GGVVERGKGE YGKTMVRILS SDPLFEGWEA
EEVVWMSHSD YVKEVPSGFQ VLAVSENGYV AAMRSGHIYG VQFHPEVRHT VKGIRLLENF
VRKVAGIKSV WVPEEQVGKI VEEIKAMVKE GIVVVGVSGG VDSTVTAVLL HMALGSRVKA
VFIDHGLFRE GEPEKAVQLL RSVGIDVLYI DARERFLKKL DGVSDCEEKR RIVGETFAEV
FTEVVAGIPN AKYLAQGTLY PDVIESGAVK GADRIKSHHN VGGIPPWFTL ELIEPLRDFY
KDEVRRIAKS LGLPDEVVYR HPFPGPGLAV RIIGPFTLEK LEIVRRATKI VEEELERAGL
LRKVWQAFAT VGEDKWVGVK GDRRAEGYIV TVRVVESEDA MTADWAKIPH DVLDKISSRI
ASEIPHVTMV TYAITSKPPS TIEPC
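
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence. The sketch below is one way to do that in plain Python, not the method used to produce this record. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code and that an initiating GTG (or TTG) is read as methionine. The helper names `clean`, `gc_fraction` and `translate_cds` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (first, second, third base).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumption: a commonly cited subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(blocked_seq: str) -> str:
    """Remove the spaces and line breaks used to print 10-letter blocks."""
    return "".join(blocked_seq.split()).upper()


def gc_fraction(blocked_seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(blocked_seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate_cds(blocked_seq: str) -> str:
    """Translate complete codons; a leading start codon is read as M."""
    seq = clean(blocked_seq)
    codons = [seq[i:i + 3] for i in range(0, len(seq) - len(seq) % 3, 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"
    protein = "".join(residues)
    # Drop a single terminal stop codon, if present.
    return protein[:-1] if protein.endswith("*") else protein


if __name__ == "__main__":
    first_line = "GTGGAGAAGA TCCTTGTCGT AAATCTAGGC GGACAATATG CTCACCTAAT AGCGAGAAGA"
    print(translate_cds(first_line))  # compare with the start of the protein sequence
```

Passing the full gene sequence to `translate_cds` and comparing the result with the listed protein sequence, with spaces removed, checks the whole record. `gc_fraction` can be compared with the GC content field in the same way.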