Gene PICST_70121 details

Gene Information

Locus tag: PICST_70121
Symbol: GUA1
ID: 4837220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 2276282
End bp: 2277974
Gene Length: 1693 bp
Protein Length: 528 aa
Translation table: 12
GC content: 46%
IMG OID: 640388535
Product: GMP synthase
Protein accession: XP_001383192
Protein GI: 126133334
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0518] GMP synthase - Glutamine amidotransferase domain
        [COG0519] GMP synthase, PP-ATPase domain/subunit
TIGRFAM ID: [TIGR00884] GMP synthase (glutamine-hydrolyzing), C-terminal domain or B subunit
            [TIGR00888] GMP synthase (glutamine-hydrolyzing), N-terminal domain or A subunit
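
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (which is consistent with the listed 1693 bp) and that the unspliced CDS ends in a single stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2276282
end_bp = 2277974
protein_length_aa = 528

# Assumption: coordinates are 1-based and inclusive.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the gene is unspliced, so the CDS is one contiguous run
# of codons, one per residue plus one stop codon.
cds_length_bp = (protein_length_aa + 1) * 3

# Whatever remains is sequence outside the coding region.
flanking_bp = gene_length_bp - cds_length_bp

print(gene_length_bp, cds_length_bp, flanking_bp)
```

Under these assumptions the listed gene region is longer than the coding sequence alone, so the gene sequence shown in this record includes some non-coding bases on either side of the CDS.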


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0970266
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.0696406
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
CCCCCTTTAA ACTCTCCTCT AGTACTACCG TAAAACTAGT CTACAATGTC CGCTGCTGAT 
GTTCCTGTTG AAATCACCAA GGTGTTCGAC ACTATCTTGG TGTTAGACTT TGGTTCGCAA
TACTCCCATT TGATTACTCG TCGTTTGAGA GAATTCAATG TCTACGCTGA AATGTTGCCT
TGTACCCAGA AGATCTCTGA ATTGTCATGG AAGCCTAAGG GTGTGATTTT GTCTGGTGGT
CCTTACTCGG TTTACGCCGA GGACGCCCCT CACGTTGACC ACGATGTCTT CAAGTTGGAT
GTGCCTATCT TGGGTATCTG TTACGGTATG CAAGAAATTG CCTGGATCAA TGGTAAGGGT
GTTGCCAGAG GCGACAAGAG AGAATACGGT CCAGCCACTT TGAACGTCGA AGACAGCAGT
AACGCTCTTT TCCATGGTGT AGACCATTCT CAGGTCTGGA TGTCCCACGG TGACAAGTTG
CACGCCTTGC CTACCGGCTA CAAGATCGTT GCTACCTCCG ACAACTCACC ATATGCCGCT
ATCTACAACG AAACCGACAA TATCTACGGT ATCCAGTTCC ATCCAGAAGT CACCCACACC
ATCCAGGGTA AGACTATCTT GAAGAACTTC GCTGTTGACA TCTGTAAGGC TAACACCAAC
TGGTCTATGG AAAACTTCAT CGACACTGAG ATTGCCAGAA TCAGAAAGTT GGTTGGTCCT
ACTGCCGAAG TCATCGGTGC TGTTTCCGGA GGTGTGGACT CCACTGTCGG TGCAAAGATC
TTGAACGAAG CTATTGGCGA CCGTTTCCAT GCCATCTACG TCGACAACGG TGTGATGAGA
AAGAACGAAA CCGAAACCGT CTACAAGACC TTGACTGAAG GCTTGGGAAT CAACTTGACT
GTAGTTGATG CTTCTGAATT GTTCTTAGGT AGATTGAAGG GTGTCACCGA TCCTGAAAAG
AAGAGAAAGA TCATTGGTAA CACCTTCATC CACGTTTTTG AAGAAGAAGC TGCCAAGATC
ACGCCAAAGT CCGGTCAGGA AATTGAGTTC TTGTTGCAAG GTACTTTGTA CCCAGATGTT
ATCGAATCTA TCTCGTTCAA GGGTCCTTCT CAAACCATCA AGACTCACCA CAACGTCGGT
GGTTTGTTGG AAGACATGAA GTTGAAGTTG ATTGAACCTT TGAGAGAATT GTTCAAGGAC
GAAGTACGTC ACTTAGGTGA GTTGTTGGGT GTTCCAACCG ACTTGGTTTG GAGACATCCT
TTCCCAGGTC CAGGTTTGGC TATCAGAGTC TTGGGTGAAG TTACAAAGGA ACAGGTTGTC
ATTGCTCGTG AAGCCGATGC CATCTTCATT GAAGAAATCA AGAAGGCTGG TTTGTATAGA
GAAATCTCGC AAGCATTTGC TGCCTTGTTG CCTGTCAAGT CTGTCGGTGT CATGGGAGAC
CAAAGAACCT ATGACCAGGT CATTGCTCTC AGAGCCATCG AAACTGTTGA TTTCATGACT
GCCGACTGGT ACGTCTTTGA AGCTTCCTTC TTGAAGAGAG TCGCTTCAAG AATCGTCAAC
GAAGTTGATG GAGTTGCTCG TGTCACCTAC GACATCACCT CTAAGCCTCC AGCTACTGTT
GAATGGGAAT AGAGAATTTA GAACTAATAG TCTGTACAAT AAAAGAAGTA TATTAAATCA
TAGCATATAA TTT
 
Protein sequence
MSAADVPVEI TKVFDTILVL DFGSQYSHLI TRRLREFNVY AEMLPCTQKI SELSWKPKGV 
ILSGGPYSVY AEDAPHVDHD VFKLDVPILG ICYGMQEIAW INGKGVARGD KREYGPATLN
VEDSSNALFH GVDHSQVWMS HGDKLHALPT GYKIVATSDN SPYAAIYNET DNIYGIQFHP
EVTHTIQGKT ILKNFAVDIC KANTNWSMEN FIDTEIARIR KLVGPTAEVI GAVSGGVDST
VGAKILNEAI GDRFHAIYVD NGVMRKNETE TVYKTLTEGL GINLTVVDAS ELFLGRLKGV
TDPEKKRKII GNTFIHVFEE EAAKITPKSG QEIEFLLQGT LYPDVIESIS FKGPSQTIKT
HHNVGGLLED MKLKLIEPLR ELFKDEVRHL GELLGVPTDL VWRHPFPGPG LAIRVLGEVT
KEQVVIAREA DAIFIEEIKK AGLYREISQA FAALLPVKSV GVMGDQRTYD QVIALRAIET
VDFMTADWYV FEASFLKRVA SRIVNEVDGV ARVTYDITSK PPATVEWE
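
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below shows one way to translate with that table using only the Python standard library. The helper names (`codon_table_12`, `translate`) are hypothetical, and the start offset of 45 is an observation from lining up the first line of the gene sequence with the start of the protein sequence (MSAAD), not a field in the record.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code (NCBI table 1), codons ordered TTT, TTC, TTA, TTG, TCT, ...
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table_12():
    """Return a codon -> amino acid map for table 12 (CTG reassigned to Ser)."""
    table = {
        "".join(codon): aa
        for codon, aa in zip(product(BASES, repeat=3), STANDARD_AAS)
    }
    table["CTG"] = "S"  # the only difference from the standard code
    return table


def translate(dna, table):
    """Translate complete codons from the start of dna, stopping at a stop codon."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence as listed in this record.
first_line = "CCCCCTTTAA ACTCTCCTCT AGTACTACCG TAAAACTAGT CTACAATGTC CGCTGCTGAT"
seq = first_line.replace(" ", "")

# Assumption: the coding region starts at the ATG at 0-based offset 45.
cds_start = 45
print(translate(seq[cds_start:], codon_table_12()))
```

Passing the full gene sequence from the same offset should reproduce the protein sequence above up to the first in-frame stop codon. A library such as Biopython offers equivalent translation with a selectable table, if adding a dependency is acceptable.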