Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_28094 |
Symbol | GAT1 |
ID | 4850876 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | - |
Start bp | 299893 |
End bp | 302355 |
Gene Length | 2463 bp |
Protein Length | 820 aa |
Translation table | |
GC content | 44% |
IMG OID | 640392584 |
Product | activator of transcription of nitrogen-regulated genes |
Protein accession | XP_001387707 |
Protein GI | 126273822 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACATCA ATTCAAACTA CGACCCACCT GGATCTCCCA TGAACAGCCA TACGCCTTCG ACGACGTCGA CGCTGGCAGC CGCACATACT TCCGCTTCGT CTTCGTACCA ACACAAGTTC CATTTCCACC CAACCAGAAC TACGGCCTCT TCCACGGGAA GCAACCACCA CCACAACAAC AATAATGCCA AACAGACAGT TTCGATAAAG GCCCTTTTGG CAGATGAAGT AGAGAACATC GAGGGACTCT GGAGAATGTA CAACAAGGCT AAAGAGTCGC TACCATATAA GGCTCGTATG GAAAACTTGA CCTGGAGAAT GATGTACATC ACCAATAAAC GACTAGAAGT CAAGAAAGAG AGTATCCACA TTAAAATGGA ACACGAAGCT GAGATTTATG AAAACTCTAA TAATGTTCCT GAGAATTCAG TATCGCCGTC GTTGGATCCA GCTGCTGAGG ACTTTGACTA TGTAGCTCAT ATTCGAAAGA TGGGGCAGAA CAATCTAGAG AATGATGCTG GTGATTCTGA AATTGACATC AATAATGACG ATTCTAACGA TTCAAACGAA GCTGTCAACT TCAGAAAGAG ACCTGCTGAC TTCTCTCCCA TGATTACTAG CCACGCTGGA CCTGGTTCTA TCACAGGGAT TCACTCCAAC TTGTCGATGT CGTTGAACCA GGAGAAGTTG AGGAGTCAGA TTCAAAATCA GAGTCAAAGT GTACATCCTT TACGAACGAG TATCCAGCCC CATATCTTAC CTTCTCAGCA TCTTCCACTT AATGACTTGG ACCATTCGGA TCACCGAAGC CAGTCAAACA GTATTCCAGA CCACCACAAC GACCTTTACG ACCACAACCA TGATGACCAT GACCACCACC TTGAACTGCT TCATGTGGGG CACCACGGCA TGGGAGAATC TTCAGCGTTT GAGTTTTCTC TCGATCCTTT GGCTTTTGAA GGTCCGAACA ACAACTACAA CGATGACATT AACATGGACA TTCTTGCTAA TGGAACCAAT AGTAGATTCG ATGATATGGA AGAATACCAT ACCAGAAACC CCAGTCTTCA TAGTCAAACT ATTGGACCTA CCAGCATTCT CCATAACTAC AATGATCACC ACCACAGTAA TAGTAACAAC AGCAATAGTA ATTCTCGCTA TGGACATTCT AACAGTGTTG TTTCGGTCGT AGCAACCCCT ACAAATCTCT TGAGGCACGA CAATTCCATC ATCAGTTTAC CAGACTTCAG CCATAACTCG GCTAGTCTTC ATCTGCAACA TCAGCAGCCT CCTTTGAGTC GCTCTATAAC ACAAACTCCT ACTAATTTTT CACGGTCTTC GAACGGAAAC GACACTTTCC AATTCAACCC TTCTTTTTCT GGCGTCCAGC AATCACCTGG TCTCGATTTT CCGACTCCTC AGCTCAACAA CCAGCCATTT ACCGATTCTT ATTTTGACAG CATTGGATCT GGAAGCATTC CTAACAAGAA GGGTGCGTTT CCCAAACAGT TCAGCTTCAC GGGTCTGGAA ACTGAGACTA CTCCACATTC TCTTCCTTCA CAGACGACAC TTACCAGCTG GAGATCTTCT GTTGATAAGC CAGACAAGAT TTCCAAGCCT TCATCGAAGA AGTCCAAGTC AGAAAAGTTC AAGAACTCTT CTGAAAAGTC GAAATCAAAG AAGACATCGA GCCCCGCAGA AACTCCACGA TCTCTGGGCC AGCTCAAGAG TTCTCAATCG ACAACTTCGT TGTCTTCGAT GCAGCCGGGA GTCTCTTGTA CTAATTGTCA CACCCAAACG ACACCATTGT GGAGAAGAAA TCCACAGGGA CTACCCTTGT GTAATGCCTG TGGTCTTTTC TTGAAGTTGC ATGGAGTCGT TCGTCCCTTA AGTTTGAAAA CAGATGTCAT CAAGAAAAGA CAGAGAAATA CGAACCCCAA GAAGTCTATC AGTGGTTCTA GTAAAGACAA GGACGGGGAC GACTTGAACC CTACCTCCAT TTGCAAAAGC GACACGAAGA TCATAAAAAG TCTCGTAGCA GGAGGAAGTG ACACGTCCGA AACACTCGCA GTTGATGGCG AAGACTTGAA GTTTGAAACT CCAATCTTGC TCTCTTCCAA GAAGAAACCA ACTAGAAACG CCTCTGTGAC TTCTTCTTTA ACTATGACAC CAAAAAAGAC TTCTACGAAA GCGAAAAGTA CAAAAGCTTC GCCTAAGAAA GTCTCTGTCA AGAAAGAAAA AAATGGCTTC GTTTTGAAGA CAGAAGGTGA AGATTACGTA GATATAGACC ACGACAACGA GTTCATCAAT GTGCTCAACT CAGTAGACCA AAACTTACCA CAAGCCCGTG GCAATCTGGA AAACCAGAAC GACCAACATG TCATGAACAG CAACGGGCAC GATCTTGAAA ACGCTGGAGA ACAAAATGGT AACAACTGGG ACTGGTTAAG CATGACCCTA TAG
|
Protein sequence | MNINSNYDPP GSPMNSHTPS TTSTLAAAHT SASSSYQHKF HFHPTRTTAS STGSNHHHNN NNAKQTVSIK ALLADEVENI EGLWRMYNKA KESLPYKARM ENLTWRMMYI TNKRLEVKKE SIHIKMEHEA EIYENSNNVP ENSVSPSLDP AAEDFDYVAH IRKMGQNNLE NDAGDSEIDI NNDDSNDSNE AVNFRKRPAD FSPMITSHAG PGSITGIHSN LSMSLNQEKL RSQIQNQSQS VHPLRTSIQP HILPSQHLPL NDLDHSDHRS QSNSIPDHHN DLYDHNHDDH DHHLELLHVG HHGMGESSAF EFSLDPLAFE GPNNNYNDDI NMDILANGTN SRFDDMEEYH TRNPSLHSQT IGPTSILHNY NDHHHSNSNN SNSNSRYGHS NSVVSVVATP TNLLRHDNSI ISLPDFSHNS ASLHLQHQQP PLSRSITQTP TNFSRSSNGN DTFQFNPSFS GVQQSPGLDF PTPQLNNQPF TDSYFDSIGS GSIPNKKGAF PKQFSFTGLE TETTPHSLPS QTTLTSWRSS VDKPDKISKP SSKKSKSEKF KNSSEKSKSK KTSSPAETPR SLGQLKSSQS TTSLSSMQPG VSCTNCHTQT TPLWRRNPQG LPLCNACGLF LKLHGVVRPL SLKTDVIKKR QRNTNPKKSI SGSSKDKDGD DLNPTSICKS DTKIIKSLVA GGSDTSETLA VDGEDLKFET PILLSSKKKP TRNASVTSSL TMTPKKTSTK AKSTKASPKK VSVKKEKNGF VLKTEGEDYV DIDHDNEFIN VLNSVDQNLP QARGNLENQN DQHVMNSNGH DLENAGEQNG NNWDWLSMTL
|
| |