Gene PICST_28094 details

Gene Information

Locus tag: PICST_28094
Symbol: GAT1
ID: 4850876
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009068
Strand:
Start bp: 299893
End bp: 302355
Gene Length: 2463 bp
Protein Length: 820 aa
Translation table:
GC content: 44%
IMG OID: 640392584
Product: activator of transcription of nitrogen-regulated genes
Protein accession: XP_001387707
Protein GI: 126273822
COG category:
COG ID:
TIGRFAM ID:
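The coordinate and length fields above are mutually consistent, which a short check can confirm. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the gene length counts the stop codon (the gene is listed as unspliced). The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(299893, 302355)
print(length)                  # 2463
print(protein_length(length))  # 820
```

Both results agree with the Gene Length (2463 bp) and Protein Length (820 aa) fields of the record.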


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACATCA ATTCAAACTA CGACCCACCT GGATCTCCCA TGAACAGCCA TACGCCTTCG 
ACGACGTCGA CGCTGGCAGC CGCACATACT TCCGCTTCGT CTTCGTACCA ACACAAGTTC
CATTTCCACC CAACCAGAAC TACGGCCTCT TCCACGGGAA GCAACCACCA CCACAACAAC
AATAATGCCA AACAGACAGT TTCGATAAAG GCCCTTTTGG CAGATGAAGT AGAGAACATC
GAGGGACTCT GGAGAATGTA CAACAAGGCT AAAGAGTCGC TACCATATAA GGCTCGTATG
GAAAACTTGA CCTGGAGAAT GATGTACATC ACCAATAAAC GACTAGAAGT CAAGAAAGAG
AGTATCCACA TTAAAATGGA ACACGAAGCT GAGATTTATG AAAACTCTAA TAATGTTCCT
GAGAATTCAG TATCGCCGTC GTTGGATCCA GCTGCTGAGG ACTTTGACTA TGTAGCTCAT
ATTCGAAAGA TGGGGCAGAA CAATCTAGAG AATGATGCTG GTGATTCTGA AATTGACATC
AATAATGACG ATTCTAACGA TTCAAACGAA GCTGTCAACT TCAGAAAGAG ACCTGCTGAC
TTCTCTCCCA TGATTACTAG CCACGCTGGA CCTGGTTCTA TCACAGGGAT TCACTCCAAC
TTGTCGATGT CGTTGAACCA GGAGAAGTTG AGGAGTCAGA TTCAAAATCA GAGTCAAAGT
GTACATCCTT TACGAACGAG TATCCAGCCC CATATCTTAC CTTCTCAGCA TCTTCCACTT
AATGACTTGG ACCATTCGGA TCACCGAAGC CAGTCAAACA GTATTCCAGA CCACCACAAC
GACCTTTACG ACCACAACCA TGATGACCAT GACCACCACC TTGAACTGCT TCATGTGGGG
CACCACGGCA TGGGAGAATC TTCAGCGTTT GAGTTTTCTC TCGATCCTTT GGCTTTTGAA
GGTCCGAACA ACAACTACAA CGATGACATT AACATGGACA TTCTTGCTAA TGGAACCAAT
AGTAGATTCG ATGATATGGA AGAATACCAT ACCAGAAACC CCAGTCTTCA TAGTCAAACT
ATTGGACCTA CCAGCATTCT CCATAACTAC AATGATCACC ACCACAGTAA TAGTAACAAC
AGCAATAGTA ATTCTCGCTA TGGACATTCT AACAGTGTTG TTTCGGTCGT AGCAACCCCT
ACAAATCTCT TGAGGCACGA CAATTCCATC ATCAGTTTAC CAGACTTCAG CCATAACTCG
GCTAGTCTTC ATCTGCAACA TCAGCAGCCT CCTTTGAGTC GCTCTATAAC ACAAACTCCT
ACTAATTTTT CACGGTCTTC GAACGGAAAC GACACTTTCC AATTCAACCC TTCTTTTTCT
GGCGTCCAGC AATCACCTGG TCTCGATTTT CCGACTCCTC AGCTCAACAA CCAGCCATTT
ACCGATTCTT ATTTTGACAG CATTGGATCT GGAAGCATTC CTAACAAGAA GGGTGCGTTT
CCCAAACAGT TCAGCTTCAC GGGTCTGGAA ACTGAGACTA CTCCACATTC TCTTCCTTCA
CAGACGACAC TTACCAGCTG GAGATCTTCT GTTGATAAGC CAGACAAGAT TTCCAAGCCT
TCATCGAAGA AGTCCAAGTC AGAAAAGTTC AAGAACTCTT CTGAAAAGTC GAAATCAAAG
AAGACATCGA GCCCCGCAGA AACTCCACGA TCTCTGGGCC AGCTCAAGAG TTCTCAATCG
ACAACTTCGT TGTCTTCGAT GCAGCCGGGA GTCTCTTGTA CTAATTGTCA CACCCAAACG
ACACCATTGT GGAGAAGAAA TCCACAGGGA CTACCCTTGT GTAATGCCTG TGGTCTTTTC
TTGAAGTTGC ATGGAGTCGT TCGTCCCTTA AGTTTGAAAA CAGATGTCAT CAAGAAAAGA
CAGAGAAATA CGAACCCCAA GAAGTCTATC AGTGGTTCTA GTAAAGACAA GGACGGGGAC
GACTTGAACC CTACCTCCAT TTGCAAAAGC GACACGAAGA TCATAAAAAG TCTCGTAGCA
GGAGGAAGTG ACACGTCCGA AACACTCGCA GTTGATGGCG AAGACTTGAA GTTTGAAACT
CCAATCTTGC TCTCTTCCAA GAAGAAACCA ACTAGAAACG CCTCTGTGAC TTCTTCTTTA
ACTATGACAC CAAAAAAGAC TTCTACGAAA GCGAAAAGTA CAAAAGCTTC GCCTAAGAAA
GTCTCTGTCA AGAAAGAAAA AAATGGCTTC GTTTTGAAGA CAGAAGGTGA AGATTACGTA
GATATAGACC ACGACAACGA GTTCATCAAT GTGCTCAACT CAGTAGACCA AAACTTACCA
CAAGCCCGTG GCAATCTGGA AAACCAGAAC GACCAACATG TCATGAACAG CAACGGGCAC
GATCTTGAAA ACGCTGGAGA ACAAAATGGT AACAACTGGG ACTGGTTAAG CATGACCCTA
TAG
 
Protein sequence
MNINSNYDPP GSPMNSHTPS TTSTLAAAHT SASSSYQHKF HFHPTRTTAS STGSNHHHNN 
NNAKQTVSIK ALLADEVENI EGLWRMYNKA KESLPYKARM ENLTWRMMYI TNKRLEVKKE
SIHIKMEHEA EIYENSNNVP ENSVSPSLDP AAEDFDYVAH IRKMGQNNLE NDAGDSEIDI
NNDDSNDSNE AVNFRKRPAD FSPMITSHAG PGSITGIHSN LSMSLNQEKL RSQIQNQSQS
VHPLRTSIQP HILPSQHLPL NDLDHSDHRS QSNSIPDHHN DLYDHNHDDH DHHLELLHVG
HHGMGESSAF EFSLDPLAFE GPNNNYNDDI NMDILANGTN SRFDDMEEYH TRNPSLHSQT
IGPTSILHNY NDHHHSNSNN SNSNSRYGHS NSVVSVVATP TNLLRHDNSI ISLPDFSHNS
ASLHLQHQQP PLSRSITQTP TNFSRSSNGN DTFQFNPSFS GVQQSPGLDF PTPQLNNQPF
TDSYFDSIGS GSIPNKKGAF PKQFSFTGLE TETTPHSLPS QTTLTSWRSS VDKPDKISKP
SSKKSKSEKF KNSSEKSKSK KTSSPAETPR SLGQLKSSQS TTSLSSMQPG VSCTNCHTQT
TPLWRRNPQG LPLCNACGLF LKLHGVVRPL SLKTDVIKKR QRNTNPKKSI SGSSKDKDGD
DLNPTSICKS DTKIIKSLVA GGSDTSETLA VDGEDLKFET PILLSSKKKP TRNASVTSSL
TMTPKKTSTK AKSTKASPKK VSVKKEKNGF VLKTEGEDYV DIDHDNEFIN VLNSVDQNLP
QARGNLENQN DQHVMNSNGH DLENAGEQNG NNWDWLSMTL
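
The sequences above are laid out in space-separated blocks of ten characters, so they need cleaning before use. The following is a minimal sketch of stripping that layout and translating the first line of the gene sequence. The record leaves the Translation table field blank, so the standard genetic code used here is an assumption, and the helper names `clean` and `translate` are hypothetical.

```python
from itertools import product

# Standard genetic code in TCAG order (an assumption: the record does not
# state which translation table applies).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the block spacing and line breaks from a sequence dump."""
    return "".join(seq_text.split()).upper()


def translate(dna_text: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna_text)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


first_line = "ATGAACATCA ATTCAAACTA CGACCCACCT GGATCTCCCA TGAACAGCCA TACGCCTTCG"
print(len(clean(first_line)))  # 60
print(translate(first_line))   # MNINSNYDPPGSPMNSHTPS
```

The translated fragment matches the first twenty residues of the protein sequence listed in the record.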