Gene PICST_89002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_89002 
Symbol 
ID4838696 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp340514 
End bp343543 
Gene Length3030 bp 
Protein Length970 aa 
Translation table12 
GC content40% 
IMG OID640390011 
Productpredicted protein 
Protein accessionXP_001384020 
Protein GI150864982 
COG category 
COG ID 
TIGRFAM ID[TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.887075 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTGTCT CTATCACAGA GCAGAAACAG ATTCTCCAGA GCTGTCTCTC GGCGATTAAG 
CACCAGTCTA ATCTAATGAA ACAATGCCTC AACGAGAACA ACATTCTTCA GGCTCTCAAG
CATTGCTCCA ACTTCCTCAA CGAGCTACGG ACCAATCAGT TAACACCAAA ACAATACTAC
GAGTTGTATA TCGCTGTTTT TGACTCTCTT GAAACCCTCC TGAACCATTT ACTCAACTCG
CACAATCTCA AACAGCACAA GCTTGAGAAG AGACAAGCTG CTTTGGATAG CACCTCAACT
TCAGACAAAA ATGCCGATGA TAAAAGCACT ACTCATAAAA ATGTTAAAAA TGGTGATGAA
ATTTCAAAAA ATGCTGTTGG AAAAAGTGCT ACGACTCCAT TCCTTGCAGA TTTGTACGAG
CTTGTTCAAT ACTCCGGCAA CATCGTACCC CGTCTCTATA TGATGATCGT CATTGGCACA
ACATATATGT CGACGAAAGG AGCTCCTGGC AAAGAGATCA TGAAAGATAT GATCGAGATG
TGTCGTGGAG TACAGCATCC TATTCGAGGC TTGTTTTTGC GTTACTACTT GTCGCAGAGA
ACGAAGCATT TGCTTCCGTT TCTGAACGCC AATGACTTCA ACGACACTGT AGAGTTTCTC
ATTTCCAACT TCATCGAGAT GAACAAGTTG TGGGTGCGTT TGCAACACCA GGGTCATTCG
TCGGAACGTG AGTTGAGATA CAGAGAGAGA AAGGAATTGA AGATCTTGGT AGGTTCCAAC
TTGGTCAGAT TATCGCAAGT CATTGACGAT TACAACGGCG ACGAAACCTA CTCCAGCATC
AAGTATTACC AGGATAAAGT ATTTCCTACC ATCACAGAAC AGATTATTCA GTGTCGTGAT
CATCTTGCCC AGAGCTATTT GATTGACGTT TTGATCCAGA TCTTTCCTGA TGACTTTCAC
TTTGCCACGT TGGACAGCTT GCTTAGTGAT GTTTTCCTCA ATTTGCATCC GTTGTTGAAG
AAGAGCGAGT TGGTAGCCAC GTTGATCGAG AGATTCATCA CCTATCACAA ATTTGAGTCC
GATATGTCTA CAAGTGAGAT CAAGGAGCTT TCTTTGGAAA GCGATGAGAA ACAGAAAAAG
ATTAAAACGA CTATTGATTC CACGCAATTG TTCAATTCTT TCTGGAAATT CTACTTGAAA
TTGTATGAAT TAGATCCACA ATTACCTTCA GAAGAGCATT CTGAGTTGCT ACAATCGTTT
ATTAGATTGT CGTTAACGTA CGATCCTAAC AACTACCAAA ACTTGGATGT AGTCTACAAA
TTTGCTACTG AGAAAGAGGG CCAAATCAAG GCTAATGCCG AGAATGATGA TATTTGGTTG
CAATTGTTGA TTGTTCCTAT TCGTCACTTT GATTCCATCA AGACCTTGTT CAAGTTGCCC
TTCTTCCACG AGTTCTATTT GAAGTTGTCC AACAAACAGC ACCAGAAACA GATATCGTTA
GAAATCATCA ACAAATTGCT AGGAATAACT ACCTATGGAG ATGAAGATGG TAATACCGTC
CAAGAGATTC ACGAGCCGGA GACTTTCACC ACCACTGAAG AGGTAGACGG GATTTTCAAG
TACTTATTGG TCTTGATCAA AGATTCTGAC AAGCAAAACA GTACCTCCAA GAACTTGGGG
GTCACAAAGA GCATAACCAT CAACAAAGGA GAAAATGTGA TTTCGCATGA GTTCTTGAGC
AACCAGGAGA AGATCTGTAA AGTTATTCAT TTAATTGAAA ACCCCAGTGA TCCTTTTAAG
AACTTGTCCA ATTTAATGTA CGCCAGAAAG AAGTACTTGA ACAAGAACTT TGATAACATC
ATATACACCT ATCCAACTTT GATCTCACGG ATCTTGTACA AATTGAAGAT TGTTGGTTAT
GCTAATTTGA GACAGCAGAA GAAGAAGAAG AACACTGAGG CCAGCCAAGA CTTGATGATC
ACTTCCAACT TTAAAAATTT ATCTATAATA ATTGATGAGT TGTACCAATA CCATGCTGAA
TTCAACTCAG AATTGATTCT CAAGATTTAT CTTAATGCTG CTTCAGTTGC TGACCAGTTG
AAACAAGAAT CAATTTGCTA CGAATTGTAC ACGCAGTGTT TCATTGTATA TGAAGAAAAC
TTGATTCTTG GATCCAGTCT GTACCAACAA CATATCAATC CTCACGACTC GCTTGCTGGA
GGTTCGTTGC AATATCAATC CATTATACAT GTAGCCAACA AACTAGTTTC TGCTCGTTAT
TTCAACAAGG AGAACTACGA GAACTTGATA ACAAAGTTGA CGTTGTATGG ATCGAAATTA
TTGAAAAAAC AGGACCAATG CAGAGCTGTC TATTCTTGCT CCCATTTGTG GTGGTGGTGC
GAATTGCTCA TTGAGCATGG AGAAAAGTCG CCTACTGTCC AACCAGAGGC TGCAAAAGAG
AAACTGGCAA AGGAAAATAT ACAAAAAGAT GAGGACCAGT CATCCAGAGA TCGCGAAGAG
GCTGATGATG AAGAAGATGA AATTGAGTTG TATCGTGACG CGAAGAGAGT CTTGGAGTGT
TTACAGAAGT CTCTCAGAGT GGCAGATTCG TGTATGGATC CCTATTTGCT GTTGAAGCTC
TTCGTTGAGA TCTTGAACCA ATGTCTAATT TTCAATATTT ACGGAAATGC ATTGGCTGAT
TCACGCTACA TAAATGGACT CATCGACTTG ATTAGAACAA ACATCGATAA TCTTCGTGAT
GACGACAACA ATGCAAAGAC AGATGCTGCA GACGAGGAGG ATGACAAAGA GGCTCGTTTG
TTTAAACAGA GTGTAGGCTA TTTTGAACGC ACCTTGTCTT ACATAAGAGA CCAACAGGAA
GTGGAAAATC GATTTGAGGG GATTGTCGTG TGAATTTATG AAACGGCTTT GCAAGACCAA
GCACTGCTAT GTCTTTGATA GAGGTATGAG ATTACAACTA TACTACACTT TACTATACTA
TATAGCAAAT ACAGGAACTA TTGGCCTTTT
 
Protein sequence
MVVSITEQKQ ILQSCLSAIK HQSNLMKQCL NENNILQALK HCSNFLNELR TNQLTPKQYY 
ELYIAVFDSL ETLSNHLLNS HNLKQHKLEK RQAALDSTST SDKNADDKST THKNVKNGDE
ISKNAVGKSA TTPFLADLYE LVQYSGNIVP RLYMMIVIGT TYMSTKGAPG KEIMKDMIEM
CRGVQHPIRG LFLRYYLSQR TKHLLPFSNA NDFNDTVEFL ISNFIEMNKL WVRLQHQGHS
SERELRYRER KELKILVGSN LVRLSQVIDD YNGDETYSSI KYYQDKVFPT ITEQIIQCRD
HLAQSYLIDV LIQIFPDDFH FATLDSLLSD VFLNLHPLLK KSELVATLIE RFITYHKFES
DMSTSEIKEL SLESDEKQKK IKTTIDSTQL FNSFWKFYLK LYELDPQLPS EEHSELLQSF
IRLSLTYDPN NYQNLDVVYK FATEKEGQIK ANAENDDIWL QLLIVPIRHF DSIKTLFKLP
FFHEFYLKLS NKQHQKQISL EIINKLLGIT TYGDEDGNTV QEIHEPETFT TTEEVDGIFK
YLLVLIKDSD KQNSTSKNLG VTKSITINKG ENVISHEFLS NQEKICKVIH LIENPSDPFK
NLSNLMYARK KYLNKNFDNI IYTYPTLISR ILYKLKIVGY ANLRQQKKKK NTEASQDLMI
TSNFKNLSII IDELYQYHAE FNSELILKIY LNAASVADQL KQESICYELY TQCFIVYEEN
LILGSSSYQQ HINPHDSLAG GSLQYQSIIH VANKLVSARY FNKENYENLI TKLTLYGSKL
LKKQDQCRAV YSCSHLWWWC ELLIEHGEKS PTVQPEAAKE KSAKENIQKD EDQSSRDREE
ADDEEDEIEL YRDAKRVLEC LQKSLRVADS CMDPYLSLKL FVEILNQCLI FNIYGNALAD
SRYINGLIDL IRTNIDNLRD DDNNAKTDAA DEEDDKEARL FKQSVGYFER TLSYIRDQQE
VENRFEGIVV