Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_89002 |
Symbol | |
ID | 4838696 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 340514 |
End bp | 343543 |
Gene Length | 3030 bp |
Protein Length | 970 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390011 |
Product | predicted protein |
Protein accession | XP_001384020 |
Protein GI | 150864982 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.887075 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGTTGTCT CTATCACAGA GCAGAAACAG ATTCTCCAGA GCTGTCTCTC GGCGATTAAG CACCAGTCTA ATCTAATGAA ACAATGCCTC AACGAGAACA ACATTCTTCA GGCTCTCAAG CATTGCTCCA ACTTCCTCAA CGAGCTACGG ACCAATCAGT TAACACCAAA ACAATACTAC GAGTTGTATA TCGCTGTTTT TGACTCTCTT GAAACCCTCC TGAACCATTT ACTCAACTCG CACAATCTCA AACAGCACAA GCTTGAGAAG AGACAAGCTG CTTTGGATAG CACCTCAACT TCAGACAAAA ATGCCGATGA TAAAAGCACT ACTCATAAAA ATGTTAAAAA TGGTGATGAA ATTTCAAAAA ATGCTGTTGG AAAAAGTGCT ACGACTCCAT TCCTTGCAGA TTTGTACGAG CTTGTTCAAT ACTCCGGCAA CATCGTACCC CGTCTCTATA TGATGATCGT CATTGGCACA ACATATATGT CGACGAAAGG AGCTCCTGGC AAAGAGATCA TGAAAGATAT GATCGAGATG TGTCGTGGAG TACAGCATCC TATTCGAGGC TTGTTTTTGC GTTACTACTT GTCGCAGAGA ACGAAGCATT TGCTTCCGTT TCTGAACGCC AATGACTTCA ACGACACTGT AGAGTTTCTC ATTTCCAACT TCATCGAGAT GAACAAGTTG TGGGTGCGTT TGCAACACCA GGGTCATTCG TCGGAACGTG AGTTGAGATA CAGAGAGAGA AAGGAATTGA AGATCTTGGT AGGTTCCAAC TTGGTCAGAT TATCGCAAGT CATTGACGAT TACAACGGCG ACGAAACCTA CTCCAGCATC AAGTATTACC AGGATAAAGT ATTTCCTACC ATCACAGAAC AGATTATTCA GTGTCGTGAT CATCTTGCCC AGAGCTATTT GATTGACGTT TTGATCCAGA TCTTTCCTGA TGACTTTCAC TTTGCCACGT TGGACAGCTT GCTTAGTGAT GTTTTCCTCA ATTTGCATCC GTTGTTGAAG AAGAGCGAGT TGGTAGCCAC GTTGATCGAG AGATTCATCA CCTATCACAA ATTTGAGTCC GATATGTCTA CAAGTGAGAT CAAGGAGCTT TCTTTGGAAA GCGATGAGAA ACAGAAAAAG ATTAAAACGA CTATTGATTC CACGCAATTG TTCAATTCTT TCTGGAAATT CTACTTGAAA TTGTATGAAT TAGATCCACA ATTACCTTCA GAAGAGCATT CTGAGTTGCT ACAATCGTTT ATTAGATTGT CGTTAACGTA CGATCCTAAC AACTACCAAA ACTTGGATGT AGTCTACAAA TTTGCTACTG AGAAAGAGGG CCAAATCAAG GCTAATGCCG AGAATGATGA TATTTGGTTG CAATTGTTGA TTGTTCCTAT TCGTCACTTT GATTCCATCA AGACCTTGTT CAAGTTGCCC TTCTTCCACG AGTTCTATTT GAAGTTGTCC AACAAACAGC ACCAGAAACA GATATCGTTA GAAATCATCA ACAAATTGCT AGGAATAACT ACCTATGGAG ATGAAGATGG TAATACCGTC CAAGAGATTC ACGAGCCGGA GACTTTCACC ACCACTGAAG AGGTAGACGG GATTTTCAAG TACTTATTGG TCTTGATCAA AGATTCTGAC AAGCAAAACA GTACCTCCAA GAACTTGGGG GTCACAAAGA GCATAACCAT CAACAAAGGA GAAAATGTGA TTTCGCATGA GTTCTTGAGC AACCAGGAGA AGATCTGTAA AGTTATTCAT TTAATTGAAA ACCCCAGTGA TCCTTTTAAG AACTTGTCCA ATTTAATGTA CGCCAGAAAG AAGTACTTGA ACAAGAACTT TGATAACATC ATATACACCT ATCCAACTTT GATCTCACGG ATCTTGTACA AATTGAAGAT TGTTGGTTAT GCTAATTTGA GACAGCAGAA GAAGAAGAAG AACACTGAGG CCAGCCAAGA CTTGATGATC ACTTCCAACT TTAAAAATTT ATCTATAATA ATTGATGAGT TGTACCAATA CCATGCTGAA TTCAACTCAG AATTGATTCT CAAGATTTAT CTTAATGCTG CTTCAGTTGC TGACCAGTTG AAACAAGAAT CAATTTGCTA CGAATTGTAC ACGCAGTGTT TCATTGTATA TGAAGAAAAC TTGATTCTTG GATCCAGTCT GTACCAACAA CATATCAATC CTCACGACTC GCTTGCTGGA GGTTCGTTGC AATATCAATC CATTATACAT GTAGCCAACA AACTAGTTTC TGCTCGTTAT TTCAACAAGG AGAACTACGA GAACTTGATA ACAAAGTTGA CGTTGTATGG ATCGAAATTA TTGAAAAAAC AGGACCAATG CAGAGCTGTC TATTCTTGCT CCCATTTGTG GTGGTGGTGC GAATTGCTCA TTGAGCATGG AGAAAAGTCG CCTACTGTCC AACCAGAGGC TGCAAAAGAG AAACTGGCAA AGGAAAATAT ACAAAAAGAT GAGGACCAGT CATCCAGAGA TCGCGAAGAG GCTGATGATG AAGAAGATGA AATTGAGTTG TATCGTGACG CGAAGAGAGT CTTGGAGTGT TTACAGAAGT CTCTCAGAGT GGCAGATTCG TGTATGGATC CCTATTTGCT GTTGAAGCTC TTCGTTGAGA TCTTGAACCA ATGTCTAATT TTCAATATTT ACGGAAATGC ATTGGCTGAT TCACGCTACA TAAATGGACT CATCGACTTG ATTAGAACAA ACATCGATAA TCTTCGTGAT GACGACAACA ATGCAAAGAC AGATGCTGCA GACGAGGAGG ATGACAAAGA GGCTCGTTTG TTTAAACAGA GTGTAGGCTA TTTTGAACGC ACCTTGTCTT ACATAAGAGA CCAACAGGAA GTGGAAAATC GATTTGAGGG GATTGTCGTG TGAATTTATG AAACGGCTTT GCAAGACCAA GCACTGCTAT GTCTTTGATA GAGGTATGAG ATTACAACTA TACTACACTT TACTATACTA TATAGCAAAT ACAGGAACTA TTGGCCTTTT
|
Protein sequence | MVVSITEQKQ ILQSCLSAIK HQSNLMKQCL NENNILQALK HCSNFLNELR TNQLTPKQYY ELYIAVFDSL ETLSNHLLNS HNLKQHKLEK RQAALDSTST SDKNADDKST THKNVKNGDE ISKNAVGKSA TTPFLADLYE LVQYSGNIVP RLYMMIVIGT TYMSTKGAPG KEIMKDMIEM CRGVQHPIRG LFLRYYLSQR TKHLLPFSNA NDFNDTVEFL ISNFIEMNKL WVRLQHQGHS SERELRYRER KELKILVGSN LVRLSQVIDD YNGDETYSSI KYYQDKVFPT ITEQIIQCRD HLAQSYLIDV LIQIFPDDFH FATLDSLLSD VFLNLHPLLK KSELVATLIE RFITYHKFES DMSTSEIKEL SLESDEKQKK IKTTIDSTQL FNSFWKFYLK LYELDPQLPS EEHSELLQSF IRLSLTYDPN NYQNLDVVYK FATEKEGQIK ANAENDDIWL QLLIVPIRHF DSIKTLFKLP FFHEFYLKLS NKQHQKQISL EIINKLLGIT TYGDEDGNTV QEIHEPETFT TTEEVDGIFK YLLVLIKDSD KQNSTSKNLG VTKSITINKG ENVISHEFLS NQEKICKVIH LIENPSDPFK NLSNLMYARK KYLNKNFDNI IYTYPTLISR ILYKLKIVGY ANLRQQKKKK NTEASQDLMI TSNFKNLSII IDELYQYHAE FNSELILKIY LNAASVADQL KQESICYELY TQCFIVYEEN LILGSSSYQQ HINPHDSLAG GSLQYQSIIH VANKLVSARY FNKENYENLI TKLTLYGSKL LKKQDQCRAV YSCSHLWWWC ELLIEHGEKS PTVQPEAAKE KSAKENIQKD EDQSSRDREE ADDEEDEIEL YRDAKRVLEC LQKSLRVADS CMDPYLSLKL FVEILNQCLI FNIYGNALAD SRYINGLIDL IRTNIDNLRD DDNNAKTDAA DEEDDKEARL FKQSVGYFER TLSYIRDQQE VENRFEGIVV
|
| |