Gene Information |
Locus tag | PICST_88597 |
Symbol | |
ID | 4838097 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009043 |
Strand | - |
Start bp | 1043933 |
End bp | 1047681 |
Gene Length | 3749 bp |
Protein Length | 1183 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640389412 |
Product | predicted protein |
Protein accession | XP_001383826 |
Protein GI | 150864842 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
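The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions on the replicon (an assumption, though it reproduces the listed Gene Length) and that the coding sequence ends with one stop codon (also an assumption). The function names are illustrative and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if start_bp < 1 or end_bp < start_bp:
        raise ValueError("expected 1 <= start_bp <= end_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = feature_length(1043933, 1047681)
cds_len = coding_nucleotides(1183)

print(gene_len)            # 3749, matching the Gene Length field
print(cds_len)             # 3552
print(gene_len - cds_len)  # 197 bases not accounted for by the coding sequence
```

The 197 bp difference is consistent with the "Is gene spliced | Yes" field, but this arithmetic alone does not say how those bases are split between intronic and flanking sequence.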
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.713267 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CCCTGATCCA TTCTGTTTCC ACCTGAACTA ACCCATTCTC GGCTCTTCTC ACGAAATGGA CGGCTGGACC GAAATCTCCC GAATAGCGGC CACCACTCAG CCCCCGAAAG GTCCCTCTCC CCATATCCAT GAGCCAGCAT CCATCACTTC TCTCAAGTTT GACTCGGTGC AGAATCTTAT ATGGTGCGGT GACAGTTTTG GGTACACAAG ATCATTTACA CCAGGTCCAA ATTCAAATGG AGTCAATCCC TATCAGCAGA GTTTGCTGTT TCTTCCATAT ACAAAGTTCA AGTCGTCTTT GAACAACAAT CCCATACTTC AGCATCTAGA CCACAAGGAT GGGATCTTGC TGCTTCTGAG CAATTGCATC AACTTCAACA ACCGTAGAGG TTTGGCCAAG TTATCTTTGC TGTCAGATTC GTTTGTTGAC GCAAATTCCG TTCTTTTCAA GAACTTAACT AGTCTCACGA CCAACTGTAA TACTATGAAC GATATAGTTG TAGGTAGCAA TTTGTCGCTT CTAAAGTTTG ATCTCAATAA GCCGTCACAC CTCATGTCGT TCAACCACGA CGGCAACATA TCGATAGTGA ACCAGACATC AAAATTCTTG ACGTTGGGTA AAGCTAATGG AGCTCTAGAA CTTTTTGATC CCGTAAGCAA CTCCTCTATC AAGTCGTTCC AGGGCCATAC GGGTTTACTT TCAGACTTGG ACGTTCAAGG CTCCTACATT GCGACATGTG GATATTCCAT TCGTCCGAGA CGTTACAACC ATAACTCCCA AAACTCCAAT AATGACTACA TGGTAGACCC CCTAGTTAAC ATCTACGATG TGAGAATGAT GCGTGCTGTG GCCCCAATAC CGTTCCCAGC AGGTGCTTCC TGTGTCCGTT TCCACCCGAA GTTACCTAAC ATTGTGCTCA TATCATCCAA TAGTGGCCAG CTTCAATTTG TAGACATCTA TGACCAAACC AATGTGTTCT TGTACCAAGC AGATTTGATG CCTACATCGC AACCGCGCCA GCCCCTGCAA TTGTCCAAGG CTCCGTTCAT GACGAATCTT GAAATCAGCG AGAATGGTGA CTTCTTTGCC TTTAGCGATA GCTATGCGAC AATGCACTTG TGGACCTTAA ACAACTCTGG TTCGACTATA ACTAAAGACT TTGTCAATTT CCCTGCTAGC ATTGAACAAC CGGATGTAAT CATACCTCCT GCAACTGAGC ATATTGGTGT GGATGATTTG GTTCCTTTAT CCACAATTGG TATGCCCTAC TACAAGGACT TGTTACTCTC TAACTATGCC TCGGACTTGA GCTTTACCAA GGAGTTGGCT AAATTGCCTG ACCCAATCGA CCACGATCTA CTCGTAGAAA GTGAATCTCA TGTCGGTTTT TTCCCATACG ACAAGTCCAA ATACGGTCCA GCCAACACCG CAAAGAAATA TCAGTCTCTT AAGGAAAGAA GCAACATCCA TTCGACAGTC AACGTACCTA AGTTTATAAG TGAAAGAAAC AATGCCTTGA CCAAAACCAT GTCCAACAGT TCAGATTTGC ATTTGGATAT GAATAACGAT TCATCTGAGA ACGTAGCTAG TCTTGCACTT CAGAATTTGA ACAAGGAACA TCACAATGAA ATCTTTCAGT ACAAGGTGCC TTTATCGTCT TCTTCGGGAA GCTCCACTCA AACTGCAAAT AGTAGAAAGA AGATCCCCAA TTGCTATTCG AGATTGCAGA TTCAGTACTC CAAGTTTGGT GTTAAAGATT TCGATTTCAG CTACTACAAC AGGACTGATG GATTGTACTG 
CGGGCTTGAA AATGATGCTG ATAATTCTTA TGTGAATCCC TTACTTCAAT TATACAAGTT TCAGCCAGCT TTCCACAACT TGATGGTTCG AAACTTGACC AACGAATGGT TACCCAATGA TTTTGAAACA ATTATCACTC AAAAGAATCC GCAAGGATCT TCGATTCTAA ATGAATTGGG ATATTTGTTC GACATGATGA ACAAGGCAAA GGACAAAAAT GTGAATATGT CCAACTTTTC ACAGGTGTTT AAAGAGAATA GACTTGCTCA GACAGAGAAC TTGATTAATC TTGACGAAGG AGCCAAGTTG AATTCTCAGG CTTTGCGTAA TTTGATCATT GGTTTCAACA AATTCTTGAT TGCTGAAGTA TACAAGGATT TAATGAACCA GGCCAGAGAC TCCTCTATTT CGAGCCTTAT GACAGTGCTG TACGTGATGG AAGTTAGAGG AACCGGTCCA TCTTGCCCTA TTTACGATAA GCAGTTTGGA TCGCAGTTGA GTTTAGATTT GCTTACACCT CCCAGTAACG TATTGAACAA GTTGAGTATA TTACTTAATC CACAGATCAA CACACAGCAG CAAGTGGTAA CACCTACTAC GACCAGAAGA AATCATAATT TGATAACCTA CTTGGAGTAC ACCATTAACC AGTTCAAGAC CATACCATGT CAACAACACC AACATCAATA TCCACACACT TTGGAAGTGC GTTCTTCTAT CACAAAATTG CCTCCTTTAT TATCGTTGAA TGTTAACTTG TCCAACGAGG AATTCAAGTT GATCAATGGA TTCAAGAAGT GGCTTGTTCC AGAGTTCTAC GCCTTGAACA ACAACAATGA TGCACCCATT GCTTTCAAGC CGGTTCTCAC ACAATTTGAT CAGGACTCGA CTAGATATGA GTTATTGGGT TATGTTTGTG AAATCAGTCA GCAATCTGAT TTTTCATTGG GAACACATAA CTTGGTTTCA TATGTCAAGA TAGATGGAAG GTGGTTCTTG TTCAATGATT TCTTGGTCAT GCAAATTCCA GAAGAGGAAG TATTCAACCT TTCGTATCCA TGGAAGAAGC CTGTCATTTT GGTATACCAC GATAGTTCTA TTTCTGGTAT CCCTTTTGAT TTGTTCCAAA TAGAGACGTT CGCCAATTTG CCGGGATTGA ACGATTCTAT CATATATCGC GATCACTTTG CTGGGTCAAT TAGGGAGTCC CACAAGAAGG ATTACGAGTT GTTAACTAGA CAGGAAGCAC CAAGTTTGGG CACTCTTATT GCCATTGACG CCGAATTCGT GAATTTGAGA CCAGAAGAAC TCGAAGTAAG ATACGACGGC CACAAAAAGT TAATCAAGCC CAAGTTTCTT TCTTTGGCAA GGTTATCAGC TCTTCGTGGT GATAATGGTG AGAAACAAGG AGTTGCATTT ATCGATGACT ACGTTGTACA CACTGGTGAA ATCTACGACT ATCTCACAAG CTTCTCAGGT ATCGAACCTG GAGACTTGGA CCCTATCAAC TCGGAAAAGA ACTTGGTTAC TTTACAGACA GTTTACAGAA AGTTGTGGTT ACTTTTGAAC TTAGGTGTTG TGTTTGTAGG ACATGGCTTG TACAATGATT TCAGAGGTAT TAACTTGCAA GTACCGCAAA ATCAGATTAG GGACACAGCC GATTTCTACT ACAAAAGTGA TTTCAAGAGA CAATTGAGTT TGAAATTCCT TGCCTATGTT CTATTGAAAG AGAAGGTACA GACCGGTAAT CACGACTCAA TTGAAGATGC CTACACTGCG TTGTTGCTAT ACAAGAAGTA TATCGAGATC 
ACAGCTACTG GTGAGTACGA GAGTACTTTG AATTACATCT ACTCGGAAGG ACAACAGTTG AGGTTCAAAG TGCCTGAATA GAAAATATTA AGTAATGCCT AATAGATTAA TGACATTTAA TAATATGAAA ATAGTAATT
|
Protein sequence | MDGWTEISRI AATTQPPKGP SPHIHEPASI TSLKFDSVQN LIWCGDSFGY TRSFTPGPNS NGVNPYQQSL SFLPYTKFKS SLNNNPILQH LDHKDGILSL SSNCINFNNR RGLAKLSLSS DSFVDANSVL FKNLTSLTTN CNTMNDIVVG SNLSLLKFDL NKPSHLMSFN HDGNISIVNQ TSKFLTLGKA NGALELFDPV SNSSIKSFQG HTGLLSDLDV QGSYIATCGY SIRPRRYNHN SQNSNNDYMV DPLVNIYDVR MMRAVAPIPF PAGASCVRFH PKLPNIVLIS SNSGQLQFVD IYDQTNVFLY QADLMPTSQP RQPSQLSKAP FMTNLEISEN GDFFAFSDSY ATMHLWTLNN SGSTITKDFV NFPASIEQPD VIIPPATEHI GVDDLVPLST IGMPYYKDLL LSNYASDLSF TKELAKLPDP IDHDLLVESE SHVGFFPYDK SKYGPANTAK KYQSLKERSN IHSTVNVPKF ISERNNALTK TMSNTSLALQ NLNKEHHNEI FQYKVPLSRK KIPNCYSRLQ IQYSKFGVKD FDFSYYNRTD GLYCGLENDA DNSYVNPLLQ LYKFQPAFHN LMVRNLTNEW LPNDFETIIT QKNPQGSSIL NELGYLFDMM NKAKDKNVNM SNFSQVFKEN RLAQTENLIN LDEGAKLNSQ ALRNLIIGFN KFLIAEVYKD LMNQARDSSI SSLMTVSYVM EVRGTGPSCP IYDKQFGSQL SLDLLTPPSN VLNKLSILLN PQINTQQQVV TPTTTRRNHN LITYLEYTIN QFKTIPCQQH QHQYPHTLEV RSSITKLPPL LSLNVNLSNE EFKLINGFKK WLVPEFYALN NNNDAPIAFK PVLTQFDQDS TRYELLGYVC EISQQSDFSL GTHNLVSYVK IDGRWFLFND FLVMQIPEEE VFNLSYPWKK PVILVYHDSS ISGIPFDLFQ IETFANLPGL NDSIIYRDHF AGSIRESHKK DYELLTRQEA PSLGTLIAID AEFVNLRPEE LEVRYDGHKK LIKPKFLSLA RLSALRGDNG EKQGVAFIDD YVVHTGEIYD YLTSFSGIEP GDLDPINSEK NLVTLQTVYR KLWLLLNLGV VFVGHGLYND FRGINLQVPQ NQIRDTADFY YKSDFKRQLS LKFLAYVLLK EKVQTGNHDS IEDAYTALLL YKKYIEITAT GEYESTLNYI YSEGQQLRFK VPE
|
| |
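The sequences above are printed in space-separated blocks of ten, and the record lists translation table 12 (the alternative yeast nuclear code, in which CTG encodes serine rather than leucine). The following is a minimal sketch, not the database's own pipeline, of how such a sequence could be cleaned, its GC fraction computed, and a fragment translated with table 12. The helper names are hypothetical, and the codon table is built from the NCBI ordering of bases (T, C, A, G). Because the gene is spliced and the displayed gene sequence starts before the ATG, translating the whole gene sequence directly would not reproduce the protein; only short in-frame fragments are used here.

```python
BASES = "TCAG"
# Amino acids for codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 12 differs from the standard code only at CTG (S instead of L).
TABLE_12 = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)


def clean(seq: str) -> str:
    """Remove whitespace from a block-formatted sequence and upper-case it."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a sequence (0.0 for an empty sequence)."""
    s = clean(seq)
    if not s:
        return 0.0
    return sum(base in "GC" for base in s) / len(s)


def translate_table12(seq: str) -> str:
    """Translate an in-frame DNA fragment using NCBI translation table 12."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - len(s) % 3, 3):
        b1, b2, b3 = (BASES.index(base) for base in s[i:i + 3])
        protein.append(TABLE_12[16 * b1 + 4 * b2 + b3])
    return "".join(protein)


# First two blocks of the gene sequence as displayed in the record.
print(gc_fraction("CCCTGATCCA TTCTGTTTCC"))                 # 0.5

# Start of the coding region, taken from the gene sequence above.
print(translate_table12("ATGGA CGGCTGGACC GAAATCTCCC GAATA"))  # MDGWTEISRI

# In-frame fragment of the gene sequence containing two CTG codons.
print(translate_table12("ATCTTGCTGCTTCTGAGC"))              # ILSLSS
```

The second fragment would read ILLLLS under the standard code; the ILSLSS result agrees with the corresponding residues in the protein sequence shown in the record.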