Gene Information |
Locus tag | PICST_41012 |
Symbol | |
ID | 4837605 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 105792 |
End bp | 108782 |
Gene Length | 2991 bp |
Protein Length | 847 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640388920 |
Product | predicted protein |
Protein accession | XP_001382783 |
Protein GI | 150864087 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
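The coordinate fields above can be cross-checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, which is the convention that reproduces the reported gene length of 2991 bp; the function name `feature_length` is hypothetical and not part of any database API.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the Start bp and End bp fields of this record.
length = feature_length(105792, 108782)
print(length)  # 2991

# Upper bound on protein length if every base were coding:
# number of whole codons, minus one for the stop codon.
max_codons = length // 3
print(max_codons - 1)  # 996
```

The reported protein length of 847 aa is below this 996 aa upper bound, which is consistent with the record marking the gene as spliced.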
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.164038 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AAGACGACAA AGTCCAGAAA TGGCTGTATT ACATGTAAGA GAAAACGCTT GAAATGTGAC GAAACCAAAC CAGCCTGTCT CAACTGTAAG AAAAGGAATA TTGAATGTGG AGGCTATGCC ACTAACTTCA AATGGAGATC TTTTGGTGAA ACCGACTCCA ATAGTACTGT AACCAGTTTC ACAACATCTA CAAAGGACAA AAGAACATCC ATTTCCGTCA TCTCTACGAG CCCAACTTCG ACGTCTGCAT CAATATCTTC AACCGCATCT GTACCTTCAT CAAACTTGAC ATCTGCTTCT TCTGAGCAAA AGTCAAATAG TCTCAAGCGT CATCTAGAGC TTGCCTCGCT TTCTGTCACT GGAAGGACCA TTGATGATAT CAAGATCGAA AACGACTTAA TCTCGAAAGG AATAAATCCA CACAGTTATA AAAGGAAAAA GCACCATTCC GATTTCAGTA GCCAACACAT GATCCATGCT GCCTCTGTAG ATTCTATACC GAGAAGAAGC AATAGCATGT CTAAGGATAT GTCTCCAGTA GCCAGAGAAT CGCTTGTTAG ATCCTTCAGC ACGAACTCGA CTATGGAATC TGGGAATTTG ATAACTTCAT TAAGGGAAGA ATACTCTCGT GATACTTCCG CCCGTCTTGA TTCTCTAGCT GATGCTGCAG TAGACGAAAT GAATCGTTCT CCATCGTCTA TAAAGAAAGA ATCTATAGCC GAAAGTCCGG CTCTGCAACT TCCTTTTTCA CCTAACTTCG GTGATTTTCT TACTGTTCGT TCTCCGGCTG GTTCCGTAGA CGAAGTGTCA AGTAGTCATA CAGATAAGGC ACAAGCTGCC CCAACGAGTT TGCTACTCTC CAGAGAGGCA GAGATCTTAA ACGAAATAAA CCTCACACCT TCGCTCTCTG CTATAATCAA TTTTGCTTTC AGTGTAGACG ATCCAAAGGA TGCCTTCATT GATGGAAAGA CCAAGTTTCC CTTTGAAGGT CCATTCAGTC CTTTGACGTT GACATTTCCA TATGATAACC ACGAAAATGG AACTGAGTCA TCCACAGCAA ACAAACAAGA TTCACCTTTG TCATTTGGAA AAATGGCCCT TAATATGAGA GATATATCTT CTCCTGTAGC ATCTGTAATC ACACCATTGT CGACCATACC CGATAGCCAG CTTAACAGGT CCTTAGTTAA GACATCAGAA CAAGAGCAAA TCTTGCATTT ATACTCTCGG TACACTTGTT GCATCATGTC TATAAAGAAT GGAGCCAACG AAAATCCCTG GCGAAACTTG ATTGTGCCTC TTGCTACTAA GTATTCGTGT TTATTCAACT CGATTGCTTC TATGACATTA TTTCATTTAG CAGGTAACAG CGATTTGAAA GGAAATGGAG CTGATATGAG AGCTAAGGGG TACATCTACT TAAGGAGATG CATCCTTGAG TTGGCCAGTG GACTTTCCAA AGTCAATGGC GAGGGTGAAG ACAACGAAGA ACTTCCAGCC GATATTGCAC TTGCCACTTG CCTAAATTTG GCCGTTTGTG AATCTTGGGA TATACAAACT TCTAGCGGAA TAGCTCATCT TAAAGGTGCC AAAAGTATGA TCCAGAAGGT CTTGACACTC TTGAGAGACC AACAGCAGTC TTTATCAAAA AAGAGGAAGG AGTTGGAACT TGTTTCAAGT CCAGCAGAAA GCGAGCATCA GTATAATGAA CTTGTTATTT CTAAAAAGAA AGACTTGGAA AAGAAATTAG TATTAGTAGA AGACAACGAG TGGGAATGTT TATTTGAAGA TGCTGGTAGC 
CAAGAAGAAC CATCAGGACA GGGATTTGCC AATAGAGTAG ACAAGAGTAG CATTCAAATT CCCAAGAGCT TACAATTCCT CTTCAATATC TGGATATATT TTGAGGTATT AGCTCAAATG ACTACAGACA CCAATTATGA CGACAAGGGA GTGGATTTAG TGGCTACAAT TACCACTATG TTGCAGACAT CTCAAAAGAA CCACAGGAAA AAGAGTGGTG GCAGTATTCT GCTGCTAGCA AGCGAACACG GTGACGTCGA TGCTGACACT GGTTCACACA AATCAGATTC AGGAGAACTG AGCGTCACTG AATCGATTAC TAGAAATGGA TTTAGTCTTT TCGAGAACTT TGACTCCTTT AGCTACAACA CTGAGTATGT AGATCCATTA TTGGGATGTG CTCAATCACT CTTTCTGATC ATGGGTAAAG TTGCAAACTT GATTGCCAAG ATTAGGAAGT CTAGAAGAAA GGAAACAGAG AGTCAAAACT GGAAAGGAGC AAAACCAAGA AATAGTCTCA AAACCATAAC TCTTGCTACC GAATTGAAGC AGCAACTTGT TAACTGGAAA CTGACTATAA CAGCTTCCAT GATTAACCAA GCCAATTTGT ATGACGACAA CAATGGAAAC ACTTGGGATC TTCCCTCGTG TATTGCAACA GCAGAAGCTT ATAGATTTGC TACAATTATC TATTTGCATC AGGCAGTTCC TGAAATTCCA TGTCCTTCTA CACACTCATT AGCAGAGAAG ATATTTATAC TCTTTGCATC AATTCCAACA AATTCCGATT TGCATGTCAT TCACATCTTT CCCCTTCTAG TTGCATCTTG TGAGGCAGAA CCGGGAGATG AAAGAGAATG GTGTGAGACG AGATGGAAGT TGCTATCTGA AAAGATGTGG ATTGGTAACA TTGACCGTGC ATTGCAGGTT GTCAAAGAGG TATGGAAGAG AAAGGATGAT TACAAGAAGA AGAGGAGAAG AGGAGAGTTT GACGAAGCTA CGATGAAAGG TGGTGCCAAT AAAGAAGATG AGGACTCGTT GAGAAATATT TCTGCGCAAA TCAGCGGACT TATGTCTGTT ATCAATGACC TGAATGGCTC TTCGCTGGAA GATATTCGTG GTGGAATTGG TTCAAAACTC CATTGGAGCA CCGTGATGAA AGAATGGGGG TGGGAAGTCT TGTTAGGTTA G
|
Protein sequence | KTTKSRNGCI TCKRKRLKCD ETKPACLNCK KRNIECGGYA TNFKWRSFGE TDSNSTQKSN SLKRHLELAS LSVTGRTIDD IKIENDLISK GINPHSYKRK KHHSDFSSQH MIHAASVDSI PRRSNSMSKD MSPVARESLV RSFSTNSTME SGNLITSLRE EYSRDTSARL DSLADAAVDE MNRSPSSIKK ESIAESPASQ LPFSPNFGDF LTVRSPAGSV DEVSKILNEI NLTPSLSAII NFAFNISSPV ASVITPLSTI PDSQLNRSLV KTSEQEQILH LYSRYTCCIM SIKNGANENP WRNLIVPLAT KYSCLFNSIA SMTLFHLAGN SDLKGNGADM RAKGYIYLRR CILELASGLS KVNGEGEDNE ELPADIALAT CLNLAVCESW DIQTSSGIAH LKGAKSMIQK VLTLLRDQQQ SLSKKRKELE LVSSPAESEH QYNELVISKK KDLEKKLVLV EDNEWECLFE DAGSQEEPSG QGFANRVDKS SIQIPKSLQF LFNIWIYFEV LAQMTTDTNY DDKGVDLVAT ITTMLQTSQK NHRKKSDSGE SSVTESITRN GFSLFENFDS FSYNTEYVDP LLGCAQSLFS IMGKVANLIA KIRKSRRKET ESQNWKGAKP RNSLKTITLA TELKQQLVNW KSTITASMIN QANLYDDNNG NTWDLPSCIA TAEAYRFATI IYLHQAVPEI PCPSTHSLAE KIFILFASIP TNSDLHVIHI FPLLVASCEA EPGDEREWCE TRWKLLSEKM WIGNIDRALQ VVKEVWKRKD DYKKKRRRGE FDEATMKGGA NKEDEDSLRN ISAQISGLMS VINDSNGSSS EDIRGGIGSK LHWSTVMKEW GWEVLLG
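The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the gene sequence is already shown in coding orientation (its first codons match the start of the protein sequence) and uses NCBI translation table 12, the alternative yeast nuclear code, in which CTG is read as serine rather than leucine. The names `TABLE_12_AAS`, `CODON_TABLE`, and `translate` are hypothetical helpers written for this example.

```python
BASES = "TCAG"

# Amino acids of NCBI translation table 12, listed in TCAG codon order.
# It differs from the standard code at one codon: CTG -> S instead of L.
TABLE_12_AAS = (
    "FFLLSSSSYY**CC*W"
    "LLLSPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)

# Build the codon -> amino acid lookup from the compact string.
CODON_TABLE = {
    b1 + b2 + b3: TABLE_12_AAS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1; whitespace is ignored,
    trailing partial codons are dropped, unknown codons become 'X'."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    codons = (seq[i:i + 3] for i in range(0, usable, 3))
    return "".join(CODON_TABLE.get(codon, "X") for codon in codons)


# First 30 bases of the gene sequence in this record.
print(translate("AAGACGACAA AGTCCAGAAA TGGCTGTATT"))  # KTTKSRNGCI
print(CODON_TABLE["CTG"])  # S
```

Because the record marks the gene as spliced, translating the full genomic sequence this way is not expected to reproduce the whole 847 aa protein; only the stretch before the first intron lines up directly.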