Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_53814 |
Symbol | |
ID | 4852001 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 3392158 |
End bp | 3395016 |
Gene Length | 2859 bp |
Protein Length | 912 aa |
Translation table | |
GC content | 39% |
IMG OID | 640393709 |
Product | predicted protein |
Protein accession | XP_001386976 |
Protein GI | 126276271 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.68527 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 0.494406 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | AACTTACTTT CCGTGCTATC GGGGAAGCAA TCGACTAAAA ACGCCCATAG CAAACGAGCC ACACCCAAAG CAGAAATCGT AAATGAAACA GAAGTTGTCG AAAATGGGGC CCTTGATGAA AATGAAACTA CTGATGACAG CGAGATATCG GACAACTTGG ATAGCTCGTT CACGGAAGTG AAAAGTCGTG ATTTCCGCAA AGATCACGAA GTTTTGGAGT TGGAAAGGCG CATCTATAAC AAAGAAGTAC CGACAATTTC AGCCAAAGAC TTCTTTGCAC GTTTGAAAAC TGTGAAGGAA CAACCTCCGG CGAAATCGGA TTCACAACAA GTAATTTCCG TTGGTGTAAC CCCTGACAAA CGAAAGTCAG TAGACATAAT AGAGGTTGAA GATAGAACTA TAGTAGATGT AGATGAAGAT ACACCTCCAG AAGTTCTTGA TTTATCGAAT ATAACTGGCG ATTTAATGGG CACATCCACA AACACGGAAG ATTTGGAATC GAAAATCTTT GGAGAGAATG TAAAGCGCAT GACAGCGAGT GATATCTTCA ATTCAGCAAA GCCCGTTCTT AGCAAGACTA AAAAACAAAA GAAGACAGTA GGAAAACCAA CTATGATGGT CACTTTGCGA TTGAGTTCGC CAATCTTGGA AGAAATCAAA AAGTATGACA ACCCACTTAA AACGAGAGGA AACGCCCATT TCCTGAGTCT GTTTGCGAGC AACAATAACT CTGACAATGA CTTCTTTTCT AGATCGTCTA GCAGGTCTAC AAATTCCTTA ATGGTTACAT TGAAAGTTAG TCCTTCACAC CTACAAGAAA TCAAAAAGTA TGAAAACCCA CTTAAAACGA AGGGGAACAG TCAACTTTTG GGTCTGCTTG GAAGCTCGAA CAATAATGGC AACAATAGTT TCTTCAATAG ACCATCCAGT AAACCTTCTT CATCTAAAAC AAACAATAAA TCACTATTTG CATCGATGAT GGCTGCTTCG AAGGGAGCTT CCAAACTTAC GCCACTTCAA AAGTTGAAGG AACTCTTGCC TCCCGCCTTG CACAAGTCAC AATTTCATGT TATTCCTGAA GAACAGGATT TTCTTCAAAG GGATCCACAT TTTCCATTTA GTCGTGCTTG CCCAGTAGAG GCTAGTTCTT GTGAAGACTT ATCTCTACTA GAATTAACTC AGAAGGATGC GGAAAGGATA AATTCTTATG TAATGCAACC CGTGAAAGAT ATTTCTGTAG CTGATTTGGT ATTGGAGAGG ATTCCTGATA TTTCCAATAT CCCGGTACTC AACTCGATAT ACGAAAGGTT TATTGTTAAT TCATCCAAAA ATCCGGATCG ATGTTTGTGG AATGATTTGT TTCGTCCCAG TTCACATAGG GAATTGCTTA TGGCTCCCGA AAATAAGGAT TCAATCATGA ATTGGATAGC CAACTCTTTT CTGAGACTCC GAACCCAATC CTTAACCAAT CCACGTAACA TTATGATGAA GAAGAAAAAG AAAAAACAAG CAGGTTCTGC GTTAGATGGG TTTATTGTAG ATGACGATCT GTTCTTAGAT GGTTCAGAAA CTGAAGAAGA AATTTTCGTA CCGTTATTGA TTTTATATGG TTCAAGCGGA TCGTGCAAGT CTTCTTCTGT GTATGCCATA ATGAAGGAAT TCAGTGGATA TGTTCATGAG ATCAATAGTG GTATGTACAG AGGTAGAAAG GATATCTACA ATGGATTGAA AGAGCTCTCC ACTACTCAGC TAGTACACAA GCAGAATGAA TCAAAGACAT TCCAACAGGG TTTGATATTA TTTGAAGATG TTAACTACCT TTTCGAGCAA GATAAGAACT TTTGGCTGGT AGTTCAAGAT ATCTTAAATA TATCCAGAAG ACCGATTGTC CTAACTTGTG AGGATATGCT CAATATACCA AAGAATTTGA TTGACTTTGC AGCACAAGAG GATTCGATTA TCAGGTTGGA CGAATTTACA ATTTCCAGAG ATATTCTTCA AAAGTACTTA TGGTTATGTT GTGCAAGTCA AGGCTATGAT GTGTCTACTT CCATTTTGGA AGAAGTATCT TCAAATTCAT TCAACAGCAA GAATTACGAC TTAAGACGGT GTCTCAACGA TCTACAATTT CTTTGTCAGA AGGAATATGC TGACAATTTC AATGGAATTA TTCAACTTAC TAAAGTGGAG AAGTCCAAAA GTCATCATTG TCTTGAGCTA AATCAATTTC TGTCAAATTA TGACCTTTTA TCAGAAAGCG ATATAATCTC CACCAACTCG TTCTCACAGT TAAATTATGA TATCATTCCT AATGAACTTA ACGATGTTTA TGTTATCGAT GATTCTACAA AATTGCGAGC TCCTGCTCTT CCTTTCGAAT TGGATGTAGG TAATTACTTA CAGCACGAAG TACTGAAGTC ATGCTTTGTA TCAATGTACT CTCCCGAGCT AAAGTATTCG TTTAACCAGT TGCGTACTGA GGCGGTAAGC TTTATTGGAT CTAGGTCGAA AAAGTTGCCT AAGTTCATTC AAGATTTGCA AACTGCTAGA AGGGCATTAC GGTCCGCGTC AAATATGTCT ACTCCAGGAG ATTCTCCCTT CACATCTTTC GAAGAACTTT CTTCTGCTGG TAGAACCCCG GAACCAACTG GCATACCGGA TACATCATTT GTGAATCACA TTGGACCCAC TTCGTTTGTT CTTGATTTGC TACCAATCTG CAGATGGTGG TCGCGGTTAC AAGAATCATT CGATGAAGTT GACAAGAATG CTTTAGCAGA AGGAAGACAG AGCGTCAAGA CCTTCTTGAG ATACAGAGAC TTCCAGTACA AATCCAGAAT CATAGATGAT TCTATATAG
|
Protein sequence | NLLSVLSGKQ STKNAHSKRA TPKAEIVNET EVVENGALDE NETTDDSEIS DNLDSSFTEV KSRDFRKDHE VLELERRIYN KEVPTISAKD FFALISVGVT PDKRKSVDII EVEDRTIVDV DEDTPPEVLD LSNITGDLMG TSTNTEDLES KIFGENVKRM TASDIFNSAK PVLSKTKKQK KTVGKPTMMV TLRLSSPILE EIKKYDNPLK TRGNAHFLSL FASNNNSDND FFSRSSSRST NSLMVTLKVS PSHLQEIKKY ENPLKTKGNS QLLGLLGSSN NNGNNSFFNR PSTSKLTPLQ KLKELLPPAL HKSQFHVIPE EQDFLQRDPH FPFSRACPVE ASSCEDLSLL ELTQKDAERI NSYVMQPVKD ISVADLVLER IPDISNIPVL NSIYERFIVN SSKNPDRCLW NDLFRPSSHR ELLMAPENKD SIMNWIANSF LRLRTQSLTN PRNIMMKKKK KKQAGSALDG FIVDDDLFLD GSETEEEIFV PLLILYGSSG SCKSSSVYAI MKEFSGYVHE INSGMYRGRK DIYNGLKELS TTQLVHKQNE SKTFQQGLIL FEDVNYLFEQ DKNFWLVVQD ILNISRRPIV LTCEDMLNIP KNLIDFAAQE DSIIRLDEFT ISRDILQKYL WLCCASQGYD VSTSILEEVS SNSFNSKNYD LRRCLNDLQF LCQKEYADNF NGIIQLTKVE KSKSHHCLEL NQFLSNYDLL SESDIISTNS FSQLNYDIIP NELNDVYVID DSTKLRAPAL PFELDVGNYL QHEVLKSCFV SMYSPELKYS FNQLRTEAVS FIGSRSKKLP KFIQDLQTAR RALRSASNMS TPGDSPFTSF EELSSAGRTP EPTGIPDTSF VNHIGPTSFV LDLLPICRWW SRLQESFDEV DKNALAEGRQ SVKTFLRYRD FQYKSRIIDD SI
|
| |