Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_31924 |
Symbol | |
ID | 4839474 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 234010 |
End bp | 236634 |
Gene Length | 2625 bp |
Protein Length | 874 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640390789 |
Product | predicted protein |
Protein accession | XP_001384701 |
Protein GI | 150865470 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 25 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCCGAG CTGGAGCAAG CATGACCAGC CACGGTGGAA CGAACGACAG GTCCTCGACC TCCCACGGCG ATGCGGCTCA CACTCCCATG CCTGGCTACA TCCAGCCTTC GTTGTTCGGT GATGGTGACA TCACCGACAT TAACATCATG ACCAGTGCCA GCAATCCGGA TATCAACAGG GGACTTCTCC ACAGTGCAAA CAACAACCAC ACCAACACCA GTAACACCAA TAGCGCCCTC AACAGCAATA TTAGCCCCAA TAGCAATTCT CTCCAGGGAG ATTCGCACAC CCTCGCTAAT CTCAGTAACC TCAGCAATTT CAGCAATCTC AATAATCTCA GCAACCTCAA TACTCCGGAT AATTCCACCA CTCCCAACAA CCTCAACTTC AACAATAACT CTTCTGGAAA CCAATACAGC CAGATGGACG ATTTTGGCTT TGATCTCAAA AACGACCAGT ACGATTACTT CTCGCAGGTC TCGCTACTGG CACCGTCATC CCGTATAGAC CATGTCTATG GGTTGGATTT CGGAGACTTC GAGAAATCTC AGCAAAAGCC GCAAGGCAAT CGACAACAAC CACAAGGCCA ACAAACTCTA CACCAGCCAC TACCACTACC ACTACCGCAA TATGGACAGA ATTTACAAAG TCAATTACAG AGTCTGAATC AAGTACAAAA TCAATTAAAC AGCCCTGAAC AAAGAGTAGA ACAAGAATTT CAGAACTATC ATCCCCGTTT CAGATTGAAC CAACAACACT CTCTTTCTCA AGAAGTAGTG GAGTATCCAA ACGCAGAAGG GTATCAGCCG AAACGTAAAC ACAGTAATGA AAACGATTCA TCTGTTTCTG ACACAGGGGC GGGACCTGGT TCTGGCTCCA TGACACAGTT GCCATATCCA CCAAAACCAC CTACCATACG GCAGAACTCG CTTCCATTAT CCCAGAAGGC TATGAACCAC AACCACAAGA AGAACCACGG AGCTAATGGG CTCAGCTTCT TTGGTTATAG TAGCAATCCA CAAATCTATG GAAATGAGAA TCCACCGAAT TCCGGAAATT CGCAGAATGC TCAAATTCCC CAAAATCTCC AAAACCAACA AAAGCAAAAT CTCCCAAACC AATACAATTC TTCTTCTTAC AACGATATAG ACTTGAATAC AAGTCAGAAA TCCGATGCCA CTGTTACACT TGGCTCACCT TCTGAGGCTA AAAAGAGAAC AACGCCCAAA CCAAGAGGCA GACCCAGGAA GAATAGAAAT CCCGGGTTTT TCTTGCCATT AGATAACCTC AACACAAAAA ATCAGCTTGC AAAGGTAGCC TCTAATTCAA TTTCGTCTTC TCATTCTGAC ACAGCAGCAT ATAACTGGAA CCGGTCTGAA CGAGTTAACA ATAGTGGCAC AAGAAACTTC AGTGCTAATA TCCCCTTGTT CAAAGATGAT GGCCAACACA CTCCCAATCC TGAGTTGCGG CTTTTCGAAG CTGATTTAGG ACTTCAGGAC GACAGCTTGC AACGATTCGC ACCTTCCCAT TCCAATATCA CTCCATCCAT GAATCCTCCT CTCAATGTAA ATGGCTACTT TGAGTTGGAG AACAGAAATA TTAATAGTAA CAGCAGCAAC AGCAACAGCC AGTTTGATAC CAGCAGTCTT CTCACTCATT TCATTTCTCC CCACATTAAT TACAACGAAC ACACCTTCAA CAACTTCAAC AAGGAGTTTG AGGACTATTT GTCAGAATCA GGTGCGGACT CAAATCTTGT GCAACAGCGT CCGGTTTCGG GTCAATATCA GCTTATAAAT AAGGAGCAGA TCAGTATGAG CCGAGGAGTT CCTTTGGAAA CTGATCCTGA TGATGTGGGT TTTGGTGCCG ACTTGGGCGG GTTCAACTTT GATGTTAACC CTCGAAACGA GTTGCTTGAT CCGGAAAGTC TTCCAAAGCC AACTGTTTAC GACAACAATG AGCTCGTGGC TACCCTGGCA AGCTCGATGT TGAATCCTAC TCCGATCTAC GACGACAATG ATGCTCTTTC TGTAAACCAG CATGGAGCTA CAAGTCTCAG CCAGAGTCTG GCCAATGGGT CAGATTTTGA ACTGTCGGAA TCAGACAGGG AGTCTAAACC ATCGGGTCAT CTAGACGTTC CACGACCAGT AATTGTTCGT TCCATATCCA ATCATAGTGG CAATAGCAAT TCCAATTCCA ACTCGCTGCC TCCGCAGGAG TTCAGACACG ACGACAACGA TATATTGCTT CAGAAGCCCA AGAAGAAGCG GTTGCCCAAA GGTGCCGTTT GCTCCATATG CGACAAGTAC ATAAGTCGAG ACTTGACTAG ACACATGCGG ATCCACAATG AAGTGGGCCG GTTCCAATGC GTATATCCGA AGTATATGTG CAACCACAAG ACTCAGTACT TCAACCGGCC CTATGACTAC AAGAAACATC TTCTCCATAT GCATTTCCGC TTCGACGACC CCAAAGGAAA AACAGCCAAC ACCTTGACAG ATAAGCTACC ATTAGAAGGT ATCTGTATTG CGTGCGGTGC TCGTTTTGTA GCGAATGACT GGTTGGAAAC TCATGTGTTG ACGAATTCTG CCACCAAGTG CCCTTCGCTC GAGAATCGGG AGTGA
|
Protein sequence | MGRAGASMTS HGGTNDRSST SHGDAAHTPM PGYIQPSLFG DGDITDINIM TSASNPDINR GLLHSANNNH TNTSNTNSAL NSNISPNSNS LQGDSHTLAN LSNLSNFSNL NNLSNLNTPD NSTTPNNLNF NNNSSGNQYS QMDDFGFDLK NDQYDYFSQV SLSAPSSRID HVYGLDFGDF EKSQQKPQGN RQQPQGQQTL HQPLPLPLPQ YGQNLQSQLQ SSNQVQNQLN SPEQRVEQEF QNYHPRFRLN QQHSLSQEVV EYPNAEGYQP KRKHSNENDS SVSDTGAGPG SGSMTQLPYP PKPPTIRQNS LPLSQKAMNH NHKKNHGANG LSFFGYSSNP QIYGNENPPN SGNSQNAQIP QNLQNQQKQN LPNQYNSSSY NDIDLNTSQK SDATVTLGSP SEAKKRTTPK PRGRPRKNRN PGFFLPLDNL NTKNQLAKVA SNSISSSHSD TAAYNWNRSE RVNNSGTRNF SANIPLFKDD GQHTPNPELR LFEADLGLQD DSLQRFAPSH SNITPSMNPP LNVNGYFELE NRNINSNSSN SNSQFDTSSL LTHFISPHIN YNEHTFNNFN KEFEDYLSES GADSNLVQQR PVSGQYQLIN KEQISMSRGV PLETDPDDVG FGADLGGFNF DVNPRNELLD PESLPKPTVY DNNELVATSA SSMLNPTPIY DDNDALSVNQ HGATSLSQSS ANGSDFESSE SDRESKPSGH LDVPRPVIVR SISNHSGNSN SNSNSSPPQE FRHDDNDILL QKPKKKRLPK GAVCSICDKY ISRDLTRHMR IHNEVGRFQC VYPKYMCNHK TQYFNRPYDY KKHLLHMHFR FDDPKGKTAN TLTDKLPLEG ICIACGARFV ANDWLETHVL TNSATKCPSL ENRE
|
| |