Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32201 |
Symbol | |
ID | 4839587 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | + |
Start bp | 931295 |
End bp | 933544 |
Gene Length | 2250 bp |
Protein Length | 749 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640390902 |
Product | predicted protein |
Protein accession | XP_001384846 |
Protein GI | 150865575 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.214118 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTTTTGC GAGGCAGAAG TCGTCTTGTA CGACATTTGG ATCCACAAAA GTCGCAGCAT GTATTTTTCA GTGTGTTTCC AGCACTTTCC AGTCTGAAAA ATGTGATTCC TACCATTCCA GTGCCTGAAC TCCAGGATTC TTCCACGATT AGAACTTTTC AGCCGCAGAG CTCTAGACCC CCAAGCAGTA GGAAGCAAAT CTTCCTTGAG CCTCGAGAAA CCTTTAATCG AAATTTCAAC TCGAGGATCG AAGACATCAA AAGTTCGGGA CTAGAGAACC CCATCGAATC CAAGTCTATG ACTTTTATTG AATCTGAAGT CGATATTGAC GAAGCATCTC CAGAAAGATT AATTGGAATA ATAGACAAGT TCAAGCCAGC TGTAGCGGAG GTTAACCGAA TGAGATTTCG TAGCATTCAA CAGGAATTAG AAAAAGCCTT TTCTCAGCAG CAATTGGCTT CTTATTTAGA ACAGAACTAT TCCAAAGTTG AATTTGCAAA TAGTAGGAGA TCCACCAGAC GAGTTTCAAA GAAGAAGCTT GCGCAGCGTA TTATCAATGA CATTTGGGGT ATAAAAGTGA ACTCGAAATT GACGGAAACG AGTGATTTGA TAACAAAAAA GACTGTACCC CTTTCAGATC TTGATATCTT TTTGATCAGA CTGAATCGAA GACTTTTGCA TTGGCTCTCT TCCAATGGAT GTAGAATTAA ACTTGTGAAG GACAAGAAGG AGTTGGAGAT TTCTGGGACA ACCTCCCAAG TCAGTAATGT AGAAGTCAAT TTGATTGAGT TCTTGAGACA ATCTGAAAAA CAGGAACTCA ACTTGACCGA CATAAAGCAT TTGTTTCAAG AAAAATACGG CCGATTTTCC CTTGAAAAAG TCGCTGGATA TACCGATGTT TTTTTCAAAG ATGTAGGAAA TGATATTTAC GAGGTTCATT CACAGACTCC ACACCAAGTA GAGCGCTTGA AAAGGTTGCT TCTTTGGTCC TTAAACTACA ATAAGCATTC TAACGAAGTA ATTACACTTC CAGAAGAGCC AGAAAAGAAG CAGCTCTTTC CATTTAAGGA CGAATTGTCA TTGTCGTGGA ACAACAGGCG TCAGCCACTC TTCACGTTGA AAGATTGTGG TTCTTCCAGA TCAGAAACGA ACCATCGTTT ACTTTCTGAT TTGCAAAGAT ACGACGATGC CAATATGAAC ATAGATGCAG AGATACCTAA TAAAGAAAAT GATAGTGAAG TTGATACCAT CATAAAAGAT GAAGACGCCC CTATACTTTC TCAGAATAAG ATCAACGACA TCTACGATCA GTTATACGAC TTCGACTACA GAAAGAATTT GATCAGTTTA GAAACTACGC TGTCTCCTGT CTTTACTATA TCGCTTGGCA ATATTCTTTT CGAAGGAGAA CAGCAAGAAA CAAAGAGTGC AATGAGCAAA CTTTTCCCTA CACCACCAGA ACTTTCTGAA ACAACCAAGT TTAAGTTCAA CTCTAACGTT CAGTTGGTAG CTGACAAAGC ACTTTCTCTT CCTATCCACA GTACCAGTCT GACATCAGGT TCAGATGTAT TGGATGACAT TTTCAGAGAT GAACGTTACA AGAATTCAGT CCAAATCACG TTCTTGCCTT CTTTATTTGT AGAAGACAAC AAGGATATCA AGATGGAAGA CTTAACCAAG TTTCAACCTG TGGAGTTGTG GGTTGATCTT AAGCACAACA TGACTCCAGA CATGGACACT TTACAACTTG TAACAGTAGA AGGCGAGAAT ACCAATTACG TTTCGCTACC ATCTTTTAAG TCAGACTTGA AGGTGAGCTG CCAATTATCA GGAAATCTTC TACAAGAGAA AGGCGAATTG GCTCAAGACT TTGAGCTCAG TATTGACGAT ATCTTGAACT CACAAGCCGA CAGGTATACA CGGTTCAAGT CTCAACCTGG ACTTGGTGGC TTTCTCCTGA GATCGAAGCT TGATTTCCAG GAAGAGACTT CAATTTCGCC ATACATAGAC TTGAACATCA ATGGCGATAT TGTCAGGTAT AGATTCATTA GTATGGAATT CCGCAAGAAG TTGAGTTTTG ATTTCAACGG TCGCGAAGTC CAGTACAACA TGGTAGAGGG AGGTAATCTT GGAGGACGTA AGGTCGAGGT ACTTTTTGTA GGAGACATGG CATTGGATGC GTCTGGTGAA GAGAGGAAGA GGTTCGGACA ATTGATGAAC GATGCTGTTC TGTTTGTGAG CGAGCTCTAA
|
Protein sequence | MLLRGRSRLV RHLDPQKSQH VFFSVFPALS SSKNVIPTIP VPELQDSSTI RTFQPQSSRP PSSRKQIFLE PRETFNRNFN SRIEDIKSSG LENPIESKSM TFIESEVDID EASPERLIGI IDKFKPAVAE VNRMRFRSIQ QELEKAFSQQ QLASYLEQNY SKVEFANSRR STRRVSKKKL AQRIINDIWG IKVNSKLTET SDLITKKTVP LSDLDIFLIR SNRRLLHWLS SNGCRIKLVK DKKELEISGT TSQVSNVEVN LIEFLRQSEK QELNLTDIKH LFQEKYGRFS LEKVAGYTDV FFKDVGNDIY EVHSQTPHQV ERLKRLLLWS LNYNKHSNEV ITLPEEPEKK QLFPFKDELS LSWNNRRQPL FTLKDCGSSR SETNHRLLSD LQRYDDANMN IDAEIPNKEN DSEVDTIIKD EDAPILSQNK INDIYDQLYD FDYRKNLISL ETTSSPVFTI SLGNILFEGE QQETKSAMSK LFPTPPELSE TTKFKFNSNV QLVADKALSL PIHSTSSTSG SDVLDDIFRD ERYKNSVQIT FLPSLFVEDN KDIKMEDLTK FQPVELWVDL KHNMTPDMDT LQLVTVEGEN TNYVSLPSFK SDLKVSCQLS GNLLQEKGEL AQDFELSIDD ILNSQADRYT RFKSQPGLGG FLSRSKLDFQ EETSISPYID LNINGDIVRY RFISMEFRKK LSFDFNGREV QYNMVEGGNL GGRKVEVLFV GDMALDASGE ERKRFGQLMN DAVSFVSEL
|
| |