Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_44822 |
Symbol | |
ID | 4838782 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 1604386 |
End bp | 1607592 |
Gene Length | 3207 bp |
Protein Length | 1068 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640390097 |
Product | predicted protein |
Protein accession | XP_001384262 |
Protein GI | 150865160 |
COG category | [O] Posttranslational modification, protein turnover, chaperones [Z] Cytoskeleton |
COG ID | [COG5234] Beta-tubulin folding cofactor D |
TIGRFAM ID | [TIGR01435] glutamate--cysteine ligase/gamma-glutamylcysteine synthetase, Streptococcus agalactiae type |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.0357754 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACGACT CGGCTGAACG AGATATACTC AAACGAAACG ACCGTCTCCA TAAAGACATC AGCTCTTCTA TAGATTCGCT CGTAGAAATC GCCAGAAATT CACTGCTCAC GGATCACGAT AAGAAAGATG CCTCAACTGT GACTTTTCAG AGACTCAAAT TGTGGATTAA TGAGTTCGAG CCCTCGCCTA AGTTGCTTGA TATCCATTTG GCCGAGTATA TTGACAAGTT AACAACGTTG TTCTTATGGC TATTTCGTCA TTCGCACCAT TCCAGCGACT TGACTAAAGG AATAGGTGCA ACCATCTATG AATTGTCCAA GATCAGAGGG TTCAAGTTTG TAACCAATTT CTTCTCCAGT GATTTATACC TTGTAGGCAC TCTTATTGAT ATCGTGTCCA CGCTTGAGAA TGATAACGAG AAGTTTCTTG GTCTCATATG GCTCTCAAAC TTGGTTCTAG TGCCGTTTCG TCTACTTGAA ATCGACTCTG AAATGGTTGC CAGAGTTTTG CAATTGGCCA TTACCAACTT GAAATTGCAT TCAAATGCTT CAAAGAACCA ACTTGTAGCA CTGATCTTGC TTTCACGTCT AGTGACTAGA TCAGATACCC CGATATCTCT CAGACACTAC ATGCTTGATG TAGTGGAACC GGAGTGGCAG GAGACTGGAA AACTTCATAA CGAATCTGTA AAACTCGGTC ACCTAATGAC AATTAATAAG ATCCTCAAGA GACCAGAGTC GGACTTGTAT GAACCATACT TGCCCCTCAT TTACGAATTG GTCTGTATAG ATTTGCTAGC CTTAAGACAA GCTCCAGACT CTGTCAACAA CCTCAATATC ACCTATTTAA TCAAAATACT CAGCAAATTA TCCAAATTCT ATATTCATCG AGACAACTAT GATATGGTAG CAGCAGTGAT TAATAACTTG TTGCATGATA TTATGGACGT ACTTATCAAT AAGTTTGATA CTAATAATCG ATATGCCATG GCGAAGGCAC TCGGCCATCT TAGTCTTAGC CTTTCGCATC CAGCAATCAA CTACCAGCAC CAATTAATAA TACATTTGAT TAAGCAACTA GAATTGCCTT TGCAAATGGA CACCTTTTCA TCTCAAATGG AGATTAACTC AGACAATATT TTCATTCCCA AATACCATAC AATTTTACTA TTCTTAGGCT ATATTGCCCT CAACAAGTCG ATGCCGCTAG AATTCGTAAA TGTGGTGCTT ACCATTGTCC ACAAGACGCT CTTTATTCGA CAATATCGTC TTACATCAGT TGTTGGAACT CAATTGCGAG ACACATCTTG CTTCGTTATT TGGGCCATAA GTCGAATGCT TAAACCTGCA AATCTTGTAC CTAAAGATGC AAGCAAAATG ATGGAAACAA TATTTGTAGA TTTGGTTAAA GTATCAGTTT TTGATGAAGA ACTTCTAATT AGAAGGTGTG GAATGGCAGT CCTCCAGGAG TATGTGGGAA GATTTGGTTC AATCTTGTAT CAGTATCTGT CCGGAGAAGA GCGAGGGAAT CGTATAGTTT CCTTCATTCA GCTATTCAAT AGCCAAAGTA TCAAAACTAT ACGTCTGTCT TATGAAATCA TCCTACAATT ACTTGAGCAA CAGTTTGATA GAAATATATT TATTGGTGAA CTATTACAAA ATTCAACAGA CGATTCTAAG TCGTATGAGG TTAGAAAACT TAGCGCAACC TACCTAAGAA AAGTTTTAGC CTGCAAACAG AAGAATGTGA TCATTGGTTC TGATCTGTTT CCTGATTACG GTGTATTTGA TTTCGTAACA CGATTCATAG AGTGTGGAAA TCTCACTGCT GCTTGTGAAG TGCTTGACAA AATCGATGTT AAACGTGTTG AGCATAAATT GGTAGAGTTG AATAGTCTGT TTTCTTTTGA CTTCCACAGA GACAGTATCG AAAAGGCTAT AGGCTACCTC AGACTTCTTA GTAAGTTAGT TGAAGGTTGT AATTACAAGT TCCAGGAATC GGACTGGACA AACCTACTCG ATATTTCCCG AGTCAGGAAT AAGCACGAGG AATTAATACC TCTATTCAAA GAGGTTATCC AGGTTCATCA TGCTACCCCA GATTCAATAG CATTAAAGAT GAGAGATTTG ATCAAGAGCA ATAATCTAAT TTTGTCCAGA ACGACTTTAC ATGACTCTTC ACTTCTGAAG TCTCTGTTCG AGACGATATT CGAATTAGTG TATGACACGT CTGTCGACTG TGAAATTCGG GCAAACATGG TTTCTTCTCT TGATTTCTTT GTCGACACTC CAGAATTCTT TGACGATATT AGGGTGGAAA ACTTGCTTCA TTTGCTTGAT GATTATACAA TAACGAACCA GGGAGACGTC GGTTCCAAAG TTAGATTAGC AACCCTTGGC GTGATCAAGC AGAACTTCAG CATGTTCGTG AAAAAAGAAG ATATCTTGAT GGGAAAATTG TTGAGATTGT CAGGTGAGTT GATTGATAAG ATCAGAATTA GTTCCTTTGA GTTATTTCTT CGGGGAATGA ATATCCAGCA TCCCATGAAT GCTAATAACG ACATCGGAAA ATACTATTCA AGTTTGTTTA CTGTGTACCT CTCAAACCCA AGGGTGAGTA CATGGGCGAA TTCCTTCTGG AAAGGAGTCT GTTATAGTAT TGGAGCTACA GCTGCTAACC GTTCAGTAAT TAACGAATCT TTTCACCAAT TGCTAAAGTA CTTGGAGTCT GGTGATAGAT GCCAAGATTT AAAGGAAGTG CTACTATTAC TAAATACCAC TACAAGCGTA CTGGACAGGC AGTCAAAGGG CTGGGTACTG GTACTCAATG TGTTTGTCAA GCTCTTTGAG TGTAACTATA GGTTCCCCTC TGATTTCCCA TTCGAGTCTT TGTATGTGAA GTGCTACAAT TTGCATATCA ACACCAAGAA CTCAGCCAGA ATCGGAGCAG TGATGAGAAT TTTTCTTTAT TTGATATTGC TGGATCAAGT TGACTCAAAC TTGAAGCTGA AGGTGACGGC TAGATTGTTG TGGATATGTT GCAACCACAG GTTTGAAGGA ACCAGAGAAT TAGGAAGTAG CTTGATTTTT GAACTTGCCA ATGAGATCAT GAGTGAAGAA GCCATGGAAT ACATTTCTGC AATTGACTGG AAGCAACTGC CGGCGAAACT AAAGATTCAC ATCACGGATT TACAAAGATT TTTTTGA
|
Protein sequence | MDDSAERDIL KRNDRLHKDI SSSIDSLVEI ARNSSLTDHD KKDASTVTFQ RLKLWINEFE PSPKLLDIHL AEYIDKLTTL FLWLFRHSHH SSDLTKGIGA TIYELSKIRG FKFVTNFFSS DLYLVGTLID IVSTLENDNE KFLGLIWLSN LVLVPFRLLE IDSEMVARVL QLAITNLKLH SNASKNQLVA SILLSRLVTR SDTPISLRHY MLDVVEPEWQ ETGKLHNESV KLGHLMTINK ILKRPESDLY EPYLPLIYEL VCIDLLALRQ APDSVNNLNI TYLIKILSKL SKFYIHRDNY DMVAAVINNL LHDIMDVLIN KFDTNNRYAM AKALGHLSLS LSHPAINYQH QLIIHLIKQL ELPLQMDTFS SQMEINSDNI FIPKYHTILL FLGYIALNKS MPLEFVNVVL TIVHKTLFIR QYRLTSVVGT QLRDTSCFVI WAISRMLKPA NLVPKDASKM METIFVDLVK VSVFDEELLI RRCGMAVLQE YVGRFGSILY QYSSGEERGN RIVSFIQLFN SQSIKTIRSS YEIILQLLEQ QFDRNIFIGE LLQNSTDDSK SYEVRKLSAT YLRKVLACKQ KNVIIGSDSF PDYGVFDFVT RFIECGNLTA ACEVLDKIDV KRVEHKLVEL NSSFSFDFHR DSIEKAIGYL RLLSKLVEGC NYKFQESDWT NLLDISRVRN KHEELIPLFK EVIQVHHATP DSIALKMRDL IKSNNLILSR TTLHDSSLSK SSFETIFELV YDTSVDCEIR ANMVSSLDFF VDTPEFFDDI RVENLLHLLD DYTITNQGDV GSKVRLATLG VIKQNFSMFV KKEDILMGKL LRLSGELIDK IRISSFELFL RGMNIQHPMN ANNDIGKYYS SLFTVYLSNP RVSTWANSFW KGVCYSIGAT AANRSVINES FHQLLKYLES GDRCQDLKEV LLLLNTTTSV SDRQSKGWVS VLNVFVKLFE CNYRFPSDFP FESLYVKCYN LHINTKNSAR IGAVMRIFLY LILSDQVDSN LKSKVTARLL WICCNHRFEG TRELGSSLIF ELANEIMSEE AMEYISAIDW KQSPAKLKIH ITDLQRFF
|
| |