Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_56468 |
Symbol | CTA4 |
ID | 4837060 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009042 |
Strand | - |
Start bp | 1890740 |
End bp | 1893469 |
Gene Length | 2730 bp |
Protein Length | 876 aa |
Translation table | 12 |
GC content | 40% |
IMG OID | 640388375 |
Product | Fungal transcriptional regulatory protein |
Protein accession | XP_001383134 |
Protein GI | 150864356 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.753603 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.102552 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCCAT CCACAGAACC TGGCAGAGTC AGAAAAAGAA ACAAGCCAAC TTTGGTATGC ACCAACTGCA AAAAGAGAAA AATCAAATGC GATCGTAAGA TGCCATGTTC CTCATGCGTA AAGATGAACG CGGGTTTCTC GTGTTCATAT GAAACCAAAT GGCGATCTGT TTCCTTTGAA GGAAGATCCC CAGCATTGCA TGGTTCTAAA TCACAATTGG AACATCAAGT TCTCAGAGAT TCTAATATTA CCAACACTAG CAGTGGTTCA GGTAGTGAAA ATGGATCAGG ATCTAATGAA ACAAACCCTT TGAAAGAAGA ATTGTCTTCA CTCAAGAGGA AGCTTAAACT TCTAGAAGAA ACCTTGAGTT CACATGATTC CGCCACCCCA GAGGAAAAGT TCAATGCCAT AAATTCCGGG AACTCCAATA GTTCCAGAAT CACAGGCTCT AATCTGGACT CTGCTTCCAG TATACATTCC AACTCAGTCA ACCCAGTAGT TTCTGACGCT GACTTATTGG ACCTATATGG AGGCTACACT TCATTACAGG TTAAGGGAAA TATTCGAAGA GTAAATTATG GGCCACTTTC GTGGGCTTCT CTAATGCGTA AGGATCCTTG GTTGAACGTT TTGTGGAAAT ACGTGGACAA CAAACAACTA GGGCTTACCT GTTTGATAAC CCAGAAGGTA CCCCAGATAT CTCAGGATGC CATCAATACC CTTAATACTG ACAGAAGTGA AAGCACCGAA CTGAAGCAGG GGAACGAAGA AATTTTCCAG AAAAAGGCAT TGGAGAATGA AGGAATAGAC GAGATGGTAC CTTTCAAAAG TTTGACCAAG TTTACAAAAA ACCTGAATGA TACTAGTACA AAAATTTTAG AAAATGTCAA TGTCAGTACT GTCACTTTGG GCAAGACCCT TTTTGATGGG AGATGCAATC CGGAATTACA ACTTATTGAA AAAATAAAAA TGATATTACC TAAGAAAAAG GTATTATGGA GCTTAATAGA CAGATTTTTC ATGAAGTTAT ATCTCCATTT CCCCTTTATT GACGAACTTG ACTTCAAGAA GGAACTTTCC AAAATTATAG GTCCTGTCAG CTACGAAGAT GTGCCATTCG ACAAGGTAAA AATCGAAAAG AAACTCGACT TGGCCATTAT AGCAATCTGT TTAATCTTGT TGAGACTAAC CTACTTGTCA TTGTTCTCCA ATAGAAACTG TGTCAACGTC CATAGGATGA ACTCTTCGGA TGATAATATA GAAAAGTACT TGCTTGAGAA CTCCATCAAT TTGGTCAATA TTGAAGTAGC CAACTCTTGC ATTGAGTGTT TCCAATATGG CAGAAAATCT AATCTCACAG TATTCCAGGC CCTTTTATAC ATGAGACTTT ACCGTTCCCA TGCTCCGGAA GAAGGTGATG GTGTTGATGG TGGTGACTCG CAGGTCGCCA CCGCTATGCT CATTCAAATG GCATTTTCGC TAGGCTTGAA TCGAGAACCA GAAAAATTTG ACAATTGCTT AGATCCCAAG ATTAACAACA TGGGCAGGAA GATATGGCAC TTTTTGGTCA GATCTGATTT TATACATTGC TACAGTGTTG GGAATCCAAC AACTATTCAC TTATCACATT TTGATACTAA GATCCCTTTT CTTGCTGAGG GGAACAGCAA TTTGGTAGAT TTGGAACGCG AAACTGCAGC AATTAGATCG TTCTCATACT TGGGAGAAAG TTTGGGTACA TTAAGAAAAG TTCTAGACAT GGTGATGGAC CTCAATAATG GAATTCCCAT AAATGAGTTG ACGCAGAAAT TGAACGTCGT TGAAAGAGTA GCTAGCGAGG TTTTCAATGT TATTGAAAGT ATTAGAAAAA TGCAATCCAC CAATGACTTG ACGTTGTTTG GCTATGTTGT GAAGTCGAAA ATCTTCATCT CTGTCAAGAG TTCCTTGTTG ACACTCTACT ATCATTTGTA CTTGCATTAC GAAAAACTTT ACAATAACGA GTTGTCGTAC TTCTACTTGA AGAAGATGTT TGCCATTATT TACGAGGATA TTTTGCCATA TTTGTTTGAA TTATTATACG GTAACTTGGC TAATTCAGGC TTGACTCTTA ATCCCTCCAT GGAACTTGCA CTTCACAAGA GCAACCAAGT AAATCTATCC TGCTTAGCAA GAGTCAATTA TCTTCGTACT ATGATGGAAC TGAAAGCCGA TCACGCCAGA AGATTGCAAA TAGACCAGGC TTACAACTCT CATTACTACC GCTTCAAGAG TTTGTCTACT AACCTCAGAA GAACCGGAGA TTTGATAACT ATGATATTCG AAAGATTCGG TACTAGATAC TACTACGCCT GGAGAGTATC CAAGGCACAT TCAAGTCTAT TCAAGTTCTT GGGTTCCAAT GAGTTATTCA CGAAGTCAAT TCCTGGTATC AAGAAATTGC ATGCCTTCCA GTTTACTGCT GAACAGCTTG AAGATTTGGA CAGTGTGGTC CATGCCATTG GTAAGAGAGT GGAAGGTGCT GTGTTCTATG AAGATAGTGT GGACAAAGCT CCACCATTCG AGAAAGGCAC TGGCATTTCT CCTTCAATTA GTGCCAAGTC GGAGATTGTA ACGCCTTACT CTTCCATAGG CTCTCCTCCG CAAGAATCGC GTCCAAATAA CCAATACATA GATCAGATGT GGTTGCTGAT GATGGCGATG AAGTTTGACC CAGTAGATCT GGAAGGAAAT
|
Protein sequence | MSPSTEPGRV RKRNKPTLVC TNCKKRKIKC DRKMPCSSCV KMNAGFSCSY ETKWRSVSFE GRSPALHGSE NGSGSNETNP LKEELSSLKR KLKLLEETLS SHDSATPEEN SRITGSNSDS ASSIHSNSVN PVVSDADLLD LYGGYTSLQV KGNIRRVNYG PLSWASLMRK DPWLNVLWKY VDNKQLGLTC LITQKVPQIS QDAINTLNTD RSESTESKQG NEEIFQKKAL ENEGIDEMVP FKSLTKFTKN SNDTSTKILE NVNVSTVTLG KTLFDGRCNP ELQLIEKIKM ILPKKKVLWS LIDRFFMKLY LHFPFIDELD FKKELSKIIG PVSYEDVPFD KVKIEKKLDL AIIAICLILL RLTYLSLFSN RNCVNVHRMN SSDDNIEKYL LENSINLVNI EVANSCIECF QYGRKSNLTV FQALLYMRLY RSHAPEEGDG VDGGDSQVAT AMLIQMAFSL GLNREPEKFD NCLDPKINNM GRKIWHFLVR SDFIHCYSVG NPTTIHLSHF DTKIPFLAEG NSNLVDLERE TAAIRSFSYL GESLGTLRKV LDMVMDLNNG IPINELTQKL NVVERVASEV FNVIESIRKM QSTNDLTLFG YVVKSKIFIS VKSSLLTLYY HLYLHYEKLY NNELSYFYLK KMFAIIYEDI LPYLFELLYG NLANSGLTLN PSMELALHKS NQVNLSCLAR VNYLRTMMES KADHARRLQI DQAYNSHYYR FKSLSTNLRR TGDLITMIFE RFGTRYYYAW RVSKAHSSLF KFLGSNELFT KSIPGIKKLH AFQFTAEQLE DLDSVVHAIG KRVEGAVFYE DSVDKAPPFE KGTGISPSIS AKSEIVTPYS SIGSPPQESR PNNQYIDQMW LSMMAMKFDP VDSEGN
|
| |