Gene Information |
Locus tag | PICST_53992 |
Symbol | HAP1.3 |
ID | 4851602 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 2263784 |
End bp | 2266057 |
Gene Length | 2274 bp |
Protein Length | 758 aa |
Translation table | |
GC content | 36% |
IMG OID | 640393310 |
Product | CYP1/HAP1-like protein |
Protein accession | XP_001386784 |
Protein GI | 126275001 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGCAC AGCCGAAGAA AAGAAAGAGA CTACCTTTGA GCTGCGATAA TTGTCGAAAG AAGAAAGCGA AATGTGATAG GAATTTTCCT TGTTCAAACT GTATCAAGTT AAGCATATCT CATACTTGCA TTTACAGCTC TCCAGATCAC ATCAATCATA CAAAATCTGT CACTTCGGTG TTTAATAACG AGAGCTTACC TGTTCACAAA ACTAGCTCTT CCGTTCACAG TGAATTGTAT TTATTGAAAT CAAGACTAAA TGATCTTGAG ACTTCAATTA CAAATACACC TGTGCAGCCA AGATTGTTAT ACTCTTCTGA TGGACTTAGT CAAGCCGACA TATTTATTGA ATCAGATAAT GGCCTTGCTA AGTCCAATAA GGACTTGCTC GTAAATATCT TACCTTATGA TTCAGAAGAT GAGACAATTA ACTTTTACTT CAGCGAGGAA GATGGCTCGC GAGTTTCAAG ACAACCCTTG CCCTTCATTT TCTTACTGAA GAGAGATCCC GGAGCTAAGT TCTTTTGGCT CATTAAGAAA GAAAGGAAAA CCAAGACAAA CATAAAGGAG ATTTACCAGT TTATGAATAG GACTGGGGAG CTTGAAGAAA TGAAACAAAT GGCTAAAGTG AAATTTGGAG GTCGATACAT CAAAGCGCTA GAAGACGGTT ACTCAATTCA AGACGTAAAA GAATCATTGT CAATTTATGG AAAAGAACTA GGATTAAGCT TTCATACTGC TGACATCAGT GAATTGAGTT TGGAACAAAA GATAGCCGCC ATCATTCCCG ACCTGGTGGC TTCATGCAAA TACCTCACTT CATTTTTTGA ATATTTGTAT CCGTTCTTTC CCATTGTTGA TGAAATGAGT TTCCGCTCGG ATTTGACTCG AATACTTGGT GATCTGGCTG ATGGCTCAGT GAACAAAACA TTGATACATA TAGAGAAAAG AACTGATCAT GCAGTTCTCT CATCATATTT GTATATCATC CGTTTGACTT ATTTGTTATG GTTTACAAAC GACAAAAAAT ACAAATATGG CCTACCGGGT GATGTTTCTC TGTCCTATTC AAGAGAAAGA CAAGTTGCAA TGGATAATCC GGTTCCGTTG GAAGCAGCAA CACTAGCTCA AGAGTGTATT GACCAATGCA ATTTAACAAG AAAACCTCAA CTTGCCGTGT TACAAGCATT AATTTTTTCT AGGGTATATG ATAAATTCGC TCCTGAAATA GGAGAAAAAT TCAAAGGACA AGAGACGCAA CTATTGGATG GGATAATAAT GAATTTGGCT ATTGCACTAA ACCTTAATAG AGATCCAGAC TATGCTCTAG AAAAACAGGA AGACAATATG AAAAATTTGA AAAGGAAAAT ATGGCACACC ATTCTTTTTA TTGATCTCAT TGACACGATG ATATATGGGA CAACTTTATC TGCTGACCGA GATCACAATT ACGATATGAA GATTCCATAC TACAAAGAAG GGAATGAAAA TATTGTTGAC GTTCAACTTG AAAAGGATGT GATTAAATCG ATTGCATGCT TTAGACCTTT AATACTCAGT ATGGACAGAA TCGCTAAAAG ATTATTCGAT GTTAAACAGA GAGTGAAAAT TTCAATAATT TTGGATGAAA TAAGAGATTT GGAAATGCTT ACAGTGAGAA TATTCGGAAG ATTCAAATGT TTCTTGAAAC CGGACTTTTC CAATTCTGAT TCATTTGTGA TACAAGAAAT GTATTACTAT TTGAACATAA AGGCATTTCT AGTAACCATT TATTTCTATT TTCATATTCA TTTTCACGAG 
AAGGGTAATC CTGACCTCGA ATTCTTCTAC CATAAGAAGG TATTTACTAC ACTATTCTAT GAATTAGCAG AATTATCCAA TTTGCAAACA TCTGTTAATA ATAGGACTTT TGGAAGGGCT GTGACGCTAA TCTTATCACC TGTCATTAGT AGGATAGATC ACATATCTTG CATGGTTGCA TCTTCATTCT GCGTTCGCTT AACAGCCACT CTAAGGCAAA GAATGGAAAC AATAGACTCA CAGAATGCTC AGTTTACCTC AAAAGAATTG GCACAAGATT TGCTTCAGAT GTCATTAATT AGCGCAAGAA ATTCGCTTGC AAGGCTCTCG ACCACTAGTA TTCGGTACTT CCATTCTTGG AAAATCAACA AAGTACATCT TTTTGGAATT AATCTAGTAA GGGAAACACT GATGTATGAG TGTAGCAATA GTTCCGTTAC AAAATGTGCC AAGTTCAATT ACACTAATCA GCAA
|
Protein sequence | MPAQPKKRKR LPLSCDNCRK KKAKCDRNFP CSNCIKLSIS HTCIYSSPDH INHTKSVTSV FNNESLPVHK TSSSVHSELY LLKSRLNDLE TSITNTPVQP RLLYSSDGLS QADIFIESDN GLAKSNKDLL VNILPYDSED ETINFYFSEE DGSRVSRQPL PFIFLLKRDP GAKFFWLIKK ERKTKTNIKE IYQFMNRTGE LEEMKQMAKV KFGGRYIKAL EDGYSIQDVK ESLSIYGKEL GLSFHTADIS ELSLEQKIAA IIPDLVASCK YLTSFFEYLY PFFPIVDEMS FRSDLTRILG DLADGSVNKT LIHIEKRTDH AVLSSYLYII RLTYLLWFTN DKKYKYGLPG DVSLSYSRER QVAMDNPVPL EAATLAQECI DQCNLTRKPQ LAVLQALIFS RVYDKFAPEI GEKFKGQETQ LLDGIIMNLA IALNLNRDPD YALEKQEDNM KNLKRKIWHT ILFIDLIDTM IYGTTLSADR DHNYDMKIPY YKEGNENIVD VQLEKDVIKS IACFRPLILS MDRIAKRLFD VKQRVKISII LDEIRDLEML TVRIFGRFKC FLKPDFSNSD SFVIQEMYYY LNIKAFLVTI YFYFHIHFHE KGNPDLEFFY HKKVFTTLFY ELAELSNLQT SVNNRTFGRA VTLILSPVIS RIDHISCMVA SSFCVRLTAT LRQRMETIDS QNAQFTSKEL AQDLLQMSLI SARNSLARLS TTSIRYFHSW KINKVHLFGI NLVRETLMYE CSNSSVTKCA KFNYTNQQ
|
| |
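The coordinate and length fields in the record above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates (an assumption, though it matches the listed Gene Length). The helper names `gene_length` and `gc_percent` are hypothetical and not part of any database API. The GC helper can be applied to the Gene sequence field after the spaces between the 10-base blocks are ignored.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of seq.

    Spaces and any other non-base characters are skipped, so the
    10-base blocks of the record can be pasted in directly.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        return 0.0
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Values copied from the record above.
length = gene_length(2263784, 2266057)
codons = length // 3

print(length)  # 2274, matching the listed Gene Length
print(codons)  # 758, matching the listed Protein Length
```

Under these assumptions, 2274 bp divides evenly into 758 codons, the same number as the listed protein length in amino acids.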