Gene PICST_53992 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_53992 
SymbolHAP1.3 
ID4851602 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009068 
Strand
Start bp2263784 
End bp2266057 
Gene Length2274 bp 
Protein Length758 aa 
Translation table 
GC content36% 
IMG OID640393310 
ProductCYP1/HAP1-like protein 
Protein accessionXP_001386784 
Protein GI126275001 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCGCAC AGCCGAAGAA AAGAAAGAGA CTACCTTTGA GCTGCGATAA TTGTCGAAAG 
AAGAAAGCGA AATGTGATAG GAATTTTCCT TGTTCAAACT GTATCAAGTT AAGCATATCT
CATACTTGCA TTTACAGCTC TCCAGATCAC ATCAATCATA CAAAATCTGT CACTTCGGTG
TTTAATAACG AGAGCTTACC TGTTCACAAA ACTAGCTCTT CCGTTCACAG TGAATTGTAT
TTATTGAAAT CAAGACTAAA TGATCTTGAG ACTTCAATTA CAAATACACC TGTGCAGCCA
AGATTGTTAT ACTCTTCTGA TGGACTTAGT CAAGCCGACA TATTTATTGA ATCAGATAAT
GGCCTTGCTA AGTCCAATAA GGACTTGCTC GTAAATATCT TACCTTATGA TTCAGAAGAT
GAGACAATTA ACTTTTACTT CAGCGAGGAA GATGGCTCGC GAGTTTCAAG ACAACCCTTG
CCCTTCATTT TCTTACTGAA GAGAGATCCC GGAGCTAAGT TCTTTTGGCT CATTAAGAAA
GAAAGGAAAA CCAAGACAAA CATAAAGGAG ATTTACCAGT TTATGAATAG GACTGGGGAG
CTTGAAGAAA TGAAACAAAT GGCTAAAGTG AAATTTGGAG GTCGATACAT CAAAGCGCTA
GAAGACGGTT ACTCAATTCA AGACGTAAAA GAATCATTGT CAATTTATGG AAAAGAACTA
GGATTAAGCT TTCATACTGC TGACATCAGT GAATTGAGTT TGGAACAAAA GATAGCCGCC
ATCATTCCCG ACCTGGTGGC TTCATGCAAA TACCTCACTT CATTTTTTGA ATATTTGTAT
CCGTTCTTTC CCATTGTTGA TGAAATGAGT TTCCGCTCGG ATTTGACTCG AATACTTGGT
GATCTGGCTG ATGGCTCAGT GAACAAAACA TTGATACATA TAGAGAAAAG AACTGATCAT
GCAGTTCTCT CATCATATTT GTATATCATC CGTTTGACTT ATTTGTTATG GTTTACAAAC
GACAAAAAAT ACAAATATGG CCTACCGGGT GATGTTTCTC TGTCCTATTC AAGAGAAAGA
CAAGTTGCAA TGGATAATCC GGTTCCGTTG GAAGCAGCAA CACTAGCTCA AGAGTGTATT
GACCAATGCA ATTTAACAAG AAAACCTCAA CTTGCCGTGT TACAAGCATT AATTTTTTCT
AGGGTATATG ATAAATTCGC TCCTGAAATA GGAGAAAAAT TCAAAGGACA AGAGACGCAA
CTATTGGATG GGATAATAAT GAATTTGGCT ATTGCACTAA ACCTTAATAG AGATCCAGAC
TATGCTCTAG AAAAACAGGA AGACAATATG AAAAATTTGA AAAGGAAAAT ATGGCACACC
ATTCTTTTTA TTGATCTCAT TGACACGATG ATATATGGGA CAACTTTATC TGCTGACCGA
GATCACAATT ACGATATGAA GATTCCATAC TACAAAGAAG GGAATGAAAA TATTGTTGAC
GTTCAACTTG AAAAGGATGT GATTAAATCG ATTGCATGCT TTAGACCTTT AATACTCAGT
ATGGACAGAA TCGCTAAAAG ATTATTCGAT GTTAAACAGA GAGTGAAAAT TTCAATAATT
TTGGATGAAA TAAGAGATTT GGAAATGCTT ACAGTGAGAA TATTCGGAAG ATTCAAATGT
TTCTTGAAAC CGGACTTTTC CAATTCTGAT TCATTTGTGA TACAAGAAAT GTATTACTAT
TTGAACATAA AGGCATTTCT AGTAACCATT TATTTCTATT TTCATATTCA TTTTCACGAG
AAGGGTAATC CTGACCTCGA ATTCTTCTAC CATAAGAAGG TATTTACTAC ACTATTCTAT
GAATTAGCAG AATTATCCAA TTTGCAAACA TCTGTTAATA ATAGGACTTT TGGAAGGGCT
GTGACGCTAA TCTTATCACC TGTCATTAGT AGGATAGATC ACATATCTTG CATGGTTGCA
TCTTCATTCT GCGTTCGCTT AACAGCCACT CTAAGGCAAA GAATGGAAAC AATAGACTCA
CAGAATGCTC AGTTTACCTC AAAAGAATTG GCACAAGATT TGCTTCAGAT GTCATTAATT
AGCGCAAGAA ATTCGCTTGC AAGGCTCTCG ACCACTAGTA TTCGGTACTT CCATTCTTGG
AAAATCAACA AAGTACATCT TTTTGGAATT AATCTAGTAA GGGAAACACT GATGTATGAG
TGTAGCAATA GTTCCGTTAC AAAATGTGCC AAGTTCAATT ACACTAATCA GCAA
 
Protein sequence
MPAQPKKRKR LPLSCDNCRK KKAKCDRNFP CSNCIKLSIS HTCIYSSPDH INHTKSVTSV 
FNNESLPVHK TSSSVHSELY LLKSRLNDLE TSITNTPVQP RLLYSSDGLS QADIFIESDN
GLAKSNKDLL VNILPYDSED ETINFYFSEE DGSRVSRQPL PFIFLLKRDP GAKFFWLIKK
ERKTKTNIKE IYQFMNRTGE LEEMKQMAKV KFGGRYIKAL EDGYSIQDVK ESLSIYGKEL
GLSFHTADIS ELSLEQKIAA IIPDLVASCK YLTSFFEYLY PFFPIVDEMS FRSDLTRILG
DLADGSVNKT LIHIEKRTDH AVLSSYLYII RLTYLLWFTN DKKYKYGLPG DVSLSYSRER
QVAMDNPVPL EAATLAQECI DQCNLTRKPQ LAVLQALIFS RVYDKFAPEI GEKFKGQETQ
LLDGIIMNLA IALNLNRDPD YALEKQEDNM KNLKRKIWHT ILFIDLIDTM IYGTTLSADR
DHNYDMKIPY YKEGNENIVD VQLEKDVIKS IACFRPLILS MDRIAKRLFD VKQRVKISII
LDEIRDLEML TVRIFGRFKC FLKPDFSNSD SFVIQEMYYY LNIKAFLVTI YFYFHIHFHE
KGNPDLEFFY HKKVFTTLFY ELAELSNLQT SVNNRTFGRA VTLILSPVIS RIDHISCMVA
SSFCVRLTAT LRQRMETIDS QNAQFTSKEL AQDLLQMSLI SARNSLARLS TTSIRYFHSW
KINKVHLFGI NLVRETLMYE CSNSSVTKCA KFNYTNQQ