Gene PICST_68007 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_68007 
SymbolHBR1 
ID4840334 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp265555 
End bp267869 
Gene Length2315 bp 
Protein Length520 aa 
Translation table12 
GC content42% 
IMG OID640391649 
Productzinc finger transcription factor 
Protein accessionXP_001385742 
Protein GI150866220 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.011178 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GACAAGAAGC CTTATTGAGA TAGACTTGAG TGTCTCCTTT TATCTCCTTT GAATCAAAAA 
CCGTCTGGGT TGAATTCGTT TTGTATATAT ATGATTTGAG TCGAACTAAA ATTAAAGTGA
ACACTTTTCC AATTCACTAA ACGATCAACC ACGACTCTCA CAATCGCATA CGCACATATT
CCAGAAATGA CCAAGCGGCT CAGTCCTCAT GAGAAGAAGA ATAGGAAGCC TGCATCACGA
GCGTGCGTAT TCTGTCATGA AAAACATCTA CAGTGCTCGA ATGAAAGACC TTGTAAGAAT
TGTGTGAAAA GGGGCCTCGC CCATGAGTGT CGTGATGTGA TACGAAAGAG AGCTAAGTAT
CTCAACACAA ATTCACGAAG AGGCAGCGAA GCTCAGGCAC AGTCCGGTTC TAAAAGAGCA
AGAACAAGTG TCAGTTCAAC CATAGACTCA TCGACAAGGT CTTCGCCTGG AAGCTCTAGT
GCAGATTATC CCCATAACCC AATAGACGGA TTCATGTCAC CAACTATTAA ACCGGAGATA
CCGTCGCCAG CATCGAATAT GGTTGTTTCT CCACAATTTG TAGGCAATGC TTTACTTCAA
CAGGAGGTAC AACATCCTCA GCAGTTTCAT CCTCAGCAAC AACAGAAATT ATCTCTTCAT
AACTCCATGT TGAACACTAC CAACGATGTG CTTAATAGGT TACTCGAAGA ACAAAATTTC
AAGGACACAG ACTCCGACAA CATGTCAGCA AACTCTGTAA ATGCCAGTAG ACCCAACACT
GCCATAGGAA CAGGAACATT CAGTTCAAAC TATTTGAACG AAGAGTACTT GATGTTGGGA
GATATAATCT TGCATTCAAA GCCGACGTCG CCGTCGCCTT CTAACACCAG TGTTTCAGAA
TATAATACAA ACACAGTATC TCCCAATTTT AGTAGTCAAA TCAACTACGA CGACCTTAAC
CAGCCTCGGA GAAAAGTTTT GCAGCGACTC AAGGATTCTC GTCCCTTCAT ATCACTTGGG
TTTTCGAACG AATCGAGCCA ATTACCTAAT CTAAACAGCA GTAATGTTAA ATTGGAGTCT
ACTGAATTCC TTGATAACAA TGTTACCCAG CGTCCCATGT TTCAAGAAGC AATCAATAAC
CCTCTCATGC ACAAGATAGC CCAATCGTCG TCTATTCCAA CAGAGTATGT TTCTCCGCTT
GTAACACACC ATCTCTATCA GTCCGTACAG GATATATACA CCAACAACAT TATGAACTTT
GATTATCCGC AATCGTATCA TCTGTTGACC CATTTTTTAA AGAAACGGTT CCTGGGGAAC
AACCTACCTG CGGAACAAAA GCAAGCCAAA AGGCAGAGTC TACTTGTAAT TCTTAAGTTA
ATTGCAAGCT ACAGACCTAC ATTTATTTCT GCCCACAAGT CACTCTTGAA GCCCTACGAT
TTACAGTTCT TGGAAATGAC GTTCCAGCGT TGTTTAATTG ACTACGAGAA GCTTTCGCAG
CTAAACTCAT CGCCCACTAT TATTTGGAGA AGAACCGGCG AAATCGTGTC GATAACGGAC
GATTTGCTCA GTTTACTTGG TTACAATCTA GCCGACTTAT TGTCGCACCG TACATTTATT
ATGGAGCTCA TGTATGACGA TGAGTCGATT ACCAACTATT TCCGGTTGTT TAAGACTGTC
GCAGTCGGAA ACCTCCATCT GAGTATTATT ACCAAAATCA AGCTCACCAA AAACCAAAAT
AGAAACGTAT CGGATCAAAC AGGAACAAGA CGGCTTTCAT ATGAGTTGTC CGAGAGGGAT
CACATCGAGT TTTGCTCGGT ATGGACCGTT AAGCGAGACA TGTTTGATCT ACCTATGATG
ATCATAGGTC AATTCCTACC AATTCTTCCT GCAGGAGACG GTGTGAGGAT GTACTAAAAA
TGTGACGAGT GGAACCACTA GATGCAAAAG TGGACACACC AATACAGACA TCCACCCGCC
CTGGTAGAGT AAAAAAATTA CATGATCATG AGACCCCTCT CGATAAGGTA ATCATCGGCG
ATAGATTGTC ATTTCTGGGT GGTGACGGTC GATGAATAAT AAGGTGTAAG ACCAAATGCA
GAAATGGAGT GATCTTTTAG TGGGGTGACA TTTTCTTTTT TCGCATACAG TAGAAATTTT
TGCGACTGCC CCTTCTAACT GGATTGTCTT CCAGCTGAGC ACTATGACCA CTTAGAAGTG
GTAGAGATCC ACTTACGAGG ACTATATATA CTGGGATTTA GGCGTATCTA TTTATTTCTA
TTTATTCTGT ATTATACAAC AAAGACAACT TTGCT
 
Protein sequence
MTKRLSPHEK KNRKPASRAC VFCHEKHLQC SNERPCKNCV KRGLAHECRD VIRKRAKYLN 
TNSRRGSEAQ AHADYPHNPI DGFMSPTIKP EIPSPASNMV VSPQFVGNAL LQQEVQHPQQ
FHPQQQQKLS LHNSMLNTTN DVLNRLLEEQ NFKDTDSDNM SANSVNASRP NTAIGTGTFS
SNYLNEEYLM LGDIILHSKP TSPSPSNTSV SEYNTNTVSP NFSSQINYDD LNQPRRKVLQ
RLKDSRPFIS LGFSNESSQL PNLNSSNIAQ SSSIPTEYVS PLVTHHLYQS VQDIYTNNIM
NFDYPQSYHS LTHFLKKRFS GNNLPAEQKQ AKRQSLLVIL KLIASYRPTF ISAHKSLLKP
YDLQFLEMTF QRCLIDYEKL SQLNSSPTII WRRTGEIVSI TDDLLSLLGY NLADLLSHRT
FIMELMYDDE SITNYFRLFK TVAVGNLHSS IITKIKLTKN QNRNVSDQTG TRRLSYELSE
RDHIEFCSVW TVKRDMFDLP MMIIGQFLPI LPAGDGVRMY