Gene PICST_61722 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_61722 
SymbolHYR6.4 
ID4840201 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009046 
Strand
Start bp1696495 
End bp1697718 
Gene Length1224 bp 
Protein Length408 aa 
Translation table12 
GC content42% 
IMG OID640391516 
Producthyphally regulated cell wall protein 
Protein accessionXP_001385682 
Protein GI150866178 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones35 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTATTTT TGAAACTTTT GGTGAGGACA CTTGCTTGTG TGGCAGCTGT TCTTGCTATT 
GATATAACCT CCCCAAGAAT TGATAGAGGT GTTGTCAACC TTTCCATCGG AGACATCACG
ATTGAATCCG GAGCTTACTG GTCCATCGTA GACAATGCTG TATCTGTACT TGCTGGTGAC
TTAGATGTCA AGGATGACGC TGGATTCTAC ATTACTTCGA CACAGAGTTT AATTGGCTTA
TCTGTTACCC TTGCTTCTGG ATTGGGATCG ATCACTAATG ATGGTATAAT TGCATTTAAT
TCGGTTGTCT CGTTGGTTGC TCCAAATTAC AACTTGATTG GTCTCTCATT CACGAACAAC
GGAGAAATGT ACTTGGGTAC TAACGGCTCT GTTGTTGGTG CTCCACCAAT TAACCTCGTT
GCCCCAAGCT GGACTAACAC AGGTTTGTTG GTGATCTACC TGCAAACAAG AAGTGCAGAT
GGTATTGTCA ATCTCGGAGG AACTGGATTG GTTATTCAAA ACAATGGTCA GATCTGCTTA
ACGAATGAGC TTTACCAAGC TTCCACTCAA ATTCGTGGTA CTGGTTGTAT AACCGCAAAT
GTAGACTCTA CTGTTTTCTT GGCTAATGGT TTGTTAGGCG TTGATCAATC ACAAACGTTT
TACTTAGCTG ACTCCAGATC ATCTATTAGA GCTAATGCTG TTGCTGTTCC TCAAACGTTC
ACTATTGCTG GTTTTGGTAA TGGTAACATT ATCGGTTTAG ATATTCCACT TGCAACTGTT
TTTCCGCTAA GCTCGTGGAG TTATACCTCA AGTACTGGTA TCTTGACACT CAGAGGTCTT
GGTTTATTGT CTCAAAACTT CAACATTGGT CCAGGATATA ATAGCAATTT ATTTTCGATC
ACAACGGATA GCAGTCTTGG ATTGGCCAGT GTTCCTTTAG GTGGCCTCAC TTACAGTGGT
CCAGTACCAA ATGCAATTCC TTCGAACTGC CAACCTTGCA AGAATTTGCC TAGTGCACCT
GGTACATCTG CAAGTGTCAC TTCCACTTCT TTCACTTCTA CCAAGTCTGA TGGATCAATT
TGTACTGATG TTGATCAAAT CCTCATTTCC ACCGATGCAC AAGGTTCTTG GTTCACATCT
ACTTCACTTG TATCTGAAGT TTGCAGCACT ATCCCTAACT CTCAGACAAC AGAAACGTCT
ACTTGGACCG GAACTACTAC TAAG
 
Protein sequence
MLFLKLLVRT LACVAAVLAI DITSPRIDRG VVNLSIGDIT IESGAYWSIV DNAVSVLAGD 
LDVKDDAGFY ITSTQSLIGL SVTLASGLGS ITNDGIIAFN SVVSLVAPNY NLIGLSFTNN
GEMYLGTNGS VVGAPPINLV APSWTNTGLL VIYSQTRSAD GIVNLGGTGL VIQNNGQICL
TNELYQASTQ IRGTGCITAN VDSTVFLANG LLGVDQSQTF YLADSRSSIR ANAVAVPQTF
TIAGFGNGNI IGLDIPLATV FPLSSWSYTS STGILTLRGL GLLSQNFNIG PGYNSNLFSI
TTDSSLGLAS VPLGGLTYSG PVPNAIPSNC QPCKNLPSAP GTSASVTSTS FTSTKSDGSI
CTDVDQILIS TDAQGSWFTS TSLVSEVCST IPNSQTTETS TWTGTTTK