Gene PICST_31390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPICST_31390 
SymbolCHT4 
ID4838457 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameScheffersomyces stipitis CBS 6054 
KingdomEukaryota 
Replicon accessionNC_009044 
Strand
Start bp676543 
End bp677766 
Gene Length1224 bp 
Protein Length407 aa 
Translation table12 
GC content42% 
IMG OID640389772 
Productchitinase endochitinase 1 precursor 
Protein accessionXP_001384092 
Protein GI150865042 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3325] Chitinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.888541 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.285609 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTACACA AGCTCTCTTC TAAAGTGAGC GACAGAATTC ACAAGTACAA TCTGCTCAGC 
CAGGATAAAA ACCAACAGGC CTTTCAAATT GCTGGAAAAG ACGATTGCAA AGGGTATAAA
TCATGCGTAT ATTTCTCAAA TTGGTCCGTC TACGGACGTA AGCATTTTGC CATAGATATA
CCTGTAGAAT TTGTGACTCA TGTCTTTTAT GCTTTCATCA CTATCGATGC CAATACTGGA
AATGTCAAGT TCACAGACGA GTGGTGTGAT CTCCAATTGC CGTTGGAATC GCCTGTAAGT
TCCAATCAGA AAGTCACTGG TTCAATTCAG CAACTTTTCC AGATGAAGCA GCTTAATCGT
CATCTCAAGG TGGTGATGTC AATTGGAGGC TGGGGGACTG AACATTTGTT CCAGGCTGTG
ACGAGCGATC ATGCGAAGCT AGACAATTTT ATCAATAGTG CTGTGAAGTT TGTTTGTGAA
TACGGTTTTG ATGGAATTGA TATCGACTGG GAGTATCCTC GCAATACCCA TGAATGTAAA
CAGCTTGTAA AGTTACTTTC AGGATTGAAG CAGAAGTTGA ACCTCGTGTC TCCAGATTAT
TTGCTTACAA TTGCCTCACC TGGGGGCGAC GAAAATATTG AAGTTTTGGA CTTTCCAGAG
TTGGACAAGT ATCTTTCGTT CTGGAACGTC ATGTGCTACG ACTTCTGTGG AGAGGGCTGG
TCAACCAGAA CGGGGTATCA TTCCAACTTG TACGGCAATA ATGGGGATAA TAACTTGAGT
GCTAGTAACA TCATTGAAAA GTACATTCAG CATGGAGTTT CTCCACAAAA ATTGATTCTT
GGTATGCCAT TATATGGACG AGTATTCCAT GGAGCTCTGT CTCCGACTGT AGGTCATTCT
TTCACCAAAG AAATACTTCC TGGCTCTGTA AATGGTGATA CTTGTGACTA TAAGCTGTTG
CCTATTAGTC AGGAGAGTTT TGATGAAAAG ACGGGAAGCT GTAGCTACTA CGATAGCCAA
ACGAAACAAC TCTTTGTCTA CGATAATCCT CAGGTGGCTC GGATGAAGGC TGATTTTACT
AGTAAGTATA AACTTGGTGG AGGCATGTGG TGGGATTCAT GTGGAGATGT TGCTATCAAA
GAAAAGGAGA GATCTCTTAT TTATAACTAT ATCCAGCAGC TTGGGGGTAG TGCAGCATTA
GAGAAGACTC CCAACCATAT CTAG
 
Protein sequence
MLHKLSSKVS DRIHKYNSLS QDKNQQAFQI AGKDDCKGYK SCVYFSNWSV YGRKHFAIDI 
PVEFVTHVFY AFITIDANTG NVKFTDEWCD LQLPLESPVS SNQKVTGSIQ QLFQMKQLNR
HLKVVMSIGG WGTEHLFQAV TSDHAKLDNF INSAVKFVCE YGFDGIDIDW EYPRNTHECK
QLVKLLSGLK QKLNLVSPDY LLTIASPGGD ENIEVLDFPE LDKYLSFWNV MCYDFCGEGW
STRTGYHSNL YGNNGDNNLS ASNIIEKYIQ HGVSPQKLIL GMPLYGRVFH GASSPTVGHS
FTKEILPGSV NGDTCDYKSL PISQESFDEK TGSCSYYDSQ TKQLFVYDNP QVARMKADFT
SKYKLGGGMW WDSCGDVAIK EKERSLIYNY IQQLGGSAAL EKTPNHI