Gene PICST_30036 details


Gene Information

Locus tag: PICST_30036
Symbol: LAC4
ID: 4837088
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009042
Strand:
Start bp: 1783793
End bp: 1786915
Gene Length: 3123 bp
Protein Length: 1021 aa
Translation table: 12
GC content: 40%
IMG OID: 640388403
Product: Beta-galactosidase (Lactase)
Protein accession: XP_001382569
Protein GI: 150863923
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
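
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence is one codon per residue plus a stop codon. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 1783793
end_bp = 1786915
protein_length_aa = 1021

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus a stop codon.
coding_length_bp = 3 * (protein_length_aa + 1)

# Whatever remains would be non-coding (for example intronic) sequence.
noncoding_bp = gene_length_bp - coding_length_bp

print(gene_length_bp)    # 3123
print(coding_length_bp)  # 3066
print(noncoding_bp)      # 57
```

Under these assumptions the computed gene length matches the reported 3123 bp, and the 57 bp difference fits a gene that is flagged as spliced.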


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0855272
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTGACT ATACAAAGAA TTCGCTAGTA AAAGTACTTT CAGATCCACA GACTGTTCAT 
ACCAACAGAT TACCAACAAG GGCATACTAC CTCCCGTCTG AATCGACTTT GTCTTTGAAT
GGAGACTGGG ATTTTAGTTA TTTCGAAACT CCTCAGGAAG CCCCAATACC AGGAGACAAT
TTCGAAGACT TTAAAAAGAT TCGAGTTCCT GGCCATTGGC AATTACAAGG ATACGGAAGA
CCTCATTACA CCAACGTCGT CTATCCATTC CCTGTAACAC CCCCAAATCC ACCTTCGAAA
AATCCAACTG GGGTTTATCG CCATTCATTT GAAGTTCCAG AAGATTGGTC AAAAAAAGAT
TATGAATACA GACTCCGGTT CGAAGGAGTC GACAACTCAT ATCACTTATT TCTCAATGGT
AAACTCATTG GATACAACGA AGGAAGTAGA AATGCTGCTG AATTCGATGT ATCCGACTGC
ATCCACAAAA CAGGCAAGAA TGACTTGGTC ATTAGAGTTT ATCAATGGTC TAGCTCATCT
TACATTGAAG ATCAGGACCA ATGGTGGTTA AGTGGAATAT TTAGAGATGT CTACTTACTA
GGATTCAATA AGAAGGGCTA TATCAAGAAC TTTCAAGTTG CTACTGATTT GGACAAAGAG
TACAAGAATG CCGAATTGAG AATTAACTTG CAATTGAACA CTACTTCAGA TGTAAAGATA
TCTTTGCATG ACCCCACAAA GAATTTGATA TTTGAACAGA AGTTTGACAA ATTGGTTCCA
TCTTCTGAAC TCAAATTTCC AGTTTCAGAG CCTTTGAAAT GGACAGCGGA GTCGCCTTAC
TTGTACCTTC TTCGAATTGA GATTGTCGAC GAATTAGAGG CAAAGATTTC TTGTGTTGAA
CAACAGATCG GGTTCAGAAC AGTAGAAATG AAGAAAGGCT TGATCTGCGT CAATGGAGTT
CCAATTTTGA TAAGAGGAGT TAATAGACAT GAACACCATC CAAAGTTTGG CAGATCAGTT
CCTTTTGACT TTGTAGAAAG AGACTTAAAA CTTATGAAAG CTCACAATAT CAACGCCATT
AGAACTGCTC ATTACCCAAA TCATCCCAAG TTTTATGAAT TGGCAAATCA GCTAGGATTT
TGGGTTTTAG ACGAAGCCGA CTTAGAATGT CATGGATTCG TCGAAGCTGT ACGTATTCCA
CAGAATAAGG AGACACAAAT TCTGTACGAC GAAACCACTC GTCAACTATT CAAAGAAGCA
GCGGAATTCA CATCAAATAA CCCGTTATGG GAGAATGCTT ATATAGACCG TGCAAATCAA
TTGGTGCATA GAGATTGTAA TCAACCCTGC ATCATCATAT GGTCTCTAGG AAATGAAGCA
TTTTTCGGAC GGAATCATGC TAAAATGGCG AAAGAAATCA GGAGAAATGA TATTCAGAAC
AGACCAATTC ATTACGAAGG AGATTTGAAC GCTGAAGTTG CCGACATGTT TAGTAGAATG
TACATCACTC CTGATGAAGT TCTTGAGTAT ACTAAGCAGA AAGCAAAACC CTTGATTCTA
TGTGAATATG CTCATGCAAT GGGAAATGGA CCCGGACTTT TAAGACAATA CCAGGACTTA
TTCTATGAGC ATGAAATTCT TCAAGGAGGA TTTGTCTGGG AATGGGCAAA CCACGGATTG
GAAGATGTTG ATTCCAAAGG AAATGTAGTA TATAAATACG GCGGTGACTT CGGCGAGTCT
CCTCACGACG GTGTTTTTAT TTTGGACGGA CTTACGAATT CTGTCCATGA CCCAACTCCT
GGATTGGTGG AATATAAGAA GGTGATCGAG CCTGTAGTTA TTTTAATTGG AGAAGAAGAA
GTTTCCATCA AGAATACTTT TGATTTCATC GACTTGAATG AGTACACGGC TGAATACACA
TTTCTCGAAA TCATTGGATT AGATAGACAT GTTTTGCAAT CCGGAGATTT AGACATCTCT
AATTTGCAGC CCAAACAAAC AAGAAAACTA GCACTCCCAA CTTTGGAATC CAAACCTGAG
CCTGGGTCCA CTGTTATATT TCATATCATT ATCAAAACAA AGAAAGAAAC TCGCGGTTTA
CGTCGAGACC ATATTATCTC TTGGGCACAG AGAAAGATAC AACAGGGAAG CTCCAAAATT
CTCAAGCAAC CTGGTGCAAC TTTAAAATGC AAACAAGAAG GGAACAGTTT GCAGATCGAT
TCCGAAGGAT CCAAGTTGGT GTTTGATTTG GTTAAAGGGA GAATTAATTA TTGGGGATCA
AGCAAAGAAC TTTTCTTGAG CGATGAAATG GACCAAGGAA GTTTGACATT CTGGAGACCA
AGCATCAATA ATGATGCTAC CAAAGATGCA CCATACTGGA AATCATTTGG CTTGGACAAG
ATGCAAAACC ATGTTCGTGA TGTTAGAGTT CAAAAACAAA ACCAATTTCA AGTAACCATC
GAAGTAGATT CCTTTGTGGC TCCTCCTATA TTAGCCTGGG GATTTGAAGT GAAACAGGTT
TACGAAGTGC TTGACAAAAA GATCAAATTA ACCACTTCTT TGAAGCCAAT TGGCCACAAG
GATGAGTTCA TTCCCAAGAC AATTCCTCGT CTAGGTTATC AGTTCATTAT TTCTGACAAA
CTAGGATCTA ACGTGAGATG GTTTGGGCGA GGTCCTGGAG AAAGCTATAG TGACAAAAAG
GAAGGACAAT GGTTCGATGT TCATAGACTT CCGCTTGACA AATTGGATTA CAGCTACGAT
TACCCACAAG AAAACGGAAA CCATGAAGAT ACCGATTGGG TTCTTCTTGA ATCAAAAGAA
GAAGTCAAAG GCACACAAGC AGAAAATGGA GCTAAGAGTG AGTGTGGTGC AGCCCCAAAT
GTGTCTAATG CAGTGTTAAT TAGCTCATCT AGAGCTTTCG GATTCAAGGC GTCGGATAGC
TGGCGAGTAG ACGAGGCACA GCATCCATCT GACATAGTTC ATGATAGACG GTTTATTCGG
TTAGACTACA AGCAACATGG AGTTGGAACC GAGGCTTGTG GTCCTGGTCC ATTAGCCGAA
TATCAATTTA GACTTAACGG TCCAATAGAG TTCGAGTTCA CACTAGACAT GATAACGAAC
TAG
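
The gene sequence above is laid out in space-separated blocks of ten bases, so the spacing has to be removed before the length or GC content can be checked against the reported fields. The following is a minimal sketch of that cleanup. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is used as a sample.

```python
def clean_sequence(block: str) -> str:
    """Drop spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C among the bases of seq (0.0 for an empty string)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a small sample.
sample = "ATGATTGACT ATACAAAGAA TTCGCTAGTA AAAGTACTTT CAGATCCACA GACTGTTCAT"
seq = clean_sequence(sample)

print(len(seq))                        # number of bases in the sample
print(round(100 * gc_fraction(seq)))   # GC content of the sample, in percent
```

Passing the full blocked sequence to `clean_sequence` instead of the sample would give values to compare with the Gene Length and GC content fields.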
 
Protein sequence
MIDYTKNSLV KVLSDPQTVH TNRLPTRAYY LPSESTLSLN GDWDFSYFET PQEAPIPGDN 
FEDFKKIRVP GHWQLQGYGR PHYTNVVYPF PVTPPNPPSK NPTGVYRHSF EVPEDWSKKD
YEYRLRFEGV DNSYHLFLNG KLIGYNEGSR NAAEFDVSDC IHKTGKNDLV IRVYQWSSSS
YIEDQDQWWL SGIFRDVYLL GFNKKGYIKN FQVATDLDKE YKNAELRINL QLNTTSDVKI
SLHDPTKNLI FEQKFDKLVP SSELKFPVSE PLKWTAESPY LYLLRIEIVD ELEAKISCVE
QQIGFRTVEM KKGLICVNGV PILIRGVNRH EHHPKFGRSV PFDFVERDLK LMKAHNINAI
RTAHYPNHPK FYELANQLGF WVLDEADLEC HGFVEAVRIP QNKETQISYD ETTRQLFKEA
AEFTSNNPLW ENAYIDRANQ LVHRDCNQPC IIIWSLGNEA FFGRNHAKMA KEIRRNDIQN
RPIHYEGDLN AEVADMFSRM YITPDEVLEY TKQKAKPLIL CEYAHAMGNG PGLLRQYQDL
FYEHEILQGG FVWEWANHGL EDVDSKGNVV YKYGGDFGES PHDGVFILDG LTNSVHDPTP
GLVEYKKVIE PVVILIGEEE VSIKNTFDFI DLNEYTAEYT FLEIIGLDRH VLQSGDLDIS
NLQPKQTRKL ALPTLESKPE PGSTVIFHII IKTKKETRGL RRDHIISWAQ RKIQQGSSKI
LKQPGATLKC KQEGNSLQID SEGSKLVFDL VKGRINYWGS SKELFLSDEM DQGSLTFWRP
SINNDATKDA PYWKSFGLDK MQNHVRDVRV QKQNQFQVTI EVDSFVAPPI LAWGFEVKQV
YEVLDKKIKL TTSLKPIGHK DEFIPKTIPR LGYQFIISDK LGSNVRWFGR GPGESYSDKK
EGQWFDVHRL PLDKLDYSYD YPQENGNHED TDWVLLESKE EVKGTQAENG AKTFGFKASD
SWRVDEAQHP SDIVHDRRFI RLDYKQHGVG TEACGPGPLA EYQFRLNGPI EFEFTLDMIT
N
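
The record lists translation table 12, the alternative yeast nuclear code, in which the codon CTG is read as serine rather than leucine. The sketch below illustrates that difference on a short fragment of the gene sequence, ATTCTGTACGAC, which corresponds to the residues ISYD in the protein sequence above. The `translate` helper and the table names are hypothetical, and the codon tables are built by hand rather than taken from a library.

```python
BASES = "TCAG"

# Standard code (table 1) amino acids, listed in NCBI TCAG codon order.
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]

STANDARD_TABLE = dict(zip(CODONS, STANDARD_AAS))

# In amino acid assignments, table 12 differs from the standard code
# only at CTG, which is read as Ser instead of Leu.
TABLE_12 = dict(STANDARD_TABLE, CTG="S")


def translate(dna: str, table: dict) -> str:
    """Translate dna codon by codon, ignoring whitespace and a trailing partial codon."""
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3
    return "".join(table[dna[i:i + 3]] for i in range(0, usable, 3))


fragment = "ATTCTGTACGAC"  # taken from the gene sequence above

print(translate(fragment, TABLE_12))        # ISYD
print(translate(fragment, STANDARD_TABLE))  # ILYD
```

Because the gene is flagged as spliced, translating the full gene sequence directly would not be expected to reproduce the protein sequence. Any intronic bases would need to be removed first.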