Gene PICST_46395 details


Gene Information

Locus tag: PICST_46395
Symbol: LAC9
ID: 4839338
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Scheffersomyces stipitis CBS 6054
Kingdom: Eukaryota
Replicon accession: NC_009045
Strand:
Start bp: 418753
End bp: 421083
Gene Length: 2331 bp
Protein Length: 776 aa
Translation table: 12
GC content: 41%
IMG OID: 640390653
Product: lactose regulatory protein LAC9 and GAL4-like protein
Protein accession: XP_001385092
Protein GI: 150865754
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGTCT CCACACGATC ACCAAATACG CACCAGGCGT GCGATCTGTG TCGACTCCGA 
AAAATGAAAT GCTCCAAAGA GTATCCTCAG TGCCAAAAGT GCAAGGAACA GAATTGGAAA
TGTGTATATT CCCTTAAAAC CATAAGATCA CCGTTGACAA GAACTCACTT GCTGAAAGTG
GAAGACCGTG TGAAAGCACT AGAGAAGTTG CTTGTCCGGT TGCTTCCAGG AGATGTAGAG
ATAAATGACT TGCTCCGCGC TTCAGAATCC AACTCAGACA TCAAGGAAGA AGTAGAAACT
ATCGACAACC CGCAGTTCTA TAAGCAAATC TCTTTGACCA CTGAAGATTT GAGCAACATA
CCTATCACCT TCAAGAATAT TAACAAAATC AGTCGGGAAA AGGTCCAGAA AACTCCGCTG
GAACAGACTC TCGATTACCA GCCTGAAGAT TACTTGATAG ACTTGGAAAA GTCAGACTTG
AACCAGTATG ATGAAAGGGA AGACAGTCTC AACAATAACA TCAGCAATAT TGACCAGCCG
TTATACTCTC CTAATACTGA TGGAATGGCT GTTTTGTCGA ACGACATAGG ACTCAACTAC
GACTCACCTA AATCCAATGG TTATTTCGGG ATTAATTCTA CCAATGGTTT GCTTAAGTTT
CTTCTGTTGA AATCCAAGAA GACTGGTGGA AAAGATGTAG TCCTCAATTT AAACAACTTT
AGCTATAACG ATGATGAGGA AGAGGAAGAG GCAGCAACGG TGCTTGATGT CCATCTAAAC
GAAATATGGA AAGGAATCAA CTCTGGTAGA ATCGCTGACT TGTTGGACAA TGCAGCCTTC
CAGACTCTCG CTGTATCCAG CTATTTCGAT ATTTACCACA ATGCGTACCC GTTTGTAGAC
AAGTCGAAGT TCATGAAGCA GTTTAACGCC ATGATCAGCG GTGATAACCC CAGCGAGTAT
GACTATGCCA AGATAGAAGA CAACGAAAAG AAGTTGAGTT TCCATGTCCT ATTGAACACC
ATTCTTGCTA TAGGTATATG GTGTATCAGT GGAGAGAGCT CGCGTGTCCA CACATACTAC
TATCAGCGAG TAAAGAACTT ACTTCAGCTT ATCAACGTAT TCGAATACAG CGACAGCCAG
TTGTTCGTCA GTTACGTCTT GTTGAGCAAC TATGTCCAAA AGAATAACAA GCCCAATACA
GGCTGGAGTT ATCTAGGATT ATCTGCTAGG GTTGCAACAG CCTTGGGATT ACACAAGGAG
GTTAAACTTG ACCAGTTCAT AGACCACACT AATGGTGATA GCCCTAGAAC AAACTTAAAG
TTGTACAAGG AAATTGAGCA TAGAAAACGT CTTTGGTGGG GAATGTATTT TTTTGACGTC
GGAACAACGT TAACTTTTGG TAGACCGTTG ACAATTCCTG CTTTGAACAC TATTGACTTG
GAACCGGTTC TCAATATTGA TGATGATATT CTTAACTACG GCAACATGTC ACGAATAGAA
GACGCTGAGG TTAAGTATCC TACCATCTAC ACTGGTTTGA TTTACGAGTC AGAGTTAACT
AAAATATCCA CAAGAATATA CAACTACAAC TCATCAGTGC TCAAGTTGAA GAACGACTTG
TCCAAGATGA TCGGTTTGTT GGATATGAAC GAACTCTTGG AAGACTTTGT GGGGAAGCTT
CCCTTATATT TCAACCAGAA TGACGAAATT TCCACTCCGA ACTTGTACCA ACAATGGCAG
AATACCAAAT ATGCAGCACA GCCTATCCCC AAGTGGTTTT CGTTGACAAG ACTCAGATTA
AATTGTAGAA TCAAGAACTT GCAGATGTTG ATATTCAGAT ATATCCTCTG GGAGTCCAAC
GAAGGGTTTG AGGATCCTAA CTTTATTGCC TTGATCAAGA GATGCCGTAA CATATGTTTC
AAGTCTTCAG TAGAGACTAT TGAGATGGTT GCCAAGTTCT TGGAGAAATT TGAAATCGAT
CGCTTAACTG CCTGGTACTT GACGTACTTC TTGTTCCAAG CTGTTTTAGT TCCTATTTTG
AAACTTGGAA TTAAAGATAT CGGCTTGGAT AGAACAGATG AGGTCTACTA CAGAACCGAC
GATGTCATCT CCCGATATAT CGATATTTCT CAACGTTCAT TTAACAAATT GAAGCCTTAC
AACAAGTTGG CAGGCAAGTT CGTCAAGATC ATCGACATTC TTACGACAAA GGATAGAGAG
GCTACAATTA ACTACGAGAG CCTTTTTGCA ATTGAGCCAA ACAATGTGTC ATTGTTTGAC
AGTATGGAGG ATTTCTTCAA TTTCGAAAAC GATGTCATGC AATTTAAATA G
 
Protein sequence
MSVSTRSPNT HQACDSCRLR KMKCSKEYPQ CQKCKEQNWK CVYSLKTIRS PLTRTHLSKV 
EDRVKALEKL LVRLLPGDVE INDLLRASES NSDIKEEVET IDNPQFYKQI SLTTEDLSNI
PITFKNINKI SREKVQKTPS EQTLDYQPED YLIDLEKSDL NQYDEREDSL NNNISNIDQP
LYSPNTDGMA VLSNDIGLNY DSPKSNGYFG INSTNGLLKF LSLKSKKTGG KDVVLNLNNF
SYNDDEEEEE AATVLDVHLN EIWKGINSGR IADLLDNAAF QTLAVSSYFD IYHNAYPFVD
KSKFMKQFNA MISGDNPSEY DYAKIEDNEK KLSFHVLLNT ILAIGIWCIS GESSRVHTYY
YQRVKNLLQL INVFEYSDSQ LFVSYVLLSN YVQKNNKPNT GWSYLGLSAR VATALGLHKE
VKLDQFIDHT NGDSPRTNLK LYKEIEHRKR LWWGMYFFDV GTTLTFGRPL TIPALNTIDL
EPVLNIDDDI LNYGNMSRIE DAEVKYPTIY TGLIYESELT KISTRIYNYN SSVLKLKNDL
SKMIGLLDMN ELLEDFVGKL PLYFNQNDEI STPNLYQQWQ NTKYAAQPIP KWFSLTRLRL
NCRIKNLQML IFRYILWESN EGFEDPNFIA LIKRCRNICF KSSVETIEMV AKFLEKFEID
RLTAWYLTYF LFQAVLVPIL KLGIKDIGLD RTDEVYYRTD DVISRYIDIS QRSFNKLKPY
NKLAGKFVKI IDILTTKDRE ATINYESLFA IEPNNVSLFD SMEDFFNFEN DVMQFK
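
The gene and protein sequences above differ from a standard-code translation at CTG codons. The following is a minimal sketch, assuming that "Translation table 12" refers to NCBI genetic code 12 (the alternative yeast nuclear code), in which CTG is read as serine rather than leucine. The helper names `codon_table` and `translate` are hypothetical and are not part of the database or any library. The sketch also checks the gene and protein lengths against the listed start and end coordinates.

```python
from itertools import product

# Codon order TTT, TTC, TTA, TTG, TCT, ... paired with the standard code.
BASES = "TCAG"
STANDARD_AAS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def codon_table(ctg="S"):
    """Build a codon table; ctg="S" mimics table 12, ctg="L" the standard code."""
    table = {"".join(c): aa
             for c, aa in zip(product(BASES, repeat=3), STANDARD_AAS)}
    table["CTG"] = ctg  # the only codon that differs between the two codes
    return table


def translate(dna, table):
    """Translate DNA (whitespace ignored) until the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = table[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coordinates as listed in the record (inclusive on both ends).
START_BP, END_BP = 418753, 421083
gene_length = END_BP - START_BP + 1      # 2331 bp
protein_length = gene_length // 3 - 1    # 776 aa, stop codon excluded

# First row of the gene sequence as shown in the record.
first_row = "ATGAGTGTCT CCACACGATC ACCAAATACG CACCAGGCGT GCGATCTGTG TCGACTCCGA"

print(gene_length, protein_length)
print(translate(first_row, codon_table("S")))  # MSVSTRSPNTHQACDSCRLR
print(translate(first_row, codon_table("L")))  # MSVSTRSPNTHQACDLCRLR
```

With CTG read as serine, the first 20 residues match the start of the listed protein sequence (MSVSTRSPNT HQACDSCRLR); with the standard code, residue 16 would be leucine instead.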