Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_46395 |
Symbol | LAC9 |
ID | 4839338 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009045 |
Strand | - |
Start bp | 418753 |
End bp | 421083 |
Gene Length | 2331 bp |
Protein Length | 776 aa |
Translation table | 12 |
GC content | 41% |
IMG OID | 640390653 |
Product | lactose regulatory protein LAC9 and GAL4-like protein |
Protein accession | XP_001385092 |
Protein GI | 150865754 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGTCT CCACACGATC ACCAAATACG CACCAGGCGT GCGATCTGTG TCGACTCCGA AAAATGAAAT GCTCCAAAGA GTATCCTCAG TGCCAAAAGT GCAAGGAACA GAATTGGAAA TGTGTATATT CCCTTAAAAC CATAAGATCA CCGTTGACAA GAACTCACTT GCTGAAAGTG GAAGACCGTG TGAAAGCACT AGAGAAGTTG CTTGTCCGGT TGCTTCCAGG AGATGTAGAG ATAAATGACT TGCTCCGCGC TTCAGAATCC AACTCAGACA TCAAGGAAGA AGTAGAAACT ATCGACAACC CGCAGTTCTA TAAGCAAATC TCTTTGACCA CTGAAGATTT GAGCAACATA CCTATCACCT TCAAGAATAT TAACAAAATC AGTCGGGAAA AGGTCCAGAA AACTCCGCTG GAACAGACTC TCGATTACCA GCCTGAAGAT TACTTGATAG ACTTGGAAAA GTCAGACTTG AACCAGTATG ATGAAAGGGA AGACAGTCTC AACAATAACA TCAGCAATAT TGACCAGCCG TTATACTCTC CTAATACTGA TGGAATGGCT GTTTTGTCGA ACGACATAGG ACTCAACTAC GACTCACCTA AATCCAATGG TTATTTCGGG ATTAATTCTA CCAATGGTTT GCTTAAGTTT CTTCTGTTGA AATCCAAGAA GACTGGTGGA AAAGATGTAG TCCTCAATTT AAACAACTTT AGCTATAACG ATGATGAGGA AGAGGAAGAG GCAGCAACGG TGCTTGATGT CCATCTAAAC GAAATATGGA AAGGAATCAA CTCTGGTAGA ATCGCTGACT TGTTGGACAA TGCAGCCTTC CAGACTCTCG CTGTATCCAG CTATTTCGAT ATTTACCACA ATGCGTACCC GTTTGTAGAC AAGTCGAAGT TCATGAAGCA GTTTAACGCC ATGATCAGCG GTGATAACCC CAGCGAGTAT GACTATGCCA AGATAGAAGA CAACGAAAAG AAGTTGAGTT TCCATGTCCT ATTGAACACC ATTCTTGCTA TAGGTATATG GTGTATCAGT GGAGAGAGCT CGCGTGTCCA CACATACTAC TATCAGCGAG TAAAGAACTT ACTTCAGCTT ATCAACGTAT TCGAATACAG CGACAGCCAG TTGTTCGTCA GTTACGTCTT GTTGAGCAAC TATGTCCAAA AGAATAACAA GCCCAATACA GGCTGGAGTT ATCTAGGATT ATCTGCTAGG GTTGCAACAG CCTTGGGATT ACACAAGGAG GTTAAACTTG ACCAGTTCAT AGACCACACT AATGGTGATA GCCCTAGAAC AAACTTAAAG TTGTACAAGG AAATTGAGCA TAGAAAACGT CTTTGGTGGG GAATGTATTT TTTTGACGTC GGAACAACGT TAACTTTTGG TAGACCGTTG ACAATTCCTG CTTTGAACAC TATTGACTTG GAACCGGTTC TCAATATTGA TGATGATATT CTTAACTACG GCAACATGTC ACGAATAGAA GACGCTGAGG TTAAGTATCC TACCATCTAC ACTGGTTTGA TTTACGAGTC AGAGTTAACT AAAATATCCA CAAGAATATA CAACTACAAC TCATCAGTGC TCAAGTTGAA GAACGACTTG TCCAAGATGA TCGGTTTGTT GGATATGAAC GAACTCTTGG AAGACTTTGT GGGGAAGCTT CCCTTATATT TCAACCAGAA TGACGAAATT TCCACTCCGA ACTTGTACCA ACAATGGCAG AATACCAAAT ATGCAGCACA GCCTATCCCC AAGTGGTTTT CGTTGACAAG ACTCAGATTA AATTGTAGAA TCAAGAACTT GCAGATGTTG ATATTCAGAT ATATCCTCTG GGAGTCCAAC GAAGGGTTTG AGGATCCTAA CTTTATTGCC TTGATCAAGA GATGCCGTAA CATATGTTTC AAGTCTTCAG TAGAGACTAT TGAGATGGTT GCCAAGTTCT TGGAGAAATT TGAAATCGAT CGCTTAACTG CCTGGTACTT GACGTACTTC TTGTTCCAAG CTGTTTTAGT TCCTATTTTG AAACTTGGAA TTAAAGATAT CGGCTTGGAT AGAACAGATG AGGTCTACTA CAGAACCGAC GATGTCATCT CCCGATATAT CGATATTTCT CAACGTTCAT TTAACAAATT GAAGCCTTAC AACAAGTTGG CAGGCAAGTT CGTCAAGATC ATCGACATTC TTACGACAAA GGATAGAGAG GCTACAATTA ACTACGAGAG CCTTTTTGCA ATTGAGCCAA ACAATGTGTC ATTGTTTGAC AGTATGGAGG ATTTCTTCAA TTTCGAAAAC GATGTCATGC AATTTAAATA G
|
Protein sequence | MSVSTRSPNT HQACDSCRLR KMKCSKEYPQ CQKCKEQNWK CVYSLKTIRS PLTRTHLSKV EDRVKALEKL LVRLLPGDVE INDLLRASES NSDIKEEVET IDNPQFYKQI SLTTEDLSNI PITFKNINKI SREKVQKTPS EQTLDYQPED YLIDLEKSDL NQYDEREDSL NNNISNIDQP LYSPNTDGMA VLSNDIGLNY DSPKSNGYFG INSTNGLLKF LSLKSKKTGG KDVVLNLNNF SYNDDEEEEE AATVLDVHLN EIWKGINSGR IADLLDNAAF QTLAVSSYFD IYHNAYPFVD KSKFMKQFNA MISGDNPSEY DYAKIEDNEK KLSFHVLLNT ILAIGIWCIS GESSRVHTYY YQRVKNLLQL INVFEYSDSQ LFVSYVLLSN YVQKNNKPNT GWSYLGLSAR VATALGLHKE VKLDQFIDHT NGDSPRTNLK LYKEIEHRKR LWWGMYFFDV GTTLTFGRPL TIPALNTIDL EPVLNIDDDI LNYGNMSRIE DAEVKYPTIY TGLIYESELT KISTRIYNYN SSVLKLKNDL SKMIGLLDMN ELLEDFVGKL PLYFNQNDEI STPNLYQQWQ NTKYAAQPIP KWFSLTRLRL NCRIKNLQML IFRYILWESN EGFEDPNFIA LIKRCRNICF KSSVETIEMV AKFLEKFEID RLTAWYLTYF LFQAVLVPIL KLGIKDIGLD RTDEVYYRTD DVISRYIDIS QRSFNKLKPY NKLAGKFVKI IDILTTKDRE ATINYESLFA IEPNNVSLFD SMEDFFNFEN DVMQFK
|
| |