Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_80055 |
Symbol | GAT3 |
ID | 4851196 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009068 |
Strand | + |
Start bp | 1171005 |
End bp | 1171964 |
Gene Length | 960 bp |
Protein Length | 219 aa |
Translation table | |
GC content | 46% |
IMG OID | 640392904 |
Product | GATA-family DNA binding protein |
Protein accession | XP_001387455 |
Protein GI | 126274182 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.440121 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0807232 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | CTTTTTGGTG TTCATTTCAA GAAAAACATT TATTGTTTCA ACATATTGAT TCTACAATAG TAACTTGACT ATTCAACTTC ATTCCTTTAC ACTTATATCT ATTCACACGA CACACTTCTA TCCTTACAAA CTCATATAAT GTCATCAACA ACTCCCACAT CGATTAGGTT GCCTTCGATA AGCGAATTAA CCTCCCGCTC GTCTCCAATC AGCGAAGCTC CCTCGCAGTT GTCTCCAAGA TTAAGAGCTG ATTCTAGAGT ACTTCCCAAC CTAGGTGGAT CTACTGGAAG CACATATTGG CATCCTTTGA AGGCGACTAC TCCTTCCAAT TACGTATTGG GGAACACTGG AACCACAAAC CCGAACAGCT TGCTGAAGTT GCCATCTCCA ACATTGCCAT ACTTCGATAA CAAGCAAGGT CCAAGTCCGC TTGCAGCTGC TCCACATCAC CAATTACCTA CGCCGCCTCT GCACCAAGCA TCAGCTCCAT CGTCCACATC TAGTCCTACT TCTCACTACC AATACTATTT ATACCACCAA TTGCCCGTCA TGCATCCTGG CGCATCGCCA GCCCCAGTGC AAGTGGCTCA ACTGGTTACA TATGCTACCA CCGCCCCTGG TGGATCCTTC TACCCACAAC CAGTGTACTA CCACCAAGCA CCAAATTCAT CTGTGCCAAT GCCAATGGGC CACACACAAT ATGCCATTCC AGAAGTGATA AACAAACCTA CGAACAAATG CCATAGATGC GGCACTACCG AAACTCCTGA ATGGCGTAGG GGCCCCAAGG GTGTCAGAAC TCTCTGTAAT GCATGTGGAT TGTTCCACGC TAAGCTTGTA AAGAGAAAGG GAGCAGCTTT AGCAGCCGAG GAGGTGCTAA ACAATAAAGT ATGTAAGGGC AAGAACGGAA GAAGGATTAG CATCAAGAAG CATTTGTTGA ACGAGAGCAT GAAGAACTCC
|
Protein sequence | MSSTTPTSIR LPSISELTSR SSPISEAPSQ LSPRLRADSR VLPNLGGSTG STYWHPLKAT TPSNYVLGNT GTTNPNSLLK LPSPTLPYFD NKQVAQLVTY ATTAPGGSFY PQPVYYHQAP NSSVPMPMGH TQYAIPEVIN KPTNKCHRCG TTETPEWRRG PKGVRTLCNA CGLFHAKLVK RKGAALAAEE VLNNKVCKGK NGRRISIKKH LLNESMKNS
|
| |