Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_32587 |
Symbol | GAT2.1 |
ID | 4839991 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009046 |
Strand | + |
Start bp | 171315 |
End bp | 172454 |
Gene Length | 1140 bp |
Protein Length | 379 aa |
Translation table | 12 |
GC content | 39% |
IMG OID | 640391306 |
Product | GATA-family of DNA binding protein-like protein |
Protein accession | XP_001385365 |
Protein GI | 150865945 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.527498 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCACCCC CTAGCGAAAT TTTGAACAAT GAAATAATGG CTGCAAGAAT TCTTGTGAAT GCGAAGTTTG CAAACAAAAA TTTGCAACCG TTGAATACAA CTTTCGTTTC TATCAATACC CCAATGGTTC AGCAGGAGGT TTCTTTGCAA AGAAGTACGG TAAGATGTCT TCCTCCACCA ACTCCGTCAC CTACACCACC ATATACTACC GACCCCAAAG CATATACCGC TCCTCAGCTA AACAAAAATA GAACTTGCTA CTATCACAGC CAAGCTAACT TCCACATCAT GAATCCTTAT TGCGCTAACT GTATTTATTG TGGTCATGAA TACAAGAACC AAATTGTTAG ACTTGCTAAT GAAGTGTCTA ATAATGCTAA CAACTACGCC AGCACTTTAT GGAATTACAA GAGCACTTTG AACAGAATAG TTGCAGCGAA CGCTCAAGAT CTGGCAGAAA GTGAGGAGCT TTATAACAAC TTCAATAATT TGTCTTACCA AGTAAAGGGT CATTTGCTGA CAATTGCAGC TACTTCTTCA TCAATTACAA ATATTGAATC TTCCGTCAAC AAGCTCAGAG AAATTTCTTC GGACAAAGAG TCAAACAATC CCATTACTCC TCCATTAGAA GAACAGACAC AAATTGTAAT ACACGAACCT AATGCACAGA AAGAAAGTGC ACAAATTCAG ACATTTCATT ATTCTGAACC ACAAGAGAAC CTTGTACAAA AGAAGCAGAT CGTTCAACTT TCTATGCCAG TTCCAACACA ATCTCCACAA ACTTTACAGA CAGAAACCAA AACGAAAGAG TCTTTAAAAC CCAAAAAAGG ACGCCCAATC TTACTTAAGA AAAGGGCTAA GGAACCGAGG AAATCTAAAA TCAATGTCAA AGTTTCGAAA TGCTCTCATT GTCAATCACA TAGCACACCG GAATGGAGAA GGGGTCCAGG AGGTGTTCGT TCATTGTGTA ACGCTTGTGG GTTGTTCTAC TCTAAGCTTG TTAAGAAGTT TGGAACAACA GATGCAAATA CCATCTTTCT ATCCAGAAAG GAGTCTGATA AATTGATAGA CAGAACTATT CCCACCACTC TACAGAGTAA TGCTATCTTA CAGAAACAAC TACGGCTAAT TCGAAAGTAA
|
Protein sequence | MAPPSEILNN EIMAARILVN AKFANKNLQP LNTTFVSINT PMVQQEVSLQ RSTVRCLPPP TPSPTPPYTT DPKAYTAPQL NKNRTCYYHS QANFHIMNPY CANCIYCGHE YKNQIVRLAN EVSNNANNYA STLWNYKSTL NRIVAANAQD SAESEELYNN FNNLSYQVKG HLSTIAATSS SITNIESSVN KLREISSDKE SNNPITPPLE EQTQIVIHEP NAQKESAQIQ TFHYSEPQEN LVQKKQIVQL SMPVPTQSPQ TLQTETKTKE SLKPKKGRPI LLKKRAKEPR KSKINVKVSK CSHCQSHSTP EWRRGPGGVR SLCNACGLFY SKLVKKFGTT DANTIFLSRK ESDKLIDRTI PTTLQSNAIL QKQLRLIRK
|
| |