Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | PICST_67467 |
Symbol | NAF2.1 |
ID | 4838399 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Scheffersomyces stipitis CBS 6054 |
Kingdom | Eukaryota |
Replicon accession | NC_009044 |
Strand | + |
Start bp | 208884 |
End bp | 210392 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 12 |
GC content | 46% |
IMG OID | 640389714 |
Product | DNA-binding proteins Bright/BRCAA1/RBP1 |
Protein accession | XP_001384000 |
Protein GI | 150864968 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.088517 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 23 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACATAT TTGATTCAGC GTTGAAGAAT CCCGAACCTC CCAACAAACA CAACGAATAC AGCACCAATA ATAATAATTA TTCCAGTGAA AATATCAATC CTGATAATCC TAACAACCCA AATATCAATC CTAACAACAA TACTCTCAAT CCCAACACTT ACCATACTAA CCCCAATATT ATCAATACTA ACATAAACCC TAATAATCAA AATTTCAATA CCAACCCTAA CATAAATAAT CCTAATATCG GGCCTAACTC TGATATCAAC CACAACATTA ACCACAACTA CTATATGATG CAATCACAAC TTCCGCGTGC GCTTCCCACA TTGCCTCCAC TAACTGGCGG AGCCTATCCA CAGATGGTGG ACCAACTTCC TCTGATTCAA GACATAGGTG TGTCCAAAGA CGGTACTGGC AACGGTCTGA TCGGTAAGAA GAAACGAAAG AAGAATTCTC GTAGAAAACA CAGAAACTCC CACTTAGGCT GTGGAACTTG CAAAAAGCGT AGAATCAAGT GTGACGAGAC TTTGCCAGCC TGTCTCAACT GCCTCAAGGG AAAACTCCAT TGCGCCTATT TGAATCTCGA TTCAACCGGT AGAAATGCCT TGAGAATGGC CCAGTATAAC CAGAACTTGC GCCAAGATAA GCTAGACCAG CCTCTTAAAA AGGACAAAGA CTCTCTGCTG GACCAGAATA CTAATACAAA TTCTCCGTCA GATATCCTAC TCAAGTCTAC CAACAACTCT CCTCCAACTC TCCAAGTTAC CACGCACCAG CTTCCGACAC ATCCACAACA GCTTCAGCAA GTATATTCCA CTGGTCCGCT CCAGCAGCTG TACTCCATTG TTCAACAACT TCCTCCGGGA GCCGCTCCTC CACCTGGAGC TCCACAGACA GCTACTGTAA TACAATCTCC GTATGGACCA TTGGTGACAT TACAGCCAAT TCTAGCTAAC GGTGGTGTAG TATATACAAC AGCTCCTCTT GGGGCACAGC CTCAGGCTAT TCCAATCCAG TTAGTCCAGG CTCCTCCACA ACCTCAGCAT CAACCTATAC CGACACATCT AGTTACTGAA CCTGTACCGT CTCAGATGCC AACTCAAATG GCTTCTGGGC CTGTCCTGAT TCCCTCTGGA GCTATGCCTA TGGCCCTGAT TCCCAATGCC ATGAGTGGTA GAACCTATGC TGAGGTTCTG CCTCGTGGCA AGGAGGAAAT AGCACTTCCA CCTATTTCTG CTAAGAGTAC CAGCTACACT GATCTCAAGG CAGCCGAGCG ATTGGCTCAG TATGCGGATA AGTCGCCTGT TCTAACCAAC GACATGATCC AGGGCCATTC GCCCAGTGAC CAGCCTACTT CGTCGTTCTC CAATCTTACT ATCAGTGGTA GGAGAGATGA GGCTATCAGA CTCCCTTCCA TCAAATCGTT AACCACTTCC AGTAGCGACA ACTCCAACAA TGACGCTACC GAGGATAAGG TGCCCAGCAT CCTGAAGCTA CTCTCGTAG
|
Protein sequence | MDIFDSALKN PEPPNKHNEY STNNNNYSSE NINPDNPNNP NINPNNNTLN PNTYHTNPNI INTNINPNNQ NFNTNPNINN PNIGPNSDIN HNINHNYYMM QSQLPRALPT LPPLTGGAYP QMVDQLPSIQ DIGVSKDGTG NGSIGKKKRK KNSRRKHRNS HLGCGTCKKR RIKCDETLPA CLNCLKGKLH CAYLNLDSTG RNALRMAQYN QNLRQDKLDQ PLKKDKDSSS DQNTNTNSPS DILLKSTNNS PPTLQVTTHQ LPTHPQQLQQ VYSTGPLQQS YSIVQQLPPG AAPPPGAPQT ATVIQSPYGP LVTLQPILAN GGVVYTTAPL GAQPQAIPIQ LVQAPPQPQH QPIPTHLVTE PVPSQMPTQM ASGPVSIPSG AMPMASIPNA MSGRTYAEVS PRGKEEIALP PISAKSTSYT DLKAAERLAQ YADKSPVLTN DMIQGHSPSD QPTSSFSNLT ISGRRDEAIR LPSIKSLTTS SSDNSNNDAT EDKVPSISKL LS
|
| |