Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNL04450 |
Symbol | |
ID | 3254727 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | - |
Start bp | 233836 |
End bp | 237342 |
Gene Length | 3507 bp |
Protein Length | 1097 aa |
Translation table | |
GC content | 49% |
IMG OID | 638253916 |
Product | White collar 1 protein (WC1), putative |
Protein accession | XP_567995 |
Protein GI | 58261170 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00229] PAS domain S-box |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTTCC AAACTCCTCC TGGAGTTCAA GGCAATTCGT TGGGGCCGCA TCGCTCGCTG AGTAGCTTGG AGAATGGCCA ACCAGGACGG GTAAGCGGTC ATAGCATGGC AACCCCTAGC GTCGAGCCTA GCAGCTGTCC TGGAGATCAG TATCAGTCTT TTGCACAAGG AATTGCTCTC TCTCAAGCCT CATCGTCAGC AATCAATTTT CAAGCGCCTC AACCACGAAT CGTAAGGCCA AGTCAGTTCT CTACATCACA GCAGTACGCT ATAGACTCCT CCCCGCGGCA GAGTCAAGCA TATACTTCAC CAAATTTCCA ATTTCATCCC TATGCTTCTC AACAATATTC GCATCATCCA TCACCTCAAA TATCCTCTCC CCACACTCCC GCGGACCAGC TGTTCAAAGG CAAATCTCCA GCCTCCAGTT CCTCACATAG CCACTCAAAT CTGACACCGG GTAGTTCGGT CCAGCCATCG CATCCAGAAC CGCTTCGTAG AACCAGTGGC GACATGAGGC CGTCAGACGA TGGCGACAGT GGTTTGTCTC TTGACCTGTC CAAATTTCCG AGGGATGTAC GCTTTCAGGT TCCTTCCTTT CCGACAAATC GTGCGCCAGG AGCTCCTCCA GCTGGTAGTC AAGCTTGGAC TGAGTATTCA AACACCTCTG TTCAGAGCTC TGATGCTTCT CAAACTGCGC CAGGAGGCTT GTTCGCACGC GCAGCGTGTG TAGGAGGGCC AGCTTGGACC ATGGCTCCAG ATCAGGGGCG GCCCTTTGGC TCAAATGAAA ACGCTGGTTG GAACAGGTGG GATCAAGCTC AAATGAAGCA GAACGGATTA AGTATGGGTG TAGGTGTAGG AATGGCTATG GCTCAGATGG GAACGGTTTA CGTCAACCCT AATTTGAATC CTGCCGTTTA TGGTCAAGGA ATCCAATCTT TTCCGTCTAA TGGTTATTAT GGTGGTCAGT ATAATTTACA ACAAGGCCAG CATATGCAGC AGGAAAACGA GTCAGGTGGT GGGGAGGTGG GAAGCAGAGA TATTGCGATG AACAGCCCCC ACCAATATGC GGCCATTTCA GGGCCGATTT ATGGTCCCGG CCCACCTTCC AGTAATCACT TTAGCTCGCC TGGTAACCCT CATCTACAGC CACATACATC GCCTTCATCA CATCGGGCGC CATCCCCCCG TCCGTCTCAC GATCAAACGC ACGCCCCTAA CATGATGTCC ACCTCGGCTA TCCTTCCTCA GCGCCCTTCC TATGATCAAA ATATCTCGGA TCATCGACTT CCCCATCAAA TTGACTTCTC CAGTGCCGCC TCTGTATCTA ATCCAAACCC ACACCGTCGT GAGATGAACT TTACATTGTC ATCTACGACT TCGTATGTAC ATTCAGCCGA TGGCCGAGCC CTTCTACAGC ACCCACTACT ATCTCCCGCC CAACAAGCTT TACAAGATGG CCCAGGTTTG TATTCCACTA CCGGCTTTGA TATGCTTGGT ATATTGGGAA GAGTAGCAGC CAGAAAGAAT CCCAGCATGG TGATTGGACC TGTCGACCTG AGTTGTAGTT TCGTAGTGGT GGTGAGATAT TTGCGTCATT TGATAATTAC TTGACGCTGA TGTAGTCTCC AGGATATAAG GAGATATGAC TCTCCCATTG TCTACGCTAG TCCGAATTTC ACCAGGTTGA CTGGTTACGA GCTTCCCCAA CTTCTCGGCC GTAACTGCCG ATTCCTCCAA AGTCCCGATG GAGAAGTGAC CAAAGGCTCC AAACGTGAAT ATACGGATAA TGAGGCTGTC TATCTCCTCA AACGATCTCT CGAGGCTGGA AAGGAATGTC AAACCTCTCT TATCAATTAT CGACGGAATG GCGAACCTTT CATCAACCTT GTCACTGTTG TGCCTATACC GTGGGACGGT CCTGACATCG TCTACCATGT GGGCTTCCAA ATTGACTTGG TTGAACAGCC AAACAAAATC CTCCGTAATA TGCAAGATGG GAACTATTCA GTGGACTACA CATATTCCAT CCCACCTGAA AAACCTCTAC AGCTTGATGG AAAAGGTGCT GCCATAGCAG GCCTGAGTAC AGCCGTGCTG GACATTATGG GCAGTAGGAC GAAGGCACTC GCGGCGGGTA CGGAAGAGGG GGCACGAATG GAGTGGTTTA AGATGATTCT GGATAATACG GATGGTAAGT ATTGTGTCAC ATGTTGCCGT CAATGTACAA GGCTAACACA ATTTAGACTT CATACATGCC TTATCACTCA AGGGTTTCTT TCAGTACGCT TCATCTTCCA TTCGACGTTC TTTAGGGTAT GAGCCTGAAG ATCTGTTGAA CAAAAACATC TCCGAATTCG CACATCCGTC TGATATCGTG CCCGTCATCC GTGCCCTAAA AGATTCCACA CAAACGATTG GGGAGGAGCA GCAACCAAAA CCTGTTCATT TCACCTTTCG CATCCGCACG AAGAACTTTG GGTACGTGTG GGTGGAAAGT ATGGGACGGC TTGTCGTGGA AGCTGGCAAA GGTCGGAAGG CTGTCATCCT TTCTGGGAGA GTGAGGAACA TGCCCACTTT GAGCTGGGGC AAGGTAGCAG AGCACGGTGG GCTCGCCAAA ACGGAGTTCT GGGCCAAGAT TTCATTCCAA GGTTTACTCC TAAATCTCAC TTTTGGCGTT GACAAGGTCT TGGGCTACCA AGCTGAAGAG ATTCTGGGTA GAAACTTATT TTCTCTTTTG CCTGGTGGCC AAAATACCCC TCCGTCTGGC GAACATCTCA TTAATAACTA TGCTTCAGCT CCCGATATCG CTCCTGTTGC CAAGGCAATC TATCAAACCA TACATGACAG TACCCACCGG GGCGCGTTGT CTATTCGGCA AAAGATGGTA CATCGTTCTG GCCAGCCTGT CGACGTGATT CTGGTCTTTT ACGCTCCAGG GCAAGCTAAA GACAAGCAAT CGCCCATTTC TTACTCAAAT AATGACGAAT CGGCTTCAGT CAATTTTATT GGCACACCAA ACGGAGCTTC GACCTACCCG GTCAAGATTA CCGATATTTT CGTCCAGGTC AAATTGCTGT CATCCTCCCC TTCGCATCAA CGCTCCGTCC AGCCCCCCTC AATCGCTTCA CTCACTCAAG CCCGCCCCCT CGTCCATCTA CCCAACGACA ATATTTTTGA AGAGCTGGAG ACTGGGAGAG ACTCTTCGTG GCAGTATGAG CTTCATCAGA TGAAGATGAC AAATAGGAGG TTGAGAGATA GCATTGCGGC TCTCAAAGAG AAGAAGGCTG GGAAAGCGAA GAAAAGAAAG TACAACATTG TGGATGAGAG TCCCGAAAAC CAATCAGAAA TCATTGATAC GAGCTTAAGC TCGCAAATTG GGGGTATTGG ATTTTGATAT GTGCATGAGA GGCAAGCCTA CTGCTACTTG CGTGATTTGC TGTATTGTCT TCAGTTTGTT TGTTGTACAG AATCGTCGCA AGTTTTTAGC TTTGCATAGT TCAATGC
|
Protein sequence | MNFQTPPGVQ GNSLGPHRSL SSLENGQPGR VSGHSMATPS VEPSSCPGDQ YQSFAQGIAL SQASSSAINF QAPQPRIVRP SQFSTSQQYA IDSSPRQSQA YTSPNFQFHP YASQQYSHHP SPQISSPHTP ADQLFKGKSP ASSSSHSHSN LTPGSSVQPS HPEPLRRTSG DMRPSDDGDS GLSLDLSKFP RDVRFQVPSF PTNRAPGAPP AGSQAWTEYS NTSVQSSDAS QTAPGGLFAR AACVGGPAWT MAPDQGRPFG SNENAGWNRW DQAQMKQNGL SMGVGVGMAM AQMGTVYVNP NLNPAVYGQG IQSFPSNGYY GGQYNLQQGQ HMQQENESGG GEVGSRDIAM NSPHQYAAIS GPIYGPGPPS SNHFSSPGNP HLQPHTSPSS HRAPSPRPSH DQTHAPNMMS TSAILPQRPS YDQNISDHRL PHQIDFSSAA SVSNPNPHRR EMNFTLSSTT SYVHSADGRA LLQHPLLSPA QQALQDGPGL YSTTGFDMLG ILGRVAARKN PSMVIGPVDL SCSFVVVDIR RYDSPIVYAS PNFTRLTGYE LPQLLGRNCR FLQSPDGEVT KGSKREYTDN EAVYLLKRSL EAGKECQTSL INYRRNGEPF INLVTVVPIP WDGPDIVYHV GFQIDLVEQP NKILRNMQDG NYSVDYTYSI PPEKPLQLDG KGAAIAGLST AVLDIMGSRT KALAAGTEEG ARMEWFKMIL DNTDDFIHAL SLKGFFQYAS SSIRRSLGYE PEDLLNKNIS EFAHPSDIVP VIRALKDSTQ TIGEEQQPKP VHFTFRIRTK NFGYVWVESM GRLVVEAGKG RKAVILSGRV RNMPTLSWGK VAEHGGLAKT EFWAKISFQG LLLNLTFGVD KVLGYQAEEI LGRNLFSLLP GGQNTPPSGE HLINNYASAP DIAPVAKAIY QTIHDSTHRG ALSIRQKMVH RSGQPVDVIL VFYAPGQAKD KQSPISYSNN DESASVNFIG TPNGASTYPV KITDIFVQVK LLSSSPSHQR SVQPPSIASL TQARPLVHLP NDNIFEELET GRDSSWQYEL HQMKMTNRRL RDSIAALKEK KAGKAKKRKY NIVDESPENQ SEIIDTSLSS QIGGIGF
|
| |