Gene CNL04450 details

Gene Information

Locus tag: CNL04450
Symbol:
ID: 3254727
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 233836
End bp: 237342
Gene length: 3507 bp
Protein length: 1097 aa
Translation table:
GC content: 49%
IMG OID: 638253916
Product: White collar 1 protein (WC1), putative
Protein accession: XP_567995
Protein GI: 58261170
COG category:
COG ID:
TIGRFAM ID: [TIGR00229] PAS domain S-box
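
The coordinate and length fields above can be cross-checked with a few lines of Python. This is a minimal sketch, not part of the original record: it assumes the start and end positions are 1-based and inclusive (consistent with the stated 3507 bp length), and the helper names `span_length` and `gc_fraction` are hypothetical.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_fraction(sequence: str) -> float:
    """Fraction of G/C bases in a sequence, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Start bp and End bp as listed in the record.
print(span_length(233836, 237342))  # 3507
```

Passing the gene sequence from the Sequence section to `gc_fraction` would give a value to compare against the listed GC content; the blocks of ten bases separated by spaces are handled by the whitespace stripping.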


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTCC AAACTCCTCC TGGAGTTCAA GGCAATTCGT TGGGGCCGCA TCGCTCGCTG 
AGTAGCTTGG AGAATGGCCA ACCAGGACGG GTAAGCGGTC ATAGCATGGC AACCCCTAGC
GTCGAGCCTA GCAGCTGTCC TGGAGATCAG TATCAGTCTT TTGCACAAGG AATTGCTCTC
TCTCAAGCCT CATCGTCAGC AATCAATTTT CAAGCGCCTC AACCACGAAT CGTAAGGCCA
AGTCAGTTCT CTACATCACA GCAGTACGCT ATAGACTCCT CCCCGCGGCA GAGTCAAGCA
TATACTTCAC CAAATTTCCA ATTTCATCCC TATGCTTCTC AACAATATTC GCATCATCCA
TCACCTCAAA TATCCTCTCC CCACACTCCC GCGGACCAGC TGTTCAAAGG CAAATCTCCA
GCCTCCAGTT CCTCACATAG CCACTCAAAT CTGACACCGG GTAGTTCGGT CCAGCCATCG
CATCCAGAAC CGCTTCGTAG AACCAGTGGC GACATGAGGC CGTCAGACGA TGGCGACAGT
GGTTTGTCTC TTGACCTGTC CAAATTTCCG AGGGATGTAC GCTTTCAGGT TCCTTCCTTT
CCGACAAATC GTGCGCCAGG AGCTCCTCCA GCTGGTAGTC AAGCTTGGAC TGAGTATTCA
AACACCTCTG TTCAGAGCTC TGATGCTTCT CAAACTGCGC CAGGAGGCTT GTTCGCACGC
GCAGCGTGTG TAGGAGGGCC AGCTTGGACC ATGGCTCCAG ATCAGGGGCG GCCCTTTGGC
TCAAATGAAA ACGCTGGTTG GAACAGGTGG GATCAAGCTC AAATGAAGCA GAACGGATTA
AGTATGGGTG TAGGTGTAGG AATGGCTATG GCTCAGATGG GAACGGTTTA CGTCAACCCT
AATTTGAATC CTGCCGTTTA TGGTCAAGGA ATCCAATCTT TTCCGTCTAA TGGTTATTAT
GGTGGTCAGT ATAATTTACA ACAAGGCCAG CATATGCAGC AGGAAAACGA GTCAGGTGGT
GGGGAGGTGG GAAGCAGAGA TATTGCGATG AACAGCCCCC ACCAATATGC GGCCATTTCA
GGGCCGATTT ATGGTCCCGG CCCACCTTCC AGTAATCACT TTAGCTCGCC TGGTAACCCT
CATCTACAGC CACATACATC GCCTTCATCA CATCGGGCGC CATCCCCCCG TCCGTCTCAC
GATCAAACGC ACGCCCCTAA CATGATGTCC ACCTCGGCTA TCCTTCCTCA GCGCCCTTCC
TATGATCAAA ATATCTCGGA TCATCGACTT CCCCATCAAA TTGACTTCTC CAGTGCCGCC
TCTGTATCTA ATCCAAACCC ACACCGTCGT GAGATGAACT TTACATTGTC ATCTACGACT
TCGTATGTAC ATTCAGCCGA TGGCCGAGCC CTTCTACAGC ACCCACTACT ATCTCCCGCC
CAACAAGCTT TACAAGATGG CCCAGGTTTG TATTCCACTA CCGGCTTTGA TATGCTTGGT
ATATTGGGAA GAGTAGCAGC CAGAAAGAAT CCCAGCATGG TGATTGGACC TGTCGACCTG
AGTTGTAGTT TCGTAGTGGT GGTGAGATAT TTGCGTCATT TGATAATTAC TTGACGCTGA
TGTAGTCTCC AGGATATAAG GAGATATGAC TCTCCCATTG TCTACGCTAG TCCGAATTTC
ACCAGGTTGA CTGGTTACGA GCTTCCCCAA CTTCTCGGCC GTAACTGCCG ATTCCTCCAA
AGTCCCGATG GAGAAGTGAC CAAAGGCTCC AAACGTGAAT ATACGGATAA TGAGGCTGTC
TATCTCCTCA AACGATCTCT CGAGGCTGGA AAGGAATGTC AAACCTCTCT TATCAATTAT
CGACGGAATG GCGAACCTTT CATCAACCTT GTCACTGTTG TGCCTATACC GTGGGACGGT
CCTGACATCG TCTACCATGT GGGCTTCCAA ATTGACTTGG TTGAACAGCC AAACAAAATC
CTCCGTAATA TGCAAGATGG GAACTATTCA GTGGACTACA CATATTCCAT CCCACCTGAA
AAACCTCTAC AGCTTGATGG AAAAGGTGCT GCCATAGCAG GCCTGAGTAC AGCCGTGCTG
GACATTATGG GCAGTAGGAC GAAGGCACTC GCGGCGGGTA CGGAAGAGGG GGCACGAATG
GAGTGGTTTA AGATGATTCT GGATAATACG GATGGTAAGT ATTGTGTCAC ATGTTGCCGT
CAATGTACAA GGCTAACACA ATTTAGACTT CATACATGCC TTATCACTCA AGGGTTTCTT
TCAGTACGCT TCATCTTCCA TTCGACGTTC TTTAGGGTAT GAGCCTGAAG ATCTGTTGAA
CAAAAACATC TCCGAATTCG CACATCCGTC TGATATCGTG CCCGTCATCC GTGCCCTAAA
AGATTCCACA CAAACGATTG GGGAGGAGCA GCAACCAAAA CCTGTTCATT TCACCTTTCG
CATCCGCACG AAGAACTTTG GGTACGTGTG GGTGGAAAGT ATGGGACGGC TTGTCGTGGA
AGCTGGCAAA GGTCGGAAGG CTGTCATCCT TTCTGGGAGA GTGAGGAACA TGCCCACTTT
GAGCTGGGGC AAGGTAGCAG AGCACGGTGG GCTCGCCAAA ACGGAGTTCT GGGCCAAGAT
TTCATTCCAA GGTTTACTCC TAAATCTCAC TTTTGGCGTT GACAAGGTCT TGGGCTACCA
AGCTGAAGAG ATTCTGGGTA GAAACTTATT TTCTCTTTTG CCTGGTGGCC AAAATACCCC
TCCGTCTGGC GAACATCTCA TTAATAACTA TGCTTCAGCT CCCGATATCG CTCCTGTTGC
CAAGGCAATC TATCAAACCA TACATGACAG TACCCACCGG GGCGCGTTGT CTATTCGGCA
AAAGATGGTA CATCGTTCTG GCCAGCCTGT CGACGTGATT CTGGTCTTTT ACGCTCCAGG
GCAAGCTAAA GACAAGCAAT CGCCCATTTC TTACTCAAAT AATGACGAAT CGGCTTCAGT
CAATTTTATT GGCACACCAA ACGGAGCTTC GACCTACCCG GTCAAGATTA CCGATATTTT
CGTCCAGGTC AAATTGCTGT CATCCTCCCC TTCGCATCAA CGCTCCGTCC AGCCCCCCTC
AATCGCTTCA CTCACTCAAG CCCGCCCCCT CGTCCATCTA CCCAACGACA ATATTTTTGA
AGAGCTGGAG ACTGGGAGAG ACTCTTCGTG GCAGTATGAG CTTCATCAGA TGAAGATGAC
AAATAGGAGG TTGAGAGATA GCATTGCGGC TCTCAAAGAG AAGAAGGCTG GGAAAGCGAA
GAAAAGAAAG TACAACATTG TGGATGAGAG TCCCGAAAAC CAATCAGAAA TCATTGATAC
GAGCTTAAGC TCGCAAATTG GGGGTATTGG ATTTTGATAT GTGCATGAGA GGCAAGCCTA
CTGCTACTTG CGTGATTTGC TGTATTGTCT TCAGTTTGTT TGTTGTACAG AATCGTCGCA
AGTTTTTAGC TTTGCATAGT TCAATGC
 
Protein sequence
MNFQTPPGVQ GNSLGPHRSL SSLENGQPGR VSGHSMATPS VEPSSCPGDQ YQSFAQGIAL 
SQASSSAINF QAPQPRIVRP SQFSTSQQYA IDSSPRQSQA YTSPNFQFHP YASQQYSHHP
SPQISSPHTP ADQLFKGKSP ASSSSHSHSN LTPGSSVQPS HPEPLRRTSG DMRPSDDGDS
GLSLDLSKFP RDVRFQVPSF PTNRAPGAPP AGSQAWTEYS NTSVQSSDAS QTAPGGLFAR
AACVGGPAWT MAPDQGRPFG SNENAGWNRW DQAQMKQNGL SMGVGVGMAM AQMGTVYVNP
NLNPAVYGQG IQSFPSNGYY GGQYNLQQGQ HMQQENESGG GEVGSRDIAM NSPHQYAAIS
GPIYGPGPPS SNHFSSPGNP HLQPHTSPSS HRAPSPRPSH DQTHAPNMMS TSAILPQRPS
YDQNISDHRL PHQIDFSSAA SVSNPNPHRR EMNFTLSSTT SYVHSADGRA LLQHPLLSPA
QQALQDGPGL YSTTGFDMLG ILGRVAARKN PSMVIGPVDL SCSFVVVDIR RYDSPIVYAS
PNFTRLTGYE LPQLLGRNCR FLQSPDGEVT KGSKREYTDN EAVYLLKRSL EAGKECQTSL
INYRRNGEPF INLVTVVPIP WDGPDIVYHV GFQIDLVEQP NKILRNMQDG NYSVDYTYSI
PPEKPLQLDG KGAAIAGLST AVLDIMGSRT KALAAGTEEG ARMEWFKMIL DNTDDFIHAL
SLKGFFQYAS SSIRRSLGYE PEDLLNKNIS EFAHPSDIVP VIRALKDSTQ TIGEEQQPKP
VHFTFRIRTK NFGYVWVESM GRLVVEAGKG RKAVILSGRV RNMPTLSWGK VAEHGGLAKT
EFWAKISFQG LLLNLTFGVD KVLGYQAEEI LGRNLFSLLP GGQNTPPSGE HLINNYASAP
DIAPVAKAIY QTIHDSTHRG ALSIRQKMVH RSGQPVDVIL VFYAPGQAKD KQSPISYSNN
DESASVNFIG TPNGASTYPV KITDIFVQVK LLSSSPSHQR SVQPPSIASL TQARPLVHLP
NDNIFEELET GRDSSWQYEL HQMKMTNRRL RDSIAALKEK KAGKAKKRKY NIVDESPENQ
SEIIDTSLSS QIGGIGF