Gene CNK00590 details

Gene Information

Locus tag: CNK00590
Symbol:
ID: 3254625
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006680
Strand:
Start bp: 193872
End bp: 197499
Gene Length: 3628 bp
Protein Length: 916 aa
Translation table:
GC content: 52%
IMG OID: 638253548
Product: fork head homolog XFD-2, putative
Protein accession: XP_567623
Protein GI: 58260426
COG category: [K] Transcription
COG ID: [COG5025] Transcription factor of the Forkhead/HNF3 family
TIGRFAM ID:
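
The length fields above can be cross-checked against the coordinates with a little arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes the start and end positions are 1-based and inclusive, which matches the reported gene length. The function names `gene_length` and `coding_length` are hypothetical helpers introduced for this example.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding bases needed for a protein of the given length."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


if __name__ == "__main__":
    # Values copied from the record above.
    print(gene_length(193872, 197499))  # 3628, the reported gene length
    print(coding_length(916))           # 2751 coding bases incl. stop codon
```

The 916 aa protein needs only 2751 coding bases, fewer than the 3628 bp gene span. That is consistent with the "Is gene spliced: Yes" field, since the remainder can be taken up by introns and untranslated regions.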


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CGGTCCTCGA CCATACTATC CTTACTCACT TTTGCCACCC ACAATTACAG CACAGCCATC 
GCCACCGCGC CGCACCGTCC GCCTACAACG ACAGTTACTA CGGCAAGGCC AATCACTACG
CGTCTTGAGC AGCTCTTTCA TCCCGTCCGC TTTATCGGGC TTTCAACCTT TCCTCCCTCC
CTTCCTCTTT GACGGCACCG CGTCTATCAC ACGGACCGGG ACATCCATAT AGACATACAA
AACGCGTCTG CTATACTGGA GCACGACTCT TCTGCCCAGT ATGTTTTCTG CCCGTCATTT
TTTTTGCCGC TCCACCATTC GACGCGTCTC TACTAGAAAG CTGATCAGTC TTTTCCTCTT
TAGATACCAT TTGCTCCAGT TTACAGCCCC GCTGGACGGG AAGAGTAACA CCGCCTATTT
ACAACCTCGA GCAAAATTTT CGCCAAATGT CGCGTCCTAC GAGCTCGGAC TCTCAACATT
CTGGCAGCAT GTCCTCCCAT AATGTCATTC AGAATGACAT GTAAGTCTTC GACGCGTCTT
TCCATCCTCC ACTCACCTTT TAGCTCACCA TACTCTTTAG CACCCATGAA AATAGCTTCA
TGACAACTCC CACGTCCGCA TCTTATCAAA CTCAAGTCTA TTATGCCGAC CCTTACGCGC
ACGCGTCATT CCAACAGTAC CAATCGGGCG CGTATGGTGG AGAGATGCGG GAGCATATGA
TCACGCCGTA CGGTGGACAT CAGGTGGCTC AAGGAAGGAT TGCGATGCCG ATGCACGGCC
AGCCGTTGCA TCGCTCGCCC AATGGCGCGT ACGAGGAGTT TGAGTTCACG CCCTTTGTGC
CAGAAGAGGA GACCACACCG TATCTGCAGG TACCTGAGAG GTATCACGCC ATGGTCCAGT
CGCCCAGCAC GACTATGGGC CAATTCACCC ATCCGCTGTC ACCTGTTGAC CAGATGACAA
TGGACCAACA ACATCATTTT CGGCAAAGTA ACTCGTTGGG CTCTCAACCT CAACAGTTCA
TCGTTCAAAC CGGCCGCCCA CAACGACCCA AGCTCCAGAC TAGCCAATCT ATGATCGTAT
CCACGCGACC CAGTACCCAA ACTTCTATGC AGCGACCAGG GATGACGCGA CACGCATCTC
TTCGGGGCAC GACGCGCCCT GATAGTCCGT ATGTGGCCGG AGACGTTTTT GATCACGTTC
CTGGGCATCT GGAAGAATCG CCAATTTACC AGCCAATCCC TCCTCAGGAT CCATCTTGGG
AGCTTTCTGC GTTTGAACCT AATGTAAGCA AAATTTTGGT TTTAAAACAG ACACACACTA
ATAACTTGCT CTAGTATGCC AACGGCCAGG GTATCTCCCC CGCTCGTGCT CTGGGTCCCG
CGCCTCCGCA GCCACAACGT TTCTCTCCAC GCCACGACGA GTTGATGGTC ACCCCTCAGG
CGAACAAGAT TGCATACTCT GAACACCCTC CGTCATCCGC CATTTCCACT GCTGCGTCTA
CATCCTCCAG CTTTTCAGTC ACTATGAAGC ATCAACGACG CGAATTTGAC AGTAGCGATG
ATGAGCAAGA AGGTCGTACG CGTAAACCAT TGCCTTCGCG CACGGTCAAG ACGCGTCAAG
AAGAAAAGCT TCCCCCGCCG CCTCTCCCTC TCCCGACGCG TCCTCCAAAG GCGTCCAGTG
CACGGGCGCC CGTCAAGCCT GCAAAGGTTG CCGAAGACCC CGGCGTCGAA GGTGTACCTC
CAGGGCCAAG ATCACACGAA CGGCCTGGTC CATCTTTTGC TTGCATCATC GGTCAAGCCA
TTTTATCATG CAAGGCTGGT GGCCTTTCGC TCGAGCACAT TTACCGATAT GTAGAGACAG
CTTATCCCTA CTTCAAATCT GGCGATAACG CGTGGCGCAA CTCGGTCCGA CATAACCTAT
CTATCCACAA GATGTTTGAG ACCATTCCGC GAACTGAAAA ATTCCCTCCT GGTAAGGGTG
GTATTTGGAT TATTCACGAG GATGAAAAGT GCCACTGGCC AGCGCAGGAC AAATTTATCA
AAAATTTTCC TCCTGGTCAC CCTCATCACG ACGTGTGTCG CCAGACGTTG CATGAGAGGC
AAAAGGAAAA GGACGCTATG GAGAAGGCCG CGAGGGAAGG GAGGGTGTAT GTACCCAAGA
AGGGAAAGAA GAGAAGAAAG CAAATGTTGA AAGAAGAATT GGAGGCTGAA GTGGCTAAGC
GATTAGTCAT GGGAAATTCA GGAGCTGTGA CCAGTGAGCA AAAGGTTGAG GAAGAAAAGG
AACAGACACC CGAGCCTGTC GAACAAGAGA CTATTAGCGA AGAGCCTGTT GTTACCGAGC
CTGAAGCTCG ACCAGCTCCC GAGCCTGAGC CTACTGCCGA AACCGAAGGT GCCTCCAAAC
CCCAAACCAA AAAGAACACA CCCATGCCAC CTCCCGCTGC CAAGCTTGGA AAGTGGGCCT
TGCCTCCGCC TCCTGTAGAT CCCAAAGGAA AAAGGAAACA AGCAGAGTCT GAAGATGATC
CCCTCTTCTC CACCTCCAAG CGCGTCCGTA TGGCTGAGCC TCTTGCCCCT ATCCATCCAT
TCCCCCAAGA AACTGTACCG GGCAAGGCTG AAAAGTATGA CGCCTCATTC GTCACTCCTG
AAAGGGAAAG GCCCATCCCT AATGGTAGCA AACTCTTATC AAGCGCGAGT GATTTCAAGA
CACCCGCCCT TGTCCAATCA TCTTCATCAC CAGGATCTCC ACCTATGCCT GCCACCGTTA
CCCGTCCAAC CCACCACCCC AGCAGTCTGC AACAGGCATG GACACATGAC GACATGTCAC
AAACCCCTCC TCGCGATTCA TCACCTGCTA GACCGATGCT TGATGCGGCA TTTGATTTGA
AGCCCAAGTC TTTACGTACA AAGCAAGTCG CGCAGGAGGA TGAATTCCCA CATTCTCACA
CTCACCTCGC TTCCCCTCCA CATCCTCGTG GGCCTCCCAA AACACCTGTT ACCCGCTCTT
CGGCGGCAGC AGACAAAACT CAGACTCCCC GCTTACATCA TCGCAAAACA CCAAGCATGT
CGACCGTCAC CCCTGTCGTC TTCCGAGATA GCCCTGGTCT TCCGCCACCA ACGTCGAGTG
CTTTACTCTC TACACCCATG TGGGAGATTG GTGGCTGCTT GGATAGGCTG AAGGACCATT
TTGCGCCTTC ACCCACATCC AGTATTCACC CGATCCGGTC ACCTGCCCCG CCAACGAGTC
CTACGAGGTA TGCGATGATG TTGATGGACA CTGGCAGTTC TCCGAGAAAA GGAAAGAGTG
CAAGCTAAGT GATGAACGGC CGCTCGTCTG TTTGGCGAGT AGAGAGACAT GTGATTGAAG
AATGGGCGAT GGACAAGGAT TGTCATGATT TCTTTTTTTT TTCTCGTTTT TGACGTTTAT
ATATCATGAC CCCATAATAT ACATGGCTAG ATACCAGTTT TGCGTTTCTT TTCCATAGTG
ACTAATTTGT AGCTTAGCGT GCGTACTCAG CGTTACATAC GAGTACATCC AATCTTTTCA
TCTCTTGAGC AATTCACTCC ATTACCACCC ATTTCGCCCT CAAGATTATA GTTTAGCAAT
TCGATTATGT TGTTGATTTT TTGTCGAT
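
The sequence is displayed in space-separated blocks of ten bases, so the whitespace has to be removed before the sequence is measured or analysed. The following is a minimal sketch of that clean-up together with a GC fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and how the database itself derives its "GC content" field is not stated in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces, newlines) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


if __name__ == "__main__":
    # First displayed line of the gene sequence; paste the full block
    # here to compare the results with the Gene Length and GC content fields.
    block = "CGGTCCTCGA CCATACTATC CTTACTCACT TTTGCCACCC ACAATTACAG CACAGCCATC"
    seq = clean_sequence(block)
    print(len(seq))
    print(f"{gc_fraction(seq):.2%}")
```

Applying `clean_sequence` to the protein sequence block in the same way gives a string whose length can be compared with the Protein Length field.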
 
Protein sequence
MSRPTSSDSQ HSGSMSSHNV IQNDITHENS FMTTPTSASY QTQVYYADPY AHASFQQYQS 
GAYGGEMREH MITPYGGHQV AQGRIAMPMH GQPLHRSPNG AYEEFEFTPF VPEEETTPYL
QVPERYHAMV QSPSTTMGQF THPLSPVDQM TMDQQHHFRQ SNSLGSQPQQ FIVQTGRPQR
PKLQTSQSMI VSTRPSTQTS MQRPGMTRHA SLRGTTRPDS PYVAGDVFDH VPGHLEESPI
YQPIPPQDPS WELSAFEPNY ANGQGISPAR ALGPAPPQPQ RFSPRHDELM VTPQANKIAY
SEHPPSSAIS TAASTSSSFS VTMKHQRREF DSSDDEQEGR TRKPLPSRTV KTRQEEKLPP
PPLPLPTRPP KASSARAPVK PAKVAEDPGV EGVPPGPRSH ERPGPSFACI IGQAILSCKA
GGLSLEHIYR YVETAYPYFK SGDNAWRNSV RHNLSIHKMF ETIPRTEKFP PGKGGIWIIH
EDEKCHWPAQ DKFIKNFPPG HPHHDVCRQT LHERQKEKDA MEKAAREGRV YVPKKGKKRR
KQMLKEELEA EVAKRLVMGN SGAVTSEQKV EEEKEQTPEP VEQETISEEP VVTEPEARPA
PEPEPTAETE GASKPQTKKN TPMPPPAAKL GKWALPPPPV DPKGKRKQAE SEDDPLFSTS
KRVRMAEPLA PIHPFPQETV PGKAEKYDAS FVTPERERPI PNGSKLLSSA SDFKTPALVQ
SSSSPGSPPM PATVTRPTHH PSSLQQAWTH DDMSQTPPRD SSPARPMLDA AFDLKPKSLR
TKQVAQEDEF PHSHTHLASP PHPRGPPKTP VTRSSAAADK TQTPRLHHRK TPSMSTVTPV
VFRDSPGLPP PTSSALLSTP MWEIGGCLDR LKDHFAPSPT SSIHPIRSPA PPTSPTRYAM
MLMDTGSSPR KGKSAS