Gene Information |
Locus tag | CNK00590 |
Symbol | |
ID | 3254625 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | - |
Start bp | 193872 |
End bp | 197499 |
Gene Length | 3628 bp |
Protein Length | 916 aa |
Translation table | |
GC content | 52% |
IMG OID | 638253548 |
Product | fork head homolog XFD-2, putative |
Protein accession | XP_567623 |
Protein GI | 58260426 |
COG category | [K] Transcription |
COG ID | [COG5025] Transcription factor of the Forkhead/HNF3 family |
TIGRFAM ID | |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGGTCCTCGA CCATACTATC CTTACTCACT TTTGCCACCC ACAATTACAG CACAGCCATC GCCACCGCGC CGCACCGTCC GCCTACAACG ACAGTTACTA CGGCAAGGCC AATCACTACG CGTCTTGAGC AGCTCTTTCA TCCCGTCCGC TTTATCGGGC TTTCAACCTT TCCTCCCTCC CTTCCTCTTT GACGGCACCG CGTCTATCAC ACGGACCGGG ACATCCATAT AGACATACAA AACGCGTCTG CTATACTGGA GCACGACTCT TCTGCCCAGT ATGTTTTCTG CCCGTCATTT TTTTTGCCGC TCCACCATTC GACGCGTCTC TACTAGAAAG CTGATCAGTC TTTTCCTCTT TAGATACCAT TTGCTCCAGT TTACAGCCCC GCTGGACGGG AAGAGTAACA CCGCCTATTT ACAACCTCGA GCAAAATTTT CGCCAAATGT CGCGTCCTAC GAGCTCGGAC TCTCAACATT CTGGCAGCAT GTCCTCCCAT AATGTCATTC AGAATGACAT GTAAGTCTTC GACGCGTCTT TCCATCCTCC ACTCACCTTT TAGCTCACCA TACTCTTTAG CACCCATGAA AATAGCTTCA TGACAACTCC CACGTCCGCA TCTTATCAAA CTCAAGTCTA TTATGCCGAC CCTTACGCGC ACGCGTCATT CCAACAGTAC CAATCGGGCG CGTATGGTGG AGAGATGCGG GAGCATATGA TCACGCCGTA CGGTGGACAT CAGGTGGCTC AAGGAAGGAT TGCGATGCCG ATGCACGGCC AGCCGTTGCA TCGCTCGCCC AATGGCGCGT ACGAGGAGTT TGAGTTCACG CCCTTTGTGC CAGAAGAGGA GACCACACCG TATCTGCAGG TACCTGAGAG GTATCACGCC ATGGTCCAGT CGCCCAGCAC GACTATGGGC CAATTCACCC ATCCGCTGTC ACCTGTTGAC CAGATGACAA TGGACCAACA ACATCATTTT CGGCAAAGTA ACTCGTTGGG CTCTCAACCT CAACAGTTCA TCGTTCAAAC CGGCCGCCCA CAACGACCCA AGCTCCAGAC TAGCCAATCT ATGATCGTAT CCACGCGACC CAGTACCCAA ACTTCTATGC AGCGACCAGG GATGACGCGA CACGCATCTC TTCGGGGCAC GACGCGCCCT GATAGTCCGT ATGTGGCCGG AGACGTTTTT GATCACGTTC CTGGGCATCT GGAAGAATCG CCAATTTACC AGCCAATCCC TCCTCAGGAT CCATCTTGGG AGCTTTCTGC GTTTGAACCT AATGTAAGCA AAATTTTGGT TTTAAAACAG ACACACACTA ATAACTTGCT CTAGTATGCC AACGGCCAGG GTATCTCCCC CGCTCGTGCT CTGGGTCCCG CGCCTCCGCA GCCACAACGT TTCTCTCCAC GCCACGACGA GTTGATGGTC ACCCCTCAGG CGAACAAGAT TGCATACTCT GAACACCCTC CGTCATCCGC CATTTCCACT GCTGCGTCTA CATCCTCCAG CTTTTCAGTC ACTATGAAGC ATCAACGACG CGAATTTGAC AGTAGCGATG ATGAGCAAGA AGGTCGTACG CGTAAACCAT TGCCTTCGCG CACGGTCAAG ACGCGTCAAG AAGAAAAGCT TCCCCCGCCG CCTCTCCCTC TCCCGACGCG TCCTCCAAAG GCGTCCAGTG CACGGGCGCC CGTCAAGCCT GCAAAGGTTG CCGAAGACCC CGGCGTCGAA GGTGTACCTC CAGGGCCAAG ATCACACGAA CGGCCTGGTC CATCTTTTGC TTGCATCATC GGTCAAGCCA 
TTTTATCATG CAAGGCTGGT GGCCTTTCGC TCGAGCACAT TTACCGATAT GTAGAGACAG CTTATCCCTA CTTCAAATCT GGCGATAACG CGTGGCGCAA CTCGGTCCGA CATAACCTAT CTATCCACAA GATGTTTGAG ACCATTCCGC GAACTGAAAA ATTCCCTCCT GGTAAGGGTG GTATTTGGAT TATTCACGAG GATGAAAAGT GCCACTGGCC AGCGCAGGAC AAATTTATCA AAAATTTTCC TCCTGGTCAC CCTCATCACG ACGTGTGTCG CCAGACGTTG CATGAGAGGC AAAAGGAAAA GGACGCTATG GAGAAGGCCG CGAGGGAAGG GAGGGTGTAT GTACCCAAGA AGGGAAAGAA GAGAAGAAAG CAAATGTTGA AAGAAGAATT GGAGGCTGAA GTGGCTAAGC GATTAGTCAT GGGAAATTCA GGAGCTGTGA CCAGTGAGCA AAAGGTTGAG GAAGAAAAGG AACAGACACC CGAGCCTGTC GAACAAGAGA CTATTAGCGA AGAGCCTGTT GTTACCGAGC CTGAAGCTCG ACCAGCTCCC GAGCCTGAGC CTACTGCCGA AACCGAAGGT GCCTCCAAAC CCCAAACCAA AAAGAACACA CCCATGCCAC CTCCCGCTGC CAAGCTTGGA AAGTGGGCCT TGCCTCCGCC TCCTGTAGAT CCCAAAGGAA AAAGGAAACA AGCAGAGTCT GAAGATGATC CCCTCTTCTC CACCTCCAAG CGCGTCCGTA TGGCTGAGCC TCTTGCCCCT ATCCATCCAT TCCCCCAAGA AACTGTACCG GGCAAGGCTG AAAAGTATGA CGCCTCATTC GTCACTCCTG AAAGGGAAAG GCCCATCCCT AATGGTAGCA AACTCTTATC AAGCGCGAGT GATTTCAAGA CACCCGCCCT TGTCCAATCA TCTTCATCAC CAGGATCTCC ACCTATGCCT GCCACCGTTA CCCGTCCAAC CCACCACCCC AGCAGTCTGC AACAGGCATG GACACATGAC GACATGTCAC AAACCCCTCC TCGCGATTCA TCACCTGCTA GACCGATGCT TGATGCGGCA TTTGATTTGA AGCCCAAGTC TTTACGTACA AAGCAAGTCG CGCAGGAGGA TGAATTCCCA CATTCTCACA CTCACCTCGC TTCCCCTCCA CATCCTCGTG GGCCTCCCAA AACACCTGTT ACCCGCTCTT CGGCGGCAGC AGACAAAACT CAGACTCCCC GCTTACATCA TCGCAAAACA CCAAGCATGT CGACCGTCAC CCCTGTCGTC TTCCGAGATA GCCCTGGTCT TCCGCCACCA ACGTCGAGTG CTTTACTCTC TACACCCATG TGGGAGATTG GTGGCTGCTT GGATAGGCTG AAGGACCATT TTGCGCCTTC ACCCACATCC AGTATTCACC CGATCCGGTC ACCTGCCCCG CCAACGAGTC CTACGAGGTA TGCGATGATG TTGATGGACA CTGGCAGTTC TCCGAGAAAA GGAAAGAGTG CAAGCTAAGT GATGAACGGC CGCTCGTCTG TTTGGCGAGT AGAGAGACAT GTGATTGAAG AATGGGCGAT GGACAAGGAT TGTCATGATT TCTTTTTTTT TTCTCGTTTT TGACGTTTAT ATATCATGAC CCCATAATAT ACATGGCTAG ATACCAGTTT TGCGTTTCTT TTCCATAGTG ACTAATTTGT AGCTTAGCGT GCGTACTCAG CGTTACATAC GAGTACATCC AATCTTTTCA TCTCTTGAGC AATTCACTCC ATTACCACCC ATTTCGCCCT CAAGATTATA GTTTAGCAAT TCGATTATGT 
TGTTGATTTT TTGTCGAT
|
Protein sequence | MSRPTSSDSQ HSGSMSSHNV IQNDITHENS FMTTPTSASY QTQVYYADPY AHASFQQYQS GAYGGEMREH MITPYGGHQV AQGRIAMPMH GQPLHRSPNG AYEEFEFTPF VPEEETTPYL QVPERYHAMV QSPSTTMGQF THPLSPVDQM TMDQQHHFRQ SNSLGSQPQQ FIVQTGRPQR PKLQTSQSMI VSTRPSTQTS MQRPGMTRHA SLRGTTRPDS PYVAGDVFDH VPGHLEESPI YQPIPPQDPS WELSAFEPNY ANGQGISPAR ALGPAPPQPQ RFSPRHDELM VTPQANKIAY SEHPPSSAIS TAASTSSSFS VTMKHQRREF DSSDDEQEGR TRKPLPSRTV KTRQEEKLPP PPLPLPTRPP KASSARAPVK PAKVAEDPGV EGVPPGPRSH ERPGPSFACI IGQAILSCKA GGLSLEHIYR YVETAYPYFK SGDNAWRNSV RHNLSIHKMF ETIPRTEKFP PGKGGIWIIH EDEKCHWPAQ DKFIKNFPPG HPHHDVCRQT LHERQKEKDA MEKAAREGRV YVPKKGKKRR KQMLKEELEA EVAKRLVMGN SGAVTSEQKV EEEKEQTPEP VEQETISEEP VVTEPEARPA PEPEPTAETE GASKPQTKKN TPMPPPAAKL GKWALPPPPV DPKGKRKQAE SEDDPLFSTS KRVRMAEPLA PIHPFPQETV PGKAEKYDAS FVTPERERPI PNGSKLLSSA SDFKTPALVQ SSSSPGSPPM PATVTRPTHH PSSLQQAWTH DDMSQTPPRD SSPARPMLDA AFDLKPKSLR TKQVAQEDEF PHSHTHLASP PHPRGPPKTP VTRSSAAADK TQTPRLHHRK TPSMSTVTPV VFRDSPGLPP PTSSALLSTP MWEIGGCLDR LKDHFAPSPT SSIHPIRSPA PPTSPTRYAM MLMDTGSSPR KGKSAS
|