Gene Information |
Locus tag | CNK00090 |
Symbol | |
ID | 3254545 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 27477 |
End bp | 30256 |
Gene Length | 2780 bp |
Protein Length | 779 aa |
Translation table | |
GC content | 50% |
IMG OID | 638253503 |
Product | conserved hypothetical protein |
Protein accession | XP_567756 |
Protein GI | 58260692 |
COG category | [K] Transcription |
COG ID | [COG5576] Homeodomain-containing transcription factor |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CGCCGCTTGC AACCATCTGT CATCCGAGTG CAAGTAGTCT TCGCAAAATG GTCGACCTAT CTGCGCTTTC TATCCCCCAA TTCTCCTCCT TAAGCCCTTC GGACGGCTTC TCACCTTTGA AGCATGCTTT TGGTCCTTCC CGACCACTCA GCATCGCACC ATCTGATTCT TCATCAGAGC CTTTAACGGT ATCTCCGGCA GGGACCAAAA CTTCCTGGGC GCTTATCTCT TACTCATCTT GTTGTCCCGA TAACTCGCAA GATATCATCG ACCGTTCTCC CTTTCCCACT TCTACAAATT TTACCAGTTC CTTCTCTATG TCTCAATCGC AGCAAAATAA TGATACATAC TCGTACCGTA TGACACCTAA TACTGGCACC AGCCAGTGGT CTCAATCACC AACCGCAGAC AACTCATTCA TCTCCTCACC TTTGACAAAG AGCATAGCCT TTCAAAACGG TCTGACAGGC ATTTCTCCCA TGTTTGCGAC TTTCAATACC ACTTCTATCC CCAATGCAAG TTACCCTACA GCCGCCCATC CTATTAATCC TTCTTGCCCC ATGCCAGTTC GAGGCTACTC TGCAAGCTCC GCCGTGCATT CGCCTAACCG GCGTCGGTCC TCCACACTCG TCACCCTTTC TTCGCCATCA CCAATGACGC CTCCATTCAA CCACTACCCC TACCCTCCTC TACACATCTC TTACCCAAGC ACGCCATCCT CCGTTTCAGC GAGAAATCTC TTCCGGCAAC CATCAATACC GGCGTTTTCG GAGAACAAGA GGATATTTCA TCCTTCCCCA GCGGGGAAAA CGCCTGCTAC TGGAGCAAAA CTGACCCAAT TGCCTTACGA GCATGGATAT GAAGGAGGGT CTCAATTTGG ATCTGGAAAT ATGCTTGGAG GAATGGGCAT GAATATGGAA ACTGGTAATC CTTTAAGTCT CAATATGTTC CCACAGATGA ATACAGGCTT TCGTCCAGGG AATGGACAAG AATTCAAGTT GCCGAGGTTT AAACCTACAA AGGAGCAGTT GGAAATCCTC ATCAAATCGT ATGAAGAGAA CAAGTGAGTT AAGCTCTCGA TTTTGTGAAC GAAACTGGGT TTCTAATAAG GTGGTAGAAC TCCAGACGGT CCAACTCGGG AAGCACTTGC AAAAAAGCTT GGCCCGGACG TGCGTCCCAA AACTCTGCAG ATTTGGTTCC AAAATCGGTA AGCACATTCA TCACCTTGTA TTCTGAATCC GAAACTTACA TCTTTCAATA GTCGTTCAAA ATCTCGCGCG AAGGAGCGCG ATGCGGCCAA TATTCCAAAA CCTTTACAAA CCAACAGTCC GACCATCAAG CCGTCTGCGC AGGGACACGA GAAGGGTAAA AGTGGACCAA CAAGATCAAC TAGCGGGGGG CCGGGTGAAA TGAAACAAGG AGGGGTTAAT ATAGAGCGTT TAAACAGCCT TATCCATGAT GATGATCGTA GGTTTTATTA TAATGCGATT GCCGGCAGCT GATCAAGATG ATGTTCTCAG CGAGCTTGTC CATCCTCCCC ATCACTGTGC TTTCGATTGC CAAGTGGACT AGGTTCCTCA CCCCTGGAAC AGGCAATATA TGCCCCGACC TAGCTGCTTC TATCCGTTTC CGTTCAGCCT CCACTCCTTC CCATCCCTCC CCTCTCTTAC CCACGCTGCA CCTCTACGTC CTCCACACTA CCATCTTTCG TATCGACATT CCCCTGTCCG CCTCTGTCAT CTCCAATCTC CAAGCGACGA ATAACCCATC AGTCATTACA GATGCAGTGG CTGTGCGATT 
CGAACTTGGA CGAGGTCAAG TAAGATTTGC GTGTTGGATA GATGAAGAAG GCGCGGGATG GAAGGAAGTC GGTGATTTTA CGGGAGGCGA AGCTGGAGCC GGGGGTAGAG TCGAGTTGAC CGGACCGGCC TCTGTGAGCA TCACATCTGC TGCTTTTGAA ATAAATAAGA GCTTACTCCG TCCCACGACA ACAGATATTA CTACCAGCCT TTTCTACAGT CCAACAACTC CTCACGAATA CCGCCTATCC AACTCCTATT TCCAGTTTCC TCATCTCCGG GCCCATCTCT ATCCACCCAA AGCCTATCAA TGCCTTGCAT CTCCCCACCA ATGCCAACAC TCAGTCTCAC TCCAATCCAA CCTCAACCCC ACCTTCCTTT TCCTCACCTC CCAACCTTAC CGCCACGCCG CCCCTTCCAC TTTCAAGTGT CAGCGTCGAT ACAGACATGA TGTCCCGCTC CCATCAAGGG CAACAAGCTG CTACACTAGC TCCAGCAGGA CATCAGAGAC AGCGGTCCTT CTCACAACCG ATCTTTCCGA CATCTGCATT GATCAGCGGT ATGATGCCCA TTGACACGAG CTTATCGAGT ACTGCACCAA TTGATAAAAC TACCAGGGAC TCTCTCACGA TGCGTCCGTC GGTCAATACA ACATTGCAGG CTAGTGCACA GAGCAGCTTT GAACCCTCAC CCGTTTCGTT CATCGGGAAT AGCGGCCATA GTCAAACCCC TGTAACAGGC TCGTTCAATG GGAACGCAAC AGTTCTAGGG ACCGACCAGC TCTGGGAGTC TCCAGGCGGT CTTTCGGACA GCACAAGCCT GACATTAAAC ATCAACGAAT TCGAGTTGGT TGAAAAAAAA TAACATGAAG ATGGGTACCC CGGCTTCTAT GGCTTCTTCC GCAGAAGGGC AAAGTACCAT TAAGCAGAAA GATTCGTAGG CACCTGATGA CATTGACTTC AACCGAACAC TGCTTAGAGG
|
Protein sequence | MVDLSALSIP QFSSLSPSDG FSPLKHAFGP SRPLSIAPSD SSSEPLTVSP AGTKTSWALI SYSSCCPDNS QDIIDRSPFP TSTNFTSSFS MSQSQQNNDT YSYRMTPNTG TSQWSQSPTA DNSFISSPLT KSIAFQNGLT GISPMFATFN TTSIPNASYP TAAHPINPSC PMPVRGYSAS SAVHSPNRRR SSTLVTLSSP SPMTPPFNHY PYPPLHISYP STPSSVSARN LFRQPSIPAF SENKRIFHPS PAGKTPATGA KLTQLPYEHG YEGGSQFGSG NMLGGMGMNM ETGNPLSLNM FPQMNTGFRP GNGQEFKLPR FKPTKEQLEI LIKSYEENKT PDGPTREALA KKLGPDVRPK TLQIWFQNRR SKSRAKERDA ANIPKPLQTN SPTIKPSAQG HEKGKSGPTR STSGGPGEMK QGGVNIERLN SLIHDDDPSL SILPITVLSI AKWTRFLTPG TGNICPDLAA SIRFRSASTP SHPSPLLPTL HLYVLHTTIF RIDIPLSASV ISNLQATNNP SVITDAVAVR FELGRGQVRF ACWIDEEGAG WKEVGDFTGG EAGAGGRVEL TGPASVSITS AAFEINKSLL RPTTTDITTS LFYSPTTPHE YRLSNSYFQF PHLRAHLYPP KAYQCLASPH QCQHSVSLQS NLNPTFLFLT SQPYRHAAPS TFKCQRRYRH DVPLPSRATS CYTSSSRTSE TAVLLTTDLS DICIDQRSFE PSPVSFIGNS GHSQTPVTGS FNGNATVLGT DQLWESPGGL SDSTSLTLNI NEFELVEKK