Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC05730 |
Symbol | |
ID | 3256754 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 1700036 |
End bp | 1702972 |
Gene Length | 2937 bp |
Protein Length | 860 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255794 |
Product | conserved hypothetical protein |
Protein accession | XP_569790 |
Protein GI | 58265268 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.243763 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTTCAG GTATCAAACG TCATTATCAC GACGACGCCT TCTCGGCAAG CGGAAGGAGC GATAAATCCC AAGGAAGTGA TAGGAAGAAG CCAAGGGACT GGAGAGAGGC TTTCCTGGAT GATTCACCTA GGGGAGAGCA TGGTCGTGAG AGGGAACGGG ACCGAGAGAG GCCGAGAGGT TCTGAGCGAA ACTACGATAG AGAGCGACAG TATGGGGACA GACGGGGATC TTACGATGAC AGGCGCGGTG AAAGATACAA TGAGAAAAGG CAGGGAAATG GCGAGCATCG CTCTGGTCAA AGGGACCGAA AGGATTATCG TGAACGTGAG CCCGTGAAGG GAGATTATTC TCGACGATAT GGTGACCGAA ATGATCGTCA TAGGGACGAT CGCAATGATT ACAGGCGAGA TGTAAGGTAC AAAGAGAGAG AGAGAGCCGA GTGGGTTTTT ATCCCTTTCC TGAGGGATAT TGAGATGCTC ATCTATACAT AGGTCCCGTA GGTCGCCTCC AACAACTCAA TCTGGCCAAC TAGGATCTTC GCGGCCGCTC AGCCCCCCGC CATCTAGACC TCCCCAAAGG ACATCATCGA GGCCATTAAA CTCACCGCAA GGGATCCCAA TGGGTAAATT GGCAGCTCCA TCGCCTGGCT TTAAGCCTTC TCCAAGGAGC CCGGTTCGAG TCGAAAGTCA TCAGAAGTCG TCAATGAGTC GCTTCTTTGA CGCTGAACCT GAAAAATCTG AAACGGGGCC TGCCCCATCT AAAGCAATCG ACCTTCCCGT TGATCCAGAG CCTATTGTTG TTGAAGAGGA GCAAGATCCC GCTAAGCTGC TTGAAGAACG TAAGAGGAAG CGAGAAGAGA TCATGGCGAA ATTCAAGACT ACTGGTGGTA AGACCTCTGT CCCTGTCAGT CCCAAGATTG CTATAAACGA TGTAACTGGT GGACCAGGGA TGGAGAGTGT CACGAGCGGA GGTACTCGGA CAGGCTGGCA AACTGGGATA CAAAGCGGCA GAACGACTGC TACCGGTGAG TGATATCTGT CGGATCATTT TGCCAAACCT GTTGACTTTC TAGATCAAGG TGCTACGCCA CTCCTGAAAC AGCTCGGTAC AAGCTCTTCC AACCCAACTC CGATCCCTAC CCACGAGCCT TCTCTTGCTT CTACCCCTCT GGGCCGAGAT TTCGACCTCT CCAAGCAAGC ATCTACTACT AATGATGTCA CCATTCCTAT AGAGACTAAG GTTGAGGGTG TGGGTGCGGA CATGACGGTT TCAGCTGCTG ACTATGATCC CACGCGCGAA GGTTTAGTTG ATCATTCCAA GCGTCAGAAG GATTTGGGGA TAATCAATTC CCAAGCTAAG GAGATTGCTG AGGGCGTTGG TGAACCCGTT CTCGTGGAGG ATGATGAGCC AAAGGAGGAC GAGTATGAAG AAGTGGAGAT CGAGGTAGAA GATGACGAAG ACGACGAGTT TGATATGTTT GCAGCCTTTG GGGGTGAAGA AAAAGAAAAG AAGATGAAGA AGGTTGTCGT ACGACGATTG AAGAATGGCG GGGAAGGGGC TAAACAGGAT ATCATTAAGA AACCCGCTAG CACAATTGCA CCGGAAGTCG TGGACAATGT TGATGACTCG GATGGGTACT ACAGGATTAC TCCAGGAGAG ATTTTGGATA ATGGGAGGTA TCAGGTGACG ATCACCCTCG GCAAAGGAAT GTTTTCGGCG GTAGTGAAGG CCAAAGTTCT GAAGGCATTT GGACAGGAGA GGAGGCAAGA TGTGGTGGGG AAGGAGGTGG CTATCAAAGT CGTTAGGAGT CAAGAGAGCA TGTGAGTATC TCGCGTGTGT CTATTCTGCC ACGAAGTTAA TGAATAAATA GGTACATATC TGGACGGAAA GAATCCCAAA TCCTCCAAAA ACTCAATGAC GCCGACCCAG AAGATAAGAA GCATATCCTA CGTCTCGAAC GCACATTTGA GCATCGAGGG CATCTCTGTA TCGTCACCGA GAGTCTCAGG TAGGGCCATT TATCTATGTC AGATACCGGC AAAGTGCTAA CTCTACGTTA GCATGAATTT GCGGGACGTA ATCAAACGCT TTGGTAAAGA CGTCGGTCTC AACATGCGTG CGGTTCGAGC ATACGCACAT CAATTGTTCC TTGCTTTGTC GCTTATGCGT AAATGTGGGA TTGTACATGC TGATGTGAAG CCTGATAACA TTTTGGTGAG TACTGATGTC AGTTGGAGCT CAGCGTGATT CTAATGGAAA TTTTAGGTCA CGGAAAACAA GACGACATTG AAGGTCTGCG ATTTGGGTTC TGCGGCTGAG ATTACAGAAG GAGAAATCAC CCCTTACCTC GTCTCCCGAT TCTACCGTGC TCCTGAAATT AGTGTGTACT GTTTGTTTAT TCAAGTCGCT TCAAATCTAA TATCAAACAG TTCTTGGTCT TCCCTATGAT ACAGCGATTG ACATGTGGTC CATCGGCTGT ACCCTATACG AGCTTTACAC CGGCAAGATC CTCTTTCCCG GCAGGTCCAA CAACCATATG TTGTTACTCA TGATGGAGCT TAAGGGTAAG ATCAACCATC GAATGATTAA AAAAGCAGCT TTCGGTACAA TGCACTTTGA TGAGTCGTTG AACTTTATCA GTATAGAGAA GGACAAGATA ACAGGGCAGG CGAGTTTAAA TCTTTCTGCG GGCGATATGA AAAGGATACT TACGCAGGCC TCAGGATGTC GCCAAGACAA TGGTCATCAA CTCGGCTTCA AAAGATCTTC GGTCCCGTCT CGTTCCCCCG TCGTCTGTTC AGCTGAAGAT GAAAGACGAT GAGTTGAAGC AGCTTCTGAG TCTGGTAGAT CTGTTGGACA AATGTTTGCA GTTGGACCCT GCTAAAAGAT TGACTCCGAG GGATGCCCTG TTGCATCCTT TTGTTGCGGG TCCTTAA
|
Protein sequence | MSSGIKRHYH DDAFSASGRS DKSQGSDRKK PRDWREAFLD DSPRGEHGRE RERDRERPRG SERNYDRERQ YGDRRGSYDD RRGERYNEKR QGNGEHRSGQ RDRKDYRERE PVKGDYSRRY GDRNDRHRDD RNDYRRDVRY KERERAESRR SPPTTQSGQL GSSRPLSPPP SRPPQRTSSR PLNSPQGIPM GKLAAPSPGF KPSPRSPVRV ESHQKSSMSR FFDAEPEKSE TGPAPSKAID LPVDPEPIVV EEEQDPAKLL EERKRKREEI MAKFKTTGGK TSVPVSPKIA INDVTGGPGM ESVTSGGTRT GWQTGIQSGR TTATDQGATP LLKQLGTSSS NPTPIPTHEP SLASTPLGRD FDLSKQASTT NDVTIPIETK VEGVGADMTV SAADYDPTRE GLVDHSKRQK DLGIINSQAK EIAEGVGEPV LVEDDEPKED EYEEVEIEVE DDEDDEFDMF AAFGGEEKEK KMKKVVVRRL KNGGEGAKQD IIKKPASTIA PEVVDNVDDS DGYYRITPGE ILDNGRYQVT ITLGKGMFSA VVKAKVLKAF GQERRQDVVG KEVAIKVVRS QESMYISGRK ESQILQKLND ADPEDKKHIL RLERTFEHRG HLCIVTESLS MNLRDVIKRF GKDVGLNMRA VRAYAHQLFL ALSLMRKCGI VHADVKPDNI LVTENKTTLK VCDLGSAAEI TEGEITPYLV SRFYRAPEII LGLPYDTAID MWSIGCTLYE LYTGKILFPG RSNNHMLLLM MELKGKINHR MIKKAAFGTM HFDESLNFIS IEKDKITGPQ DVAKTMVINS ASKDLRSRLV PPSSVQLKMK DDELKQLLSL VDLLDKCLQL DPAKRLTPRD ALLHPFVAGP
|
| |