Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC01040 |
Symbol | |
ID | 3256194 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 297399 |
End bp | 300625 |
Gene Length | 3227 bp |
Protein Length | 985 aa |
Translation table | |
GC content | 53% |
IMG OID | 638255323 |
Product | chromatin modification-related protein, putative |
Protein accession | XP_569964 |
Protein GI | 58265616 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.541862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GAAATGAGGC ATGTCCGGGC GATGGAGCGT ATTGAGGCCA AAAAGGCGGA GAACAGATGG TCCCTTCGAC AGCCAAAAAA GGCAAGGGGT CCTGGCGTAC CAAAGTCTCA TTGGGACTAT ATGCTTGAGG AAATGGAATG GATGCGCACG GATTTTGCTG AAGAGCGTCG GTGGAAGGTC GTCGAGGCGA GGGAGTTCGC ATATCAAGTC GTAGAATGGC ATCTGGCGAG TCCTGAGGAA AAGAAGGCTC TTATGGTGGG AGGTCGAGGC TGGGGAGAGT GCCGCAATGT TCCGATACCA GGTCATGCGG GAAAGAGGAA GGAAGTCACT GTGGAAGTGG AAGCTGAGGA CGAGGATGTT GAGATGCTGG TGGGACAGGA AGGAGAGCTG GATGGGGAAG GAGAGGCGAA CAAGGTGTTG GAGTCAATAG ATGAAATGAG GGTTAATGAA AAGGAGAGGG AGAACCGACC AGAGGATCCT AGAGAAACTG TCAATATAAA TCAGGACATT GGGGAAGAAG TCGATGCAGA AGGCGAAGCA GATGCCGACG GTGAACCAGA AAATGGAGAA GCGGATGCAG AAGGAGAGGC AGATGCGGAT GGTGGACCTG TGGGCGACGA CGTTGTAGGG CTTTCCGGTA AATATTATTC TACGTTCGCT GCTCAGTCAA TGCTGATAAC AATCAGAAAT CGACGCTGCC CAAGATGATA CTAGGGAAAC TTCTGAACGG CCAAGCTATC GGCGTGATAC TGTTCTACCA AACGGCCTTG TCATCCATAA GCGGTTCGCC AATGCGTACG AGATTGCAAT TGCTCGGGGG CCCGTCCTTG ACACCCCTTT GGCGAACGCT ACCGTTGATC TTGATACTCT GACAAAATCC TCATCTGCTG CAACTCCAGC TACAGTCCCT GCCGAACCTT CTGTTAGCCC AGACGAACCT GCTTCCTTTG ACCAACTCTT TCCGGATCTC GCTATGTACT CTGGCCCCGC CCCGCCTGAG AATGACAAGA AGTATCGCCG AGATGAGGGC GGTACATACA GTCACCGCAT GGCACACACT TCTCGAATCA TGGACATTCG GCCCATTCTT GTTTCCACCC TTCAGCCAGC GAAGAATCTA ATTGACGGCG AGTGGGATCT TCATGATGGA CCTTATTATG AAGAGGTAAA GGGAGCTGCA GATATCCCTC CAAATGTAGT TGCTGCTTTT AATACGCCCT TTGGCGGCAA AGCGTCAAGA CCCTTGGAGC ACATGCGAGT GCCTGAAGTT CCCAAACCAG CTGCGCACCA TCTTCGTGCA CAATTGCTTT GGTCGCCAGA GGAAGACAAG TGCTTGTTAA AGCTCGTCGC CATGTATCCA TTCAACTGGG ATCTCATAGC AGACAGTTTC AATACGGAGA TGATCCTCAT TCCTGTCGAG AAACGAAACC CGTATGAATG CTGGGAGCGA TGGTACTATA CTTTTGGGGA AGGAAAGAAC AAGCCTCGGC AGGACGCGCC GCCCTCGGCT CCTCCACCTG CGCCTGCTTC TGCCACGCAA CCTGGTACAG CAACCGCTAC GCCCGTGCCA CAATCAGCTG TTACCACCCC TGGAGTCCCT CCATCCGCCA ATCTCCCATC TGCTTCGGGA CGTCCACAGC AAACTGGCGG TAACAGTGTG TCTTCTCTTC CTACTCCTAC CGGAGAAGCG CTGCCTGACG GTGCACCTCC ACCCCCCGGG ATGTCCAAGA GAGATAGAAT GGCGGCGAAG CCAAAGTACG AAGGGACAAA GAGGTCAGTC AGACACCAAG CGATCTATGA TGCGGTGAAG AGGATGAACA GGAGGAGAGA AGCGGCGAGG GCGAAGAGCC GTAAGTTTGA ACGCTAAGGA TGAGACAACA ACTGACAAGT GGGTTAGATA AAGACAATGC ACAACGAAAG GTAATCAATG TGCACGAAAG TCACAGTATG AGCTTCCCCC ATGTTGCTGC TTCCACCCCA TGGGAACTAG TTGAAGCTAA ATATCAGCGC GATGTGCAAA TTGCTCAGCA GCGACAGCAA CGTGCCATGC AAGAACAGCA ACGTCAACTT GCCATTCGTC AGCAGCAAGC CATGATGAGC GCTCAGCAAC AAGCACAAAT GAGGCCGCCA AACATGCCCA ACGTCCCGAA TATGCCGAAT GCTCAACCTA TCCGCATGGG TCCCAATGGC CAGCCAATGC CCACTATGGC GCCAAGCCAA CAACAGCTAT TAAATGCTGT TGCCGCTGCG ACAGCTGCCA ATAGGCAAAA TGCAAACGGC GCTGTCCAAG GTAACCCTAA TGTTCGTCCA ATGCCTGTTG TTCAGGGGCA GTCACCTCAG GTGCAGCAAC AAATGCTTCT TCAAGCACAG CAAATGGCCG CCCAGCAAGC ACGAGTGTTA CAAGCACAGG CACAAGCTCA GGCTCAACAA GGCCGAGCAC CCAGCATGGG TGGAAATCTA CAGCCACCGC AGCTGGGTGT CTCATCTCCT TTCGCCCAAT CTCGTACTCC CGATCTTCCC GCGGAGGGCG CTGGCCCTTC TGGTATCAAC CCCACTCCGT CTCCGGCCAT GCAAGCAGCG GCCATTGGGG CTCAATCCTC TCCTCAAATA GCAACGATGG GTCGGGCACC GTCCAATAAC GTACCTCCCC ATCTTCGAGT ACCCAATGCA GGTACCTCTT CACCTCAGAT ATCTAGCCCG ATGGCCTTGC CTCAAGGAAT ACCAAATGGA GCTGGGATGC CGGTACAGGG AGCCCAAACT CAAGGGATTC AAATTCCGGC AGCAATGATG AACAATGCGA CTGTGCAGCA GCTTCTGGCG ACACTAGCGG CCAGTGGTCA GCAAATGACA CCGGAGCAAT TACGAGGGTT AATGCTACGA TCGGTGAGCT AATTTGCTAC TTATATTCGC ATGGCTAACA TTTCCCTAGG CCCACATGCA AGCTCAGGCA CAAAGCCAGG TCGGCAACCC TGGGACACCT CAGATGGGGG TGCAAAACAT CCAAGGCGTC GTGAGTTCCC TTGATAGATC GCTGTCTACC TCCAAATTTT TGCTGACCAA AGGGGAAGCA ACATTTCGCG AGATCACCTA GTTTGCAAAA CGCCCAGTCC CAGCCTCGTT CAAGCCCCAA ACCAGGTCCT GCCAATGGTC AAGGGACATA AGCATTAGGA TTTGACGATG TTTTGAGTTA TTTTTTTTGG AAATTTTAGT TCAACTACGA ATGCGAC
|
Protein sequence | MRHVRAMERI EAKKAENRWS LRQPKKARGP GVPKSHWDYM LEEMEWMRTD FAEERRWKVV EAREFAYQVV EWHLASPEEK KALMVGGRGW GECRNVPIPG HAGKRKEVTV EVEAEDEDVE MLVGQEGELD GEGEANKVLE SIDEMRVNEK ERENRPEDPR ETVNINQDIG EEVDAEGEAD ADGEPENGEA DAEGEADADG GPVGDDVVGL SEIDAAQDDT RETSERPSYR RDTVLPNGLV IHKRFANAYE IAIARGPVLD TPLANATVDL DTLTKSSSAA TPATVPAEPS VSPDEPASFD QLFPDLAMYS GPAPPENDKK YRRDEGGTYS HRMAHTSRIM DIRPILVSTL QPAKNLIDGE WDLHDGPYYE EVKGAADIPP NVVAAFNTPF GGKASRPLEH MRVPEVPKPA AHHLRAQLLW SPEEDKCLLK LVAMYPFNWD LIADSFNTEM ILIPVEKRNP YECWERWYYT FGEGKNKPRQ DAPPSAPPPA PASATQPGTA TATPVPQSAV TTPGVPPSAN LPSASGRPQQ TGGNSVSSLP TPTGEALPDG APPPPGMSKR DRMAAKPKYE GTKRSVRHQA IYDAVKRMNR RREAARAKSH KDNAQRKVIN VHESHSMSFP HVAASTPWEL VEAKYQRDVQ IAQQRQQRAM QEQQRQLAIR QQQAMMSAQQ QAQMRPPNMP NVPNMPNAQP IRMGPNGQPM PTMAPSQQQL LNAVAAATAA NRQNANGAVQ GNPNVRPMPV VQGQSPQVQQ QMLLQAQQMA AQQARVLQAQ AQAQAQQGRA PSMGGNLQPP QLGVSSPFAQ SRTPDLPAEG AGPSGINPTP SPAMQAAAIG AQSSPQIATM GRAPSNNVPP HLRVPNAGTS SPQISSPMAL PQGIPNGAGM PVQGAQTQGI QIPAAMMNNA TVQQLLATLA ASGQQMTPEQ LRGLMLRSAH MQAQAQSQVG NPGTPQMGVQ NIQGVQHFAR SPSLQNAQSQ PRSSPKPGPA NGQGT
|
| |