Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CND03610 |
Symbol | |
ID | 3257389 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | - |
Start bp | 992221 |
End bp | 997164 |
Gene Length | 4944 bp |
Protein Length | 1469 aa |
Translation table | |
GC content | 49% |
IMG OID | 638256295 |
Product | conserved hypothetical protein |
Protein accession | XP_570558 |
Protein GI | 58266804 |
COG category | [R] General function prediction only |
COG ID | [COG2940] Proteins containing SET domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.199179 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACCTC ATGAGAAAGG TGTCAATCCA TCAGAGTCGC CCTCCGGCAG CCTCAAAAAG GCGCCACCTA GTGGCCCCAA AGCCCTCCGC GGCTTTGCGT CTCCTTCCGC GTTCCGCAAC GCTGATATCG GCAATGGCTT ACTGCAACAC CGTGGCGAAG ATGAGAGAAT CTCTTTCGCG TTTCCCAGGA AGGGAAAGGA AAGCCTCGAC AGAAACGGAG AGCGGCCGGC ACCAATGAGT TTGGAGAGCA GATTGGGTCC CCCTGTCAGT CGCTTCAACC GCCCTGTAGG TGGCGACAGC TTGGAGAGAG GTAATGGAAA GGGGGGCTGG GATAATAGGG ATGAAAGAAC TAGTGCAAGT TCTTCCTCAA TTCACAGAAT AAATAAGGAG AAAATTCGCC CTCGGTCTGA TTTCATCGAG TCTAGTGCGA ACCTTTATTC TGAAGATGAT CGCAATCGTG ATCGAGGCAG ATATCATGAG CGCGACAGGT CTAGAGGAAC AAACGGAGAT AGGGGGGGGG GAGAAGGTCA CAGTCATAGG GAACCTGGAA GGGGGAAAGA GCATCAGAAT GGGCAAGGGA GAGATAGGAG CCTTTACAGA GACCATTCAA GAGAGAGAGA GAGCTCGCGT GACGAGCGCG ACAGGTACAG CGAAGAGTAC AAACATCAGC GTAGCAAGGC CAGATTTTCC CCAAGTCCAA GTCCTGAACG TAGTAGACTT AAGTCATCTT TAGGCAGGCA CCGGTCACCA GTTTCGGTCT CATCGTCCTC CTCGTCCTCC GCCCGCTCGC CTCCACCCGT GCAGAATCGA GAACCTTTAA GATATAATGG CTCTCAATCT AAAAATGGGG AGAAAGAGCT TTCAAATGGA TTGCTTGAGA ACTCAATTTC CCGATCCGGG GTCAGTATCG CTGTGCCGAG AAAGCTGGAA ACTAAGCAAT TAGTTCGCCC TTCGCCCCCT CATGTCAATC TCAAGTCTAC CATACAGGAT AATCCATCTC CACCAACTGG CAAATATCCC CCTTCACCGC CTTCTTTAGA TATACTTTCG AAACCATCCT GCAATCGTGA ACCTCTTCCG GATCAGCGTC CACCCACCCC GCCACTACCG GAAAATTCAT CGCCCACATC TCCATCCCTT GAATCGTCAA GGTTTGATCA AAATCAACAC CTCTTACCTG ATGAGTTACC TCTACCGCCC CTCTCTATAT CATTTCCCCA GAAACCAGTC TCATCATCAT CATTGTCTCG TCTCTCTTCT CTATCCGCCG CTCTGTCACG ACCTAACCCT CTCGACAAAG ACGGTACTTC GATGCCGCCG TCTTTCCACT TCAGAGAAAT ATCTACTCGT CACCAGAGAC TTTCTCCTCC AAATAATGCC GAAAATCAAT TACCGGAAAC GTCCATTATA CCACCTCCAC CGTCTGAGAC TGTACCAGAA CCCCCGTGGA TTCGTCCCCC ATACATCCCC CCTCCTTGCA CCAAGCATCG TCCGGGAATA GGCAACTTTT TTATCACCAA TCTTAGAGAA AAAGTGGAAG ATAAATCAGG GAAAGAAGAG AAGAGGGTTG ATGGGATGGA AGGCGGAAAA GTTGTGCAGG TGACAGATCC GAGACTGTCA ATGACGGAGG AACAGCGAGG GAGGGGAAGA GGGAGCTCAA AGCAAAGAGC GGCGTTTTAC GAGTTGACTT ACGAGGTCCG TATCCGTATC CTTTTTCACT TCTATAAGCA AAACTGAATT AAATCACAGT GGGATTTGTA TTCCGTTACT CCCAAACCTC CTTCACCACC GACGGCTGTC TTGATCACAG GCCTTGGTCC TCTGACTACG GTTGACCAGA TTACCAAATT TTTGCGACCC CATGGCCGCA TCAAAGAGAT TGACTCTAAA GTTGACCGGA AAACAGGCAT GCAGCTTGGA ATATGTTGGG TCAAGTTCGA GGGACCCCCT TTGGGTCGTC CTGGTACCGC GCACGATGTA GCAAGTATGG CCGTCAAGGT GTGTGATGGA AAGAAGATAA GCATGGGAGG CGAGCGAATC AGAGTGGTGT TGGACGGCAG GGGGAAGAGA GCAGAACAGG CGGTGAAAGA AGAGATGGAA AGGAGGTATC CTCCTAAGAA GCCTTCTGCG TTGCCAAGCG ACATGAAGGT GACACCCCTG TCCGGGACAG CAATGGCTAC GACGTCCAGA CCTCCAAATG CTCCAAATGC AAGCACTCCT TTGATCGATA AGCAGACATT TGATACGTCT GCCAAAGCTC CGATTATTAG ACCTGGCGTA CTGAAGCCTT TAGGACAGAA GATGTATCAT CGCCCATCAG CTCCTCCCCT CGTCTTCAAC AATCACCGAT TCCGCGATGA ATCATTCCTT AATAGACCAT TCAATGCCGG TGCAGGAATG ATATCGCAGC AGCAAATGGG ATATAAGATA CTACCAGGGA AACCAGTGCA ACAGTTGGCG TCTAGCTTCA CAAGCGCGCC ATTCGTCAGA CATCCACGGG AAAGGAGAGA AGACAGTTGG ACCAATGAAC GCGGCAGGCG ATTGAAAGGA GAATCATCTA CCCGCCATTG GCGCGCGAGA TCGCTGTCAC GATCTTCTTA TTCTTCCTAC TCTTCATACT CTTCCTATTC TGAAGAGAGT GAGGAAGAGC GACCCCGGCA TCCCACGAAA GTACCATATC CTCAGAGGAA ACGCCTTGCC ACAGGACCGA GCAAAGAGGA CGAGTATAAA ATGGAAGATG TTAGGGAAGC AATCAGGGAG AATGGTCACC CATGCGTATT TATTGACGCT AAGTCTCTAC CCGCTGCAAG AGAGTATGAA AGTTATTTGA GGGATCACTT CAAGGCCTTC AGGCCGACAG AGGTAAGCCC TGCCCTGAAC GCAAAGCCTT AATTACTCAC AATTCTTTCT TTTCCTTTGT CATGGCAAAC AGATTATTCA TAGCCATTTG GGCTGGTACA TTCTTTTTGC CGACGATACC ACTGCATATA GGGTCCAGCG CGTATTGGAC ACCACAGCGG TACAAGGTCA TCGCTTATCG CTTGTTGTCC ATACATCTTC CGGACCTCGT GCGCAAACAG ACGCCTCAGA ACCTGTGACC GGTGGTGTAG GGGAAAGCAA AAAGGGCAAC TGGCGATATT TGACAATCAC GAAAAAGTCT CGACCAATGC CTGCTGTGAA GAAGTCGGGA AAGTCTGCGA CCATCAGGAG GAAGGTATAC TCACCTTCAG TATCTGGTAG CGACGACGAT GATGAGCAAG TGCCTGTTAT GGCTCAGAAC AGGAAACGGG CTCCATCATA TGCAAGCTCG ACCTCGCCAC TTTCAGAAGA TGACAGGCCG TTTGCCCGTT CGGTCCAAAG GGAGGAAAGA GATATCGACA AGGAAGGCAA GTTTTCTCTT ATTGGAAAAA AGGAAGATGT AGTATCAGTT AAGGCGGCCA AAGGTCCAAA GAGTAAAACG ATCAGAGTGG ACTCTGACGA AGTGGAAGAA AATCAAGGAG TCCCATTGGC CAGCATTGGG GAAGTCACAA AGGCGGAAGG AAAGCAGGAT GATAGTACGG TCGTCAAGCT CGAGACATTA CTTTCTGAGA GTATCTCTGA GCTTACCAAA GGCAAAAAGA GACCTACCAA AGCGAAGGGA GGGAAGGCTA CAAAGAAAGT ACGCCTTGAC CAAGAAGCCG ACGATGCGGC TACGAAAATC CAAATCGACG AAGATATTGT ACCTCAACCA CCCAAGAAAA AGAAAGTCGT TAAAACTGAG GTCGATAAAC TTTTGGCTTC CGGTGTACTT ATGGATGAAG AAGATGCGTA CTGGCTTGGT CGAGTATTAG CCGCCCAGGA AGATGGCCTC GAGCCGATTT GGTCGGATGG CGAAGAAGAC TTGGTCGATG AAGGCCACCC TCTCTTTCAC AAGTCAGGGG CTTGGCGAGC TGAAGGGTGG AAGAAAGTTG CCCAAGTGCA AAAGTCAAGA TATCTTCCGC AGCGCAACCG AGCTGTCGTC AACTCCGAGG ATGTCGGAGG GATCACTACA GGCAGGACGG CTCGTCTTGC TGGCCGTGAT CAGCACCGAC AAACAGCAGC AGTTGCTGCT AATAATACCG TCGAAAGCGA TCTGTTTGCA TTCAATCAAT TGCGTATCAG AAAGAAGCAG CTTAGATTCG CGAGGTCAGC AATTGAAGGA TATGGGCTGT ATGCGATGGA GACGATCCAT GCCGGGGAAA TGGTTTGCGA ATACGTGGGT GATTTAGTGC GGGCCACCGT GGCGGATGTG CGGGAGCAGC GATATCTAAA ACAGGGGATT GGATCATCGT ATCTCTTTAG AATTGATAAC GATATCGTTT GTGACGCCAC CTTCAAAGGA TCAGTAAGGT AAGTAAGGTG TTTCGGACAA GTCAAGCAAC GTGAAGTTGA CAGTAATTTG TTAATTTATT ATTCTTCAAC TCGTTCCCTC GAAAACGTAG CCGGCTTATC AACCATTCAT GCGATCCGTC GGCCAACGCC AAAATCATTA AAGTCAATGG ACAATCGAAA GTGAGTCACA ACGACCATGT ATACAGTCTG GGGAACATTG CATGCTAATA AGCGAATAAA GATCGTTATA TACGCTGAGC GAACCCTTTA TCCCGGAGAA GAGGTTGGTG ACCCCCCTCA ATATCAATGC GAATGGCTAA TTTGATGGCT GACGGGCGTC GATACTTTCA TGTCACAGAT TCTGTATGAT TACAAATTTC CGCTGGAATC TGATCCAGCA CTCCGAGTGC CTTGCCTTTG TGGCGCCGCG ACTTGCCGAG GCTGGCTCAA CTGAAAGCAA CCTTGTGGAC TCTGATGGAT GGATAGTGAA TGTTTTATGT TGTAATATTT ACTGACGTAT CATGACGAAT GATAATGACT GGGTTGTGGC AAATAGTAGT AGCTGCGAGT ACTGGCGACA TCAGCATTTG AAGA
|
Protein sequence | MAPHEKGVNP SESPSGSLKK APPSGPKALR GFASPSAFRN ADIGNGLLQH RGEDERISFA FPRKGKESLD RNGERPAPMS LESRLGPPVS RFNRPVGGDS LERGNGKGGW DNRDERTSAS SSSIHRINKE KIRPRSDFIE SSANLYSEDD RNRDRGRYHE RDRSRGTNGD RGGGEGHSHR EPGRGKEHQN GQGRDRSLYR DHSRERESSR DERDRYSEEY KHQRSKARFS PSPSPERSRL KSSLGRHRSP VSVSSSSSSS ARSPPPVQNR EPLRYNGSQS KNGEKELSNG LLENSISRSG VSIAVPRKLE TKQLVRPSPP HVNLKSTIQD NPSPPTGKYP PSPPSLDILS KPSCNREPLP DQRPPTPPLP ENSSPTSPSL ESSRFDQNQH LLPDELPLPP LSISFPQKPV SSSSLSRLSS LSAALSRPNP LDKDGTSMPP SFHFREISTR HQRLSPPNNA ENQLPETSII PPPPSETVPE PPWIRPPYIP PPCTKHRPGI GNFFITNLRE KVEDKSGKEE KRVDGMEGGK VVQVTDPRLS MTEEQRGRGR GSSKQRAAFY ELTYEWDLYS VTPKPPSPPT AVLITGLGPL TTVDQITKFL RPHGRIKEID SKVDRKTGMQ LGICWVKFEG PPLGRPGTAH DVASMAVKVC DGKKISMGGE RIRVVLDGRG KRAEQAVKEE MERRYPPKKP SALPSDMKVT PLSGTAMATT SRPPNAPNAS TPLIDKQTFD TSAKAPIIRP GVLKPLGQKM YHRPSAPPLV FNNHRFRDES FLNRPFNAGA GMISQQQMGY KILPGKPVQQ LASSFTSAPF VRHPRERRED SWTNERGRRL KGESSTRHWR ARSLSRSSYS SYSSYSSYSE ESEEERPRHP TKVPYPQRKR LATGPSKEDE YKMEDVREAI RENGHPCVFI DAKSLPAARE YESRQSHLGW YILFADDTTA YRVQRVLDTT AVQGHRLSLV VHTSSGPRAQ TDASEPVTGG VGESKKGNWR YLTITKKSRP MPAVKKSGKS ATIRRKVYSP SVSGSDDDDE QVPVMAQNRK RAPSYASSTS PLSEDDRPFA RSVQREERDI DKEGKFSLIG KKEDVVSVKA AKGPKSKTIR VDSDEVEENQ GVPLASIGEV TKAEGKQDDS TVVKLETLLS ESISELTKGK KRPTKAKGGK ATKKVRLDQE ADDAATKIQI DEDIVPQPPK KKKVVKTEVD KLLASGVLMD EEDAYWLGRV LAAQEDGLEP IWSDGEEDLV DEGHPLFHKS GAWRAEGWKK VAQVQKSRYL PQRNRAVVNS EDVGGITTGR TARLAGRDQH RQTAAVAANN TVESDLFAFN QLRIRKKQLR FARSAIEGYG LYAMETIHAG EMVCEYVGDL VRATVADVRE QRYLKQGIGS SYLFRIDNDI VCDATFKGSV SRLINHSCDP SANAKIIKVN GQSKIVIYAE RTLYPGEEIL YDYKFPLESD PALRVPCLCG AATCRGWLN
|
| |