Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC00950 |
Symbol | |
ID | 3256411 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 281578 |
End bp | 286424 |
Gene Length | 4847 bp |
Protein Length | 1444 aa |
Translation table | |
GC content | 50% |
IMG OID | 638255314 |
Product | conserved hypothetical protein |
Protein accession | XP_569410 |
Protein GI | 58264508 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.148388 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCAGGTAGGA AGGAAGGACG AAGAGAAGGA AAACACAATA ATCCATCCTT AACTCTCAGG CTAACACAAT CAACGATGGC TGTCCTTTGA AAGCATAAAG CATATTCCAC CATTCTATGT CAACCTATTC TCCACTTCAG CAATGTCGTT CGTATCATCG GAAACGCCAA ACGCAATCGC GTCCTCGTCA ACCGCCCATG CTGGCTCTGC CAACTCGAGA GCAAGACACA GACTCGACAA TGGATTACGA TATCACTCTA ACCATTTATC TAGTGATTGT CGACCGTTGC CAAGATCAGC GCCGCGGAAA CAGAGGGTCG CCAGCAAGTC GAAAAAGCTC AGATGGAGGA AAGGGTTGGT TACAGAGGCT TTCCGCAAGT TTGGCGAGCA TTGTGCAAGA AATCAAGTGG GTGAACAGAA GTATATATAA TGATGCCATG CTGATTCGTG TAAGATACGA ACACTTCTCA TTGACTGTCT TGTCATGACC AATTTGTTTT ACCCTTCCCT GGCGCTGTAT CTCCCGAAAA GATTTCCTCC GCCGAGCCCA CGTGCTGTTC AGGACGCTCA CTGGCCACCT TTGGACAATG ATGTTACATC GCCGTATCGT ACAAATGTGG ATCATCCGCT ATCTTTCTCT ACTTTAGGGC GTCTTTTCCC CTCAGCTCCG CCTTTACTAC CCCGTTTGAC GTGGGCTGGT TGGTGGGGCA GCGGTACTCG GGAACGGGAG GATGACGGAT GGACAGCAAC AAGGGCCATA CCTGGCGCGA AAGAAATAGG CGAAGGACAA AATGATCAGG TCTGGGTTAT GCGGATTGGA TGGGCTGATG TCGAGGACGT CCTCGATCGA CACATTGAGG ATGGCGAGCG GAAGTGGGAA GAGAGAGATC ACCACCTCTT AAAGCTGGTA CAAAATGTGG CCGAAAATTG GGAGAGACAA ATTCCTTCTA ACAGACAGAG ATGCGTCCGT CAGCTCAAAT CGTCAACTTC TGTCAGAGGG GACGGTGTTG ACTCAGAGCC CCCGCCCTGC TATGTCCTGA CGCCTTCTCC AGACGCTTTC GATTCCGCGA TACCAATATC AATGTCCGGC TTCGTTCATT TTGACCATGG AGTGTCAGCC TCACCGGATA GCGTACCTTT AGGACGGTCT CCCAACCCAG ACAACATCTA TCACACTTTT GCAGCGTTGT TCCATGTACC GCAAAATTCA ACCTCGGCAT TTGAACAACG GTGGCAGAGC GCCATGTCGG ATGTTGCGAA AGAAGTAGAA GGAGAAGTAT TTGTGGAGGC CAAAGGACCG AGAGTCGGTG ATCAAATTGG CGAATGGTTT ATTTCTGTAA AGCACTTCTC CGCTCGAATT GTTACAATCA CGCTAATTCT CGGCACAGTA CGCTTCTTCC CCTCTTTCTC AACAACCAAA CTTAACTGGT CCGATTTCAT CTGACCGTTC TAAAATCTTT TCATCGTCGC CTCCCCCGAT AATCATCGTT CTTTACGTTA TACTTTTCAC CACGCTTATC GTTCAACTCT CCAACGCCTC CAAAGCCCAT TCCCGATTTG GTCTTGCTTT TACTGGCGTC GTGCAACTGT GCTGTTCATC GGTTATGTCT TTCAGTATCC TGGCCCTTCT AGGTTGGAAT GGGTGGGGTG TACCTCATGC GGAAAGTAGT TTGCCGACAT ACGTCTTACC TTTCGTCATT GTTGTAGTTG GTGCGGAGAA CATGTCGACC CTCGTGAGTG CTTGTTCCCT GCATCAGTGC ATCAAAAGAT TCCATTGACA CAACTTAGAC GCAAGCTATA TTCTCAATAC CCTTCACTCA TTCGGTTCCC GTAAGAATAG GGCTTGGTCT GAGCAAAGTT GGAACTACCA TTGCACTCAC TTCCCTAACG GATCTTGGTA TCCTGAGTGT TGTCTGGCTC TGTGTCAACC TTCAGCCTGT TAGAGAGTTT TGTCTTTTTG CGGCTGTTGT GATCGTGACC GATTGGTTCA TGCTTCACAC ATTCTTCTTG ACTGTAAGTC GAGATAAGAC ATAAAGATAA GGATGGTGCT AAATAGCGGG TCATAGGTCT TGTCTATCGA TGCGCAGAGA CTCGAGCTAG CGGATGTTTT GGCGTCTAAC AAGGTCGGCA TGGTTTCACC GGTGGAATCT ACGGGGGAAA AAGATGCTCA GAATGAAAAT CAAGGATTCC TATGGCGAAA TATGCTGAGG GCTAGGACAA CTAAAAGTGG AAGTTTGTTA CTAGTAAGCC CCTCAGTTTA AGTATTAAAC ATAGCTGATT GAGACATTTG ATATAGTTGC TTTTTACTGT GGGACTTCTA TATTGGTTGA CTGAACGTCA TCGTTCACCC CTCAACACCA CTGCTAGTCT TTACGGATAT ACACCTACTG CCCGATCTAC TTCTGTCAGT GTCCCCACTC CAACGGCCTC TCCTTTCGTT AGTACCCCTG AAGCAATTTC CGTGTTGTCG TCTGCGGAAA AACTTTGGCG CGCCCTCAAC CCAGAAGGAT GGCCGTTCGC ACATGTCATC GTCCCCCCCG CTTCGATACT TGTACTTCCC AAATTTGGGC ATTCTATGCG ACCAGGAGAC ATTCGCAAAC TCTCACTTCC AGCCAGCCGT CTCCTTATCC CTCGACTGAA AGCTCTTTTC TACATCTTCA AGGTGCTGGT TCTTCCACAG GCGATTACAG CTGGCGCCTT GTATGCGCTT TTGCTGTACC TGCTCAAAGA TGCTGATCTC CTTGACGCTC AACGGAACCG ACTAGGGAGG ATGGGTGACG CGCATGAGGA TGAGTCTGAC TTCCGGGGGT CGACAAAGAA TGCTAGCGGT TTGTTGAACT GTCTCCGAGC ACGAATGCTA CCGTGCAGCC ATGAAGCCGA TGTGGATGTC ATCGCATCGA GCTCTGACGG ACGCATTGCT ATATCAGTGG CCATTGATAA CTCCGTATGT CTTTGGAGGT TTTTAGATAC CCCTGGAAGT GGAACTCGCG AACCTCTGCC AACGGGTGGA CTAGATGGTG GAGACGCTAT TGTTGCAGCT GCTGTGAGCG AGGATGGCCA GCATGTTGCA GTGTGTACCA ACATGGCAGT GGTACAGATA TGGGAAGTGC CCAGAGAGGG TGCGGTTGTG CCCCTGCAAG TTCGGAAGCC TGAGCAAACC TTCACCGCGA AGATTTTGGG AATAGCGTTC GATGAAACTG CCCCCAATGT CGATGACCCT TTCACGGCTA ATGAACCGGC GACTGAGGAA GCGCGCAAGG CAACTTCTAT CATGATCGGA TATGGGGACG GTTCCGTTAT GGCCTTAACC GAGATGGACG CCAAGGTGGT CATTCCTGCA CAGGATAGGG GTGCCTATTC ATCTTGTCGA GTCCTTTTCA TGAGAGAAAT TGGCGGCTCC GTAAGCATCC TAATCGCTAG ACAACATGAC ATTGACATCT GGCGAAAATC TACATTCGGA TGGGTATCGT CCTCTCTTAT TGCTGATCTT CCCGGCGAAG ACCGTGTTAC TGCTCTTTCA CCCCTCAACG CCGAGTTGCC GGGTGTCTTT GCTGTCGGCC GTCGCTCTGG TAATATCACG ATCTATGATG AATCACACGG TCAGCTGGAA TTGATGCCGC AAGGTTCGTC TATAGAAGGT GTGCGCAAGA TCCAGCTTGT AAGGCCTTCA AGCATGAAGT GTTTAGGGTG TGGCCTTCAG TCTGCGGAGG GTTACGCAAT CATCTCGTCA ACTTCGTCCC AAGTGTCCAT CGATCGTATC GCGCCTCGTA CTTCAATCCC GACATTCTGC CGTTGTACAC GACGGGTATC GTCTGCCGAT GATGTCCCTG CTCTCATGCA TCGCTCGGAC TCGCCACAGC GTAGTAAACC CAATTCTCTT ATCGTCCCTC CTGTTACCTT TCGCCAGCGC CCTACACCTG GATCATCCCC TCATAAGTCA ATCTCTCTAC TCTCTCCTGT CTCAAACGGC GAATTTCCGC TTTCCTCCCA CGGCTCTGCT CGCCGATCGA GCAATTTTCA TAGAGAAGAT GATTCTCTCA AAATAATATC CTCTCCTCAC GATCGCTCTG CCTTGTCTAG CATAGGTGGA GGCAGCGGCC TGGTCTCACC AAGTGGTGAC ATGGAAGTAA CATCTCTGGG GGGCATCAGT GCTCAAGGCG CGAGCGATGG CGGCTGGGCG ATACTCGATG GGGATGTGCT AGTAGGTGTC AGGAGAGGGC GGGAAGGCAT CGACGATGCG CAATGGCAGG TCTGGAGCGT AGACATGACC GCACCATGGG ACATGGCCGG CTTGGTCGTG GATAGCATTG ACTTGTCAGA GCTTCAACGG CGAACATTCG AGGCTGATTG TACTATCCGA GGGCAGGCAT CTGGAGGCGG AAGCACCAAT GGAGTTGTGT CGATGCGTGA TAGAAGAACT GAAAGACTTT TGAGTCTAAA TGGGCGAGCG AGCTTTCCAG AGCGGGTGGG TTCATTCGCT GTGCCCACAT ATGAGTCGCT AGGCTATGTT GAAGTCGTGG GGCTGAACAT ACTGGGTACG AAAGGGATGA TAGGCGGGTT TGGGAATAGA TTGGGAACGA TTAGCCTTGA GAGATTTGAA GAAAAGAAAG TTATAGGGAG GAGTAGCATG GATGGCCAGC TGGGGATGGG GCTGGGAGGA CTGACGCCCA CTCGGAGGCA GTCATTGTTT CCCCTCACGC CCCCACCGCC CTGTGACGGT TCCATCGGAA GATTCTAGAG TGGATGTAGA ATCGGAGTGG AGGCAACGGA AGTATTAGAG CCAAGGAGCC GTTATTAGTT TAGGTCTTCT ATATATCCCT CAATGCAAGT CCGGGAGTAA ATACCCT
|
Protein sequence | MSFVSSETPN AIASSSTAHA GSANSRARHR LDNGLRYHSN HLSSDCRPLP RSAPRKQRVA SKSKKLRWRK GLVTEAFRKF GEHCARNQIR TLLIDCLVMT NLFYPSLALY LPKRFPPPSP RAVQDAHWPP LDNDVTSPYR TNVDHPLSFS TLGRLFPSAP PLLPRLTWAG WWGSGTRERE DDGWTATRAI PGAKEIGEGQ NDQVWVMRIG WADVEDVLDR HIEDGERKWE ERDHHLLKLV QNVAENWERQ IPSNRQRCVR QLKSSTSVRG DGVDSEPPPC YVLTPSPDAF DSAIPISMSG FVHFDHGVSA SPDSVPLGRS PNPDNIYHTF AALFHVPQNS TSAFEQRWQS AMSDVAKEVE GEVFVEAKGP RVGDQIGEWF ISYASSPLSQ QPNLTGPISS DRSKIFSSSP PPIIIVLYVI LFTTLIVQLS NASKAHSRFG LAFTGVVQLC CSSVMSFSIL ALLGWNGWGV PHAESSLPTY VLPFVIVVVG AENMSTLTQA IFSIPFTHSV PVRIGLGLSK VGTTIALTSL TDLGILSVVW LCVNLQPVRE FCLFAAVVIV TDWFMLHTFF LTVLSIDAQR LELADVLASN KVGMVSPVES TGEKDAQNEN QGFLWRNMLR ARTTKSGSLL LLLFTVGLLY WLTERHRSPL NTTASLYGYT PTARSTSVSV PTPTASPFVS TPEAISVLSS AEKLWRALNP EGWPFAHVIV PPASILVLPK FGHSMRPGDI RKLSLPASRL LIPRLKALFY IFKVLVLPQA ITAGALYALL LYLLKDADLL DAQRNRLGRM GDAHEDESDF RGSTKNASGL LNCLRARMLP CSHEADVDVI ASSSDGRIAI SVAIDNSVCL WRFLDTPGSG TREPLPTGGL DGGDAIVAAA VSEDGQHVAV CTNMAVVQIW EVPREGAVVP LQVRKPEQTF TAKILGIAFD ETAPNVDDPF TANEPATEEA RKATSIMIGY GDGSVMALTE MDAKVVIPAQ DRGAYSSCRV LFMREIGGSV SILIARQHDI DIWRKSTFGW VSSSLIADLP GEDRVTALSP LNAELPGVFA VGRRSGNITI YDESHGQLEL MPQGSSIEGV RKIQLVRPSS MKCLGCGLQS AEGYAIISST SSQVSIDRIA PRTSIPTFCR CTRRVSSADD VPALMHRSDS PQRSKPNSLI VPPVTFRQRP TPGSSPHKSI SLLSPVSNGE FPLSSHGSAR RSSNFHREDD SLKIISSPHD RSALSSIGGG SGLVSPSGDM EVTSLGGISA QGASDGGWAI LDGDVLVGVR RGREGIDDAQ WQVWSVDMTA PWDMAGLVVD SIDLSELQRR TFEADCTIRG QASGGGSTNG VVSMRDRRTE RLLSLNGRAS FPERVGSFAV PTYESLGYVE VVGLNILGTK GMIGGFGNRL GTISLERFEE KKVIGRSSMD GQLGMGLGGL TPTRRQSLFP LTPPPPCDGS IGRF
|
| |