Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE00470 |
Symbol | |
ID | 3257672 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | + |
Start bp | 122984 |
End bp | 125945 |
Gene Length | 2962 bp |
Protein Length | 641 aa |
Translation table | |
GC content | 50% |
IMG OID | 638256632 |
Product | conserved hypothetical protein |
Protein accession | XP_571110 |
Protein GI | 58267908 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.177859 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAGTCCCTC CAAGGCGATC CAAGACATCC CAGGAACTGA GCTATTTTGC TCTGTGTAGC ATGAAGGGGG GCACACACAT GCGCAAGGTC GATCGATAAG GAGGGAGCCC ATCGGTCCGC CGGATTCATC ACACAAACCG TTCTGCTCTT GTCGTGTGAG CGTATGATGT GAGTACAGAA ATGGTTATTT ATCACCACTT GTTAGTACTG AATGATATCA GCTTTTCAAT TCCAAACGTG AAGAGATTTG CCGAGCATAT CTTTTAAGCA AAGTAGTAGA CAACCAGCTT GTGCATCTTT TCGTCGAACA GTTGCATCTT TAAAAGCTTG TTTTCAATGA CATTTGGCTT TTATTTGACC TGCTTGGAAC TATTTTGTCC CGCTACTACG ATCAAAGGAT GAAAACTCTC GTAGTTGAGT CAAGTGTGAA CAATGTCAGG AGTCATGGGA ACTTCGGAAC ATCAGCTCGT GCGTTCTGGT TCTTGGGGCA TTGTTTCCAA GCTACGGGGT GGTATTAGTT TTTTAATACC CAAGAAATAA GCTCGTAATA ATTCCATACA GCGGAAGGCT ATCAACGCAC GTTCCATATC CCCGATATAC TTCTTCGATC GCACCGAATA CCCGCGTCTA CTGCACGGCC AAGTGATACA ACGCCAGTTA ACTTCTGGCC CTGTTAGCCT GGTCTCGCGA CAAACGGTCG TTATCAAAGT ACGCCATTCG AATTGCCACC CGATAATGGC CTACCATTGT AAGTGGCTCA CGCGAATGTC GGAACAGAAC ACACGAGCGC AAGACCACAA ACACCATTTG ATACATATAC TGTCTACTAC TTCACACCAG CCGGAGAAGG GTTTGGTTCT GCTCTGGAAA TGATACCCGC ACCTCCGTAC TCGACTGAAA ACTCTATTCC ATCTCCTACA GTTCCATATA TCTCGCTTGG TCGTCCAGCA AGGTACGGGT TCAACCTACT ACTATACCAA CCATAGCTTA TTGACTGAAT AGTACCACTC CTCGATCACG AGCGATTCTT TCCACCTTTC CAAACCACAC TACCATGCCC CCACCATCCT CCATTCCCAA CGCCAACAGC CGAAAGCGCA AAATCCTCCC AATGACGCGC TCGCAAGACA GCACTCCTTC CTCTCCTGCA TTAAGCGAAT CGTCATCTTT GGTATCGGAT GATGGCGACG GAGAATATGC CCCTGAGTCT GTTCGTCCCA TTAGGAAGGC GAAAAGGCCT CGTGACAACG ATTTCAAGTT GAACTCCGGT GGACGTTCGG GCGCGACGAC AAACAGTCAT GGCAATAGCA ATGGAGCGAC AGGAAAGAAT TGTAAAGTGA AGGGCAAAGA CATGTCCCGC GAGCAGTTGC GCAAGGTGAA CCACTCGTTG ATCGAGAGAA GGCGGAGGGA GAAGATCAAC GCGGCACTGA ATGAGCTGAG AAGGATGGTA CCGAGCTTGG GAGAGAACGG TGGAAAAGGT GGCGAGTTCA AACTTGAAGT AAGCATTGAA AATGAAATTA AAGTGTAACG GGCTGACAGT TTGATAGGTA TTGGAGAAGA CTGTAGAGCA CATGAAGGAC TTGAAGGGAC GACTGGAAGA CTTGGAACGA GGGGCTGCGG CATCAGCCAA CAACTCTTCC TGCGAATCAA ACGCTCGCGG CAAGGATAGG GAAACGGAAC TCGAAGTGGA GAGCCGAAGT CGAAGCAAGA CATCGACTTA CCCTTCTCCC TCTCCCGATA GACAACAATT TTCAAACTCA CCCCCACCTG ATCCGAACGA AACAGATGTA GAGTCCAACT TGCCACCTCC TTACACGCTC GCTAGCCGAG CTCGCGCTCG TTCTCGTGCC CATGCGTCCT CTTTGCCGTC TACTTCCGCT TGCATTTCCG CCTCCAATCG TGGCACAACG ACGAGTCAAG AATCCAAATC CCCGTCTTTC TGGTCTGGCC AAACACAAGA ACAGGTCACA GAAAATTTGC ATGGGCAAAG AGGGTACCAG CCACTCCCCA GTACCCGGCC GAAGCCCCCA ACCAGCACTT CTAACCCCAT ATTTCTTCCT TTCCCTTCAC CCTCTCCAAC CTCGCCCTTT CTCCATCCCA ATGCCAGTTT CAACACCAAT CCTAACGCAG ATACGCTGAA CAACTCTTCC GCGGCGGGAT CAATGATGTC TGGAGAGAAT GAGGGCCGCG GTTTTGGTCC CAACTACAAC TCTTCAGCCT CCGTCAATGG CAGCGTGCAA GGTGCTGCAG AGGCTCGCAA TACTCATCCA AGCCCGTTTT TGCCACCTAT ACCTAATATG AGCTTGTTCA GTATAATGAG CCTTGAGAAT TCACCAGTGG ATACGTTTCG ACAGGCTTGT ATGGAAGGAT TCGGAGGTGC GGGAAAAAGC GGTTCATTTT CACCGCCAGA GTTGAATCTT GAGGATACTT CTCAAACGGC GCGCCGTAGC TCATTTGCAG AGCCAAGAGG TGCGCTGGAC GTAGGAATGA GTACTATGAA CACAGACGTA AAGGAACGTC GGCATGATAC GAACGATAAG TCCACATCCA TTGACAGTGA CAAAAATAGA AATAAGCATC AAGACGACGT CACTACCAAG GCTGGCGCTA ATACCAATGC CAACACTGAG ATGCTACCCG AAGAAGCTGC GAACTTGCTC CTAGCATTTT CGTCTCCTGA GACATTACGC CCATTGGGTG ACGTGCCGGT CGTCCCCCTT GCTGGACAAG GGTATGGACA GGGGCAAGGG CAAATCAGAA GGACGGTGGA AGAATTTAGT CTTGATTCAG GAGCGGCACT TGGATCTGAA TCTGGATCGG GTTTGGGAGC AGTAAGAGCA ATAAAAACTG GGAAGGGAAG GGAAGTTTCA TCGGTGAAGA TGATGGGATC GACATCGAAA GATCGCCTGG TTGTGGGGAA GAGTGTAAGG GACATGCTCA AGTTGACATG AATGACGGCG GA
|
Protein sequence | MAYHCKWLTR MSEQNTRAQD HKHHLIHILS TTSHQPEKGL RKILPMTRSQ DSTPSSPALS ESSSLVSDDG DGEYAPESVR PIRKAKRPRD NDFKLNSGGR SGATTNSHGN SNGATGKNCK VKGKDMSREQ LRKVNHSLIE RRRREKINAA LNELRRMVPS LGENGGKGGE FKLEVLEKTV EHMKDLKGRL EDLERGAAAS ANNSSCESNA RGKDRETELE VESRSRSKTS TYPSPSPDRQ QFSNSPPPDP NETDVESNLP PPYTLASRAR ARSRAHASSL PSTSACISAS NRGTTTSQES KSPSFWSGQT QEQVTENLHG QRGYQPLPST RPKPPTSTSN PIFLPFPSPS PTSPFLHPNA SFNTNPNADT LNNSSAAGSM MSGENEGRGF GPNYNSSASV NGSVQGAAEA RNTHPSPFLP PIPNMSLFSI MSLENSPVDT FRQACMEGFG GAGKSGSFSP PELNLEDTSQ TARRSSFAEP RGALDVGMST MNTDVKERRH DTNDKSTSID SDKNRNKHQD DVTTKAGANT NANTEMLPEE AANLLLAFSS PETLRPLGDV PVVPLAGQGY GQGQGQIRRT VEEFSLDSGA ALGSESGSGL GAVRAIKTGK GREVSSVKMM GSTSKDRLVV GKSVRDMLKL T
|
| |