Gene CNE00470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE00470 
Symbol 
ID3257672 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp122984 
End bp125945 
Gene Length2962 bp 
Protein Length641 aa 
Translation table 
GC content50% 
IMG OID638256632 
Productconserved hypothetical protein 
Protein accessionXP_571110 
Protein GI58267908 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.177859 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAAGTCCCTC CAAGGCGATC CAAGACATCC CAGGAACTGA GCTATTTTGC TCTGTGTAGC 
ATGAAGGGGG GCACACACAT GCGCAAGGTC GATCGATAAG GAGGGAGCCC ATCGGTCCGC
CGGATTCATC ACACAAACCG TTCTGCTCTT GTCGTGTGAG CGTATGATGT GAGTACAGAA
ATGGTTATTT ATCACCACTT GTTAGTACTG AATGATATCA GCTTTTCAAT TCCAAACGTG
AAGAGATTTG CCGAGCATAT CTTTTAAGCA AAGTAGTAGA CAACCAGCTT GTGCATCTTT
TCGTCGAACA GTTGCATCTT TAAAAGCTTG TTTTCAATGA CATTTGGCTT TTATTTGACC
TGCTTGGAAC TATTTTGTCC CGCTACTACG ATCAAAGGAT GAAAACTCTC GTAGTTGAGT
CAAGTGTGAA CAATGTCAGG AGTCATGGGA ACTTCGGAAC ATCAGCTCGT GCGTTCTGGT
TCTTGGGGCA TTGTTTCCAA GCTACGGGGT GGTATTAGTT TTTTAATACC CAAGAAATAA
GCTCGTAATA ATTCCATACA GCGGAAGGCT ATCAACGCAC GTTCCATATC CCCGATATAC
TTCTTCGATC GCACCGAATA CCCGCGTCTA CTGCACGGCC AAGTGATACA ACGCCAGTTA
ACTTCTGGCC CTGTTAGCCT GGTCTCGCGA CAAACGGTCG TTATCAAAGT ACGCCATTCG
AATTGCCACC CGATAATGGC CTACCATTGT AAGTGGCTCA CGCGAATGTC GGAACAGAAC
ACACGAGCGC AAGACCACAA ACACCATTTG ATACATATAC TGTCTACTAC TTCACACCAG
CCGGAGAAGG GTTTGGTTCT GCTCTGGAAA TGATACCCGC ACCTCCGTAC TCGACTGAAA
ACTCTATTCC ATCTCCTACA GTTCCATATA TCTCGCTTGG TCGTCCAGCA AGGTACGGGT
TCAACCTACT ACTATACCAA CCATAGCTTA TTGACTGAAT AGTACCACTC CTCGATCACG
AGCGATTCTT TCCACCTTTC CAAACCACAC TACCATGCCC CCACCATCCT CCATTCCCAA
CGCCAACAGC CGAAAGCGCA AAATCCTCCC AATGACGCGC TCGCAAGACA GCACTCCTTC
CTCTCCTGCA TTAAGCGAAT CGTCATCTTT GGTATCGGAT GATGGCGACG GAGAATATGC
CCCTGAGTCT GTTCGTCCCA TTAGGAAGGC GAAAAGGCCT CGTGACAACG ATTTCAAGTT
GAACTCCGGT GGACGTTCGG GCGCGACGAC AAACAGTCAT GGCAATAGCA ATGGAGCGAC
AGGAAAGAAT TGTAAAGTGA AGGGCAAAGA CATGTCCCGC GAGCAGTTGC GCAAGGTGAA
CCACTCGTTG ATCGAGAGAA GGCGGAGGGA GAAGATCAAC GCGGCACTGA ATGAGCTGAG
AAGGATGGTA CCGAGCTTGG GAGAGAACGG TGGAAAAGGT GGCGAGTTCA AACTTGAAGT
AAGCATTGAA AATGAAATTA AAGTGTAACG GGCTGACAGT TTGATAGGTA TTGGAGAAGA
CTGTAGAGCA CATGAAGGAC TTGAAGGGAC GACTGGAAGA CTTGGAACGA GGGGCTGCGG
CATCAGCCAA CAACTCTTCC TGCGAATCAA ACGCTCGCGG CAAGGATAGG GAAACGGAAC
TCGAAGTGGA GAGCCGAAGT CGAAGCAAGA CATCGACTTA CCCTTCTCCC TCTCCCGATA
GACAACAATT TTCAAACTCA CCCCCACCTG ATCCGAACGA AACAGATGTA GAGTCCAACT
TGCCACCTCC TTACACGCTC GCTAGCCGAG CTCGCGCTCG TTCTCGTGCC CATGCGTCCT
CTTTGCCGTC TACTTCCGCT TGCATTTCCG CCTCCAATCG TGGCACAACG ACGAGTCAAG
AATCCAAATC CCCGTCTTTC TGGTCTGGCC AAACACAAGA ACAGGTCACA GAAAATTTGC
ATGGGCAAAG AGGGTACCAG CCACTCCCCA GTACCCGGCC GAAGCCCCCA ACCAGCACTT
CTAACCCCAT ATTTCTTCCT TTCCCTTCAC CCTCTCCAAC CTCGCCCTTT CTCCATCCCA
ATGCCAGTTT CAACACCAAT CCTAACGCAG ATACGCTGAA CAACTCTTCC GCGGCGGGAT
CAATGATGTC TGGAGAGAAT GAGGGCCGCG GTTTTGGTCC CAACTACAAC TCTTCAGCCT
CCGTCAATGG CAGCGTGCAA GGTGCTGCAG AGGCTCGCAA TACTCATCCA AGCCCGTTTT
TGCCACCTAT ACCTAATATG AGCTTGTTCA GTATAATGAG CCTTGAGAAT TCACCAGTGG
ATACGTTTCG ACAGGCTTGT ATGGAAGGAT TCGGAGGTGC GGGAAAAAGC GGTTCATTTT
CACCGCCAGA GTTGAATCTT GAGGATACTT CTCAAACGGC GCGCCGTAGC TCATTTGCAG
AGCCAAGAGG TGCGCTGGAC GTAGGAATGA GTACTATGAA CACAGACGTA AAGGAACGTC
GGCATGATAC GAACGATAAG TCCACATCCA TTGACAGTGA CAAAAATAGA AATAAGCATC
AAGACGACGT CACTACCAAG GCTGGCGCTA ATACCAATGC CAACACTGAG ATGCTACCCG
AAGAAGCTGC GAACTTGCTC CTAGCATTTT CGTCTCCTGA GACATTACGC CCATTGGGTG
ACGTGCCGGT CGTCCCCCTT GCTGGACAAG GGTATGGACA GGGGCAAGGG CAAATCAGAA
GGACGGTGGA AGAATTTAGT CTTGATTCAG GAGCGGCACT TGGATCTGAA TCTGGATCGG
GTTTGGGAGC AGTAAGAGCA ATAAAAACTG GGAAGGGAAG GGAAGTTTCA TCGGTGAAGA
TGATGGGATC GACATCGAAA GATCGCCTGG TTGTGGGGAA GAGTGTAAGG GACATGCTCA
AGTTGACATG AATGACGGCG GA
 
Protein sequence
MAYHCKWLTR MSEQNTRAQD HKHHLIHILS TTSHQPEKGL RKILPMTRSQ DSTPSSPALS 
ESSSLVSDDG DGEYAPESVR PIRKAKRPRD NDFKLNSGGR SGATTNSHGN SNGATGKNCK
VKGKDMSREQ LRKVNHSLIE RRRREKINAA LNELRRMVPS LGENGGKGGE FKLEVLEKTV
EHMKDLKGRL EDLERGAAAS ANNSSCESNA RGKDRETELE VESRSRSKTS TYPSPSPDRQ
QFSNSPPPDP NETDVESNLP PPYTLASRAR ARSRAHASSL PSTSACISAS NRGTTTSQES
KSPSFWSGQT QEQVTENLHG QRGYQPLPST RPKPPTSTSN PIFLPFPSPS PTSPFLHPNA
SFNTNPNADT LNNSSAAGSM MSGENEGRGF GPNYNSSASV NGSVQGAAEA RNTHPSPFLP
PIPNMSLFSI MSLENSPVDT FRQACMEGFG GAGKSGSFSP PELNLEDTSQ TARRSSFAEP
RGALDVGMST MNTDVKERRH DTNDKSTSID SDKNRNKHQD DVTTKAGANT NANTEMLPEE
AANLLLAFSS PETLRPLGDV PVVPLAGQGY GQGQGQIRRT VEEFSLDSGA ALGSESGSGL
GAVRAIKTGK GREVSSVKMM GSTSKDRLVV GKSVRDMLKL T