Gene CNB01050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB01050 
Symbol 
ID3255832 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp321100 
End bp323894 
Gene Length2795 bp 
Protein Length710 aa 
Translation table 
GC content50% 
IMG OID638254756 
Producthypothetical protein 
Protein accessionXP_569111 
Protein GI58263402 
COG category[B] Chromatin structure and dynamics
[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG5406] Nucleosome binding factor SPN, SPT16 subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.769238 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AAGTATCTAG CGAGGGCCGT CAGCGATAAG CTCCCTTTCA AAAATTTTCT TTCAGAGCCA 
GGTCAGTTCA TTTCCCCTTG AAAATTTTTT CTCCGTTTCC ACAATGATGC CCCGAAAGGG
AATTACCGCC TGAGCACATC TTCCTTCTGC GGCGTAGCAA TACGTCCAGA GGTTGGTGGC
ATGCTGTTAA AACATGGCTG TTTTATTTTT ACTGATCATT CCATTCTCCG TTACGATTTG
GATTCCACAT GACCACCGAG CTAACTCACT GCTGTTAGGT CTCTTGACCG ATCGAAAATT
TTTAGTTTCT CTCTCTTGGG GTATCCATTT TTATTTAGAG TAAGTGTTAT TTTCTATCGC
TCTACCCCAT CATACTCTAG TCTTTTGTAA TAAGTGATGA TCACAACACG ACGGTCTTCT
TACCCAAAAA ACCATGGTGT AAACGCCCGA AACCCATTTC TCCGGGCACC CAATCTGACT
CTACCATTCA CTTCTGATTT TGCGTTTAGA ATTGTGTAGA GCTTTGTGCT GACTATTGTT
TTCAGCCCTC ATAATGGCTC CACGTCAACC TGTCGCTGGC CCTTCGAAGC CGAAGCACTC
GCAGCAGCCT GCAATCCAGT CACCTCCGCC TCCTGCAGCA GTCTCGGCAT TCAACTCCAC
TCGCACCTTG TTTGCTCTAG CTTCCCCTGT TTTAGGGCAA GCGGACAAGG TTCAAGTGTG
GGATGTTGCG GGTGACCGTG TAATATCCGA ATGGGAAGTA GCTGGTGCTA GTAAGGCTTC
CTCTGTTTGC TTTACTACAG TCCCATCGGA CGCCGCTAGT AAGAAAAAGA AGCGGAGAAA
GTCTGGCTCT GGTCATGGCG CTGAGGAAGA GGTAGTCCTC GTGACAACGT CAAAGAGTCA
ACTTTTGGTT TTGTCTACTA AGCAAACCGA ACCTTTGCGA ACTCTTGATC TCCCAGCTCC
TGTAACTGCG GCCTGGTCTG AAGAACGTGC GACAATACTG GCAACTGCTT CCTCTCTTCT
TGTTTTATCG GCGGATGCCT CTAGTATCTC TCACACTTTC ACTCTTCCAT CCTCCCTCTC
ATCTCCTACT GCCCTCACAA TCCTTCCGAC ATCGACCGCT GAGTCATTGC ATGTACTCGT
TGCCTCCTCG CTCGTCGTCA CTCTTCACCT GGGATTGGCG TCTCAGGAGA TCACTTTCGT
CTCATCACCG CTTCCTGCAT CTACCTCTTC TATCTCTTCG CTCCTCCCTC TTCCTCTTAC
AGAGCAAGGT GCTTCATTCC TTGTCGTGTC GGAAGATGAC CGCACAATAT CTCAATACAC
CCTCACGTCA CCCCAGTCCT CCGCGAGACT TTCATACCGC TACGCCTCGC CTACACTATC
TTCTGCTCAC TCAATTTCGG CTGATCCAGA TCTTCTCGCC GTATTGCACG AGTCTGGAGA
GATTTCTCTT TTCCGCCTTC CGTCTGAGCT TGATCTTTCC CGCCCCCAGT CAGATGCGAA
GCCGAGCACT GTAAAGATTG TTGAAGGGAA GGAAGAACGT ACAGCCAGAT TGTGTCGCGT
TGCTTTTGCC CCTGTAGATG ATGGAGCATC GGGTGCATTG CTTTGTGGAA GATTGACAGG
AGGAGGTCGT GTCAAATGGT CGCGTGCAAT CTTCGAGCTT CCGGAGGGCG GTTTGAGACC
CGTTACGGTT GTTAAAGTCG AGGCTCAAGA ATTAGTTGGA GGCTCATCAG CGTCAGAGGT
CAGTACGATG TCACTTCTTG TGTGAACCGG TTGCTGATCT TCCTTGCAGA GTGTTCCTGT
GCAACGTTTT GTCGCGCCCA ATACGGTGAA TGAAGCTGCT CCCGACGATA TCGACGAGGC
CCCTGTCTCT CAGTTGCCTT CTGATGTTAA TATGGCGGAA CTTTCGCTCG GCGAACGCAT
GCTTGCACCA GCTTCTCAAG AAGCCGACAG CGGAAACAAA CGCGCAGCCA CTTCAGCCGG
CGTTACCCTC GACGGTCCTG TGAATGCAGC TTCCCTCACT CGTGTTCTCG TCCAAGCTCT
CCATACATCG GACCCAGCTC TTCTAACACT GTGTCTGTCC CACCGCAACC CAGTTCTTAT
TCGCAACACT ATCAGAAAAA TGCCTCCTCA ATTGGCTCTT CCTCTTCTGA AAGCTTGTGT
GGAGCGACTG GGCCAAGGCA AGGGCGCCAA CAAACGTGGT GGAGGCCGTG GTGCTGCGCA
GAACGAACAG CAAGGTCGCG GCACTGTTGA ATGGGTAAAG GGAGTGCTTG TCGAACGCGG
CTCCATCCTC ATGACTATAC CTTCTTTACC TGTCCATCTT GCTTCTTTGT CTCAGCTGCT
TCAAAATCGA CTGGAGCTCA ACCAGCCTTT GCAAAGCCTT TCTGGCCGTT TAGATCTTGC
TCTTGCCCAG ATTACCATGC GTCGCATCGC TGCTGAGCAG GCTCTGGAGA ATGCCAAGAA
CGGTGGACAG AAGGGCGGCG AAGGTGAGAT ATATGTCGAG GGTGAAAGCG AAGATGAAGA
TGAGGATTTC ATCGAAGTTG GCGAAGATGG AGGTGAAATT GAGGACATTG ATATGGGTGG
ATTGAGTGAG AGCGACGAAA GCGAAGAGGA CGAGGAGGAG GATGAAGAAT CAGATGATGA
TCCCCTTGAT TCCGGCTCGG ACAATGATCT ACTAGATCTG CAAGCTGAAG AAGAAAGCGG
AAGTGACGAC GAGGACGAAA GTGAAGAGGA GGATTAGTAG CATCATTGTA GTATCTTCAA
TACATTTGCA TGTCACATTA GCATTTCCAA TCAGA
 
Protein sequence
MAPRQPVAGP SKPKHSQQPA IQSPPPPAAV SAFNSTRTLF ALASPVLGQA DKVQVWDVAG 
DRVISEWEVA GASKASSVCF TTVPSDAASK KKKRRKSGSG HGAEEEVVLV TTSKSQLLVL
STKQTEPLRT LDLPAPVTAA WSEERATILA TASSLLVLSA DASSISHTFT LPSSLSSPTA
LTILPTSTAE SLHVLVASSL VVTLHLGLAS QEITFVSSPL PASTSSISSL LPLPLTEQGA
SFLVVSEDDR TISQYTLTSP QSSARLSYRY ASPTLSSAHS ISADPDLLAV LHESGEISLF
RLPSELDLSR PQSDAKPSTV KIVEGKEERT ARLCRVAFAP VDDGASGALL CGRLTGGGRV
KWSRAIFELP EGGLRPVTVV KVEAQELVGG SSASESVPVQ RFVAPNTVNE AAPDDIDEAP
VSQLPSDVNM AELSLGERML APASQEADSG NKRAATSAGV TLDGPVNAAS LTRVLVQALH
TSDPALLTLC LSHRNPVLIR NTIRKMPPQL ALPLLKACVE RLGQGKGANK RGGGRGAAQN
EQQGRGTVEW VKGVLVERGS ILMTIPSLPV HLASLSQLLQ NRLELNQPLQ SLSGRLDLAL
AQITMRRIAA EQALENAKNG GQKGGEGEIY VEGESEDEDE DFIEVGEDGG EIEDIDMGGL
SESDESEEDE EEDEESDDDP LDSGSDNDLL DLQAEEESGS DDEDESEEED