Gene CNC01040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC01040 
Symbol 
ID3256194 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp297399 
End bp300625 
Gene Length3227 bp 
Protein Length985 aa 
Translation table 
GC content53% 
IMG OID638255323 
Productchromatin modification-related protein, putative 
Protein accessionXP_569964 
Protein GI58265616 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.541862 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GAAATGAGGC ATGTCCGGGC GATGGAGCGT ATTGAGGCCA AAAAGGCGGA GAACAGATGG 
TCCCTTCGAC AGCCAAAAAA GGCAAGGGGT CCTGGCGTAC CAAAGTCTCA TTGGGACTAT
ATGCTTGAGG AAATGGAATG GATGCGCACG GATTTTGCTG AAGAGCGTCG GTGGAAGGTC
GTCGAGGCGA GGGAGTTCGC ATATCAAGTC GTAGAATGGC ATCTGGCGAG TCCTGAGGAA
AAGAAGGCTC TTATGGTGGG AGGTCGAGGC TGGGGAGAGT GCCGCAATGT TCCGATACCA
GGTCATGCGG GAAAGAGGAA GGAAGTCACT GTGGAAGTGG AAGCTGAGGA CGAGGATGTT
GAGATGCTGG TGGGACAGGA AGGAGAGCTG GATGGGGAAG GAGAGGCGAA CAAGGTGTTG
GAGTCAATAG ATGAAATGAG GGTTAATGAA AAGGAGAGGG AGAACCGACC AGAGGATCCT
AGAGAAACTG TCAATATAAA TCAGGACATT GGGGAAGAAG TCGATGCAGA AGGCGAAGCA
GATGCCGACG GTGAACCAGA AAATGGAGAA GCGGATGCAG AAGGAGAGGC AGATGCGGAT
GGTGGACCTG TGGGCGACGA CGTTGTAGGG CTTTCCGGTA AATATTATTC TACGTTCGCT
GCTCAGTCAA TGCTGATAAC AATCAGAAAT CGACGCTGCC CAAGATGATA CTAGGGAAAC
TTCTGAACGG CCAAGCTATC GGCGTGATAC TGTTCTACCA AACGGCCTTG TCATCCATAA
GCGGTTCGCC AATGCGTACG AGATTGCAAT TGCTCGGGGG CCCGTCCTTG ACACCCCTTT
GGCGAACGCT ACCGTTGATC TTGATACTCT GACAAAATCC TCATCTGCTG CAACTCCAGC
TACAGTCCCT GCCGAACCTT CTGTTAGCCC AGACGAACCT GCTTCCTTTG ACCAACTCTT
TCCGGATCTC GCTATGTACT CTGGCCCCGC CCCGCCTGAG AATGACAAGA AGTATCGCCG
AGATGAGGGC GGTACATACA GTCACCGCAT GGCACACACT TCTCGAATCA TGGACATTCG
GCCCATTCTT GTTTCCACCC TTCAGCCAGC GAAGAATCTA ATTGACGGCG AGTGGGATCT
TCATGATGGA CCTTATTATG AAGAGGTAAA GGGAGCTGCA GATATCCCTC CAAATGTAGT
TGCTGCTTTT AATACGCCCT TTGGCGGCAA AGCGTCAAGA CCCTTGGAGC ACATGCGAGT
GCCTGAAGTT CCCAAACCAG CTGCGCACCA TCTTCGTGCA CAATTGCTTT GGTCGCCAGA
GGAAGACAAG TGCTTGTTAA AGCTCGTCGC CATGTATCCA TTCAACTGGG ATCTCATAGC
AGACAGTTTC AATACGGAGA TGATCCTCAT TCCTGTCGAG AAACGAAACC CGTATGAATG
CTGGGAGCGA TGGTACTATA CTTTTGGGGA AGGAAAGAAC AAGCCTCGGC AGGACGCGCC
GCCCTCGGCT CCTCCACCTG CGCCTGCTTC TGCCACGCAA CCTGGTACAG CAACCGCTAC
GCCCGTGCCA CAATCAGCTG TTACCACCCC TGGAGTCCCT CCATCCGCCA ATCTCCCATC
TGCTTCGGGA CGTCCACAGC AAACTGGCGG TAACAGTGTG TCTTCTCTTC CTACTCCTAC
CGGAGAAGCG CTGCCTGACG GTGCACCTCC ACCCCCCGGG ATGTCCAAGA GAGATAGAAT
GGCGGCGAAG CCAAAGTACG AAGGGACAAA GAGGTCAGTC AGACACCAAG CGATCTATGA
TGCGGTGAAG AGGATGAACA GGAGGAGAGA AGCGGCGAGG GCGAAGAGCC GTAAGTTTGA
ACGCTAAGGA TGAGACAACA ACTGACAAGT GGGTTAGATA AAGACAATGC ACAACGAAAG
GTAATCAATG TGCACGAAAG TCACAGTATG AGCTTCCCCC ATGTTGCTGC TTCCACCCCA
TGGGAACTAG TTGAAGCTAA ATATCAGCGC GATGTGCAAA TTGCTCAGCA GCGACAGCAA
CGTGCCATGC AAGAACAGCA ACGTCAACTT GCCATTCGTC AGCAGCAAGC CATGATGAGC
GCTCAGCAAC AAGCACAAAT GAGGCCGCCA AACATGCCCA ACGTCCCGAA TATGCCGAAT
GCTCAACCTA TCCGCATGGG TCCCAATGGC CAGCCAATGC CCACTATGGC GCCAAGCCAA
CAACAGCTAT TAAATGCTGT TGCCGCTGCG ACAGCTGCCA ATAGGCAAAA TGCAAACGGC
GCTGTCCAAG GTAACCCTAA TGTTCGTCCA ATGCCTGTTG TTCAGGGGCA GTCACCTCAG
GTGCAGCAAC AAATGCTTCT TCAAGCACAG CAAATGGCCG CCCAGCAAGC ACGAGTGTTA
CAAGCACAGG CACAAGCTCA GGCTCAACAA GGCCGAGCAC CCAGCATGGG TGGAAATCTA
CAGCCACCGC AGCTGGGTGT CTCATCTCCT TTCGCCCAAT CTCGTACTCC CGATCTTCCC
GCGGAGGGCG CTGGCCCTTC TGGTATCAAC CCCACTCCGT CTCCGGCCAT GCAAGCAGCG
GCCATTGGGG CTCAATCCTC TCCTCAAATA GCAACGATGG GTCGGGCACC GTCCAATAAC
GTACCTCCCC ATCTTCGAGT ACCCAATGCA GGTACCTCTT CACCTCAGAT ATCTAGCCCG
ATGGCCTTGC CTCAAGGAAT ACCAAATGGA GCTGGGATGC CGGTACAGGG AGCCCAAACT
CAAGGGATTC AAATTCCGGC AGCAATGATG AACAATGCGA CTGTGCAGCA GCTTCTGGCG
ACACTAGCGG CCAGTGGTCA GCAAATGACA CCGGAGCAAT TACGAGGGTT AATGCTACGA
TCGGTGAGCT AATTTGCTAC TTATATTCGC ATGGCTAACA TTTCCCTAGG CCCACATGCA
AGCTCAGGCA CAAAGCCAGG TCGGCAACCC TGGGACACCT CAGATGGGGG TGCAAAACAT
CCAAGGCGTC GTGAGTTCCC TTGATAGATC GCTGTCTACC TCCAAATTTT TGCTGACCAA
AGGGGAAGCA ACATTTCGCG AGATCACCTA GTTTGCAAAA CGCCCAGTCC CAGCCTCGTT
CAAGCCCCAA ACCAGGTCCT GCCAATGGTC AAGGGACATA AGCATTAGGA TTTGACGATG
TTTTGAGTTA TTTTTTTTGG AAATTTTAGT TCAACTACGA ATGCGAC
 
Protein sequence
MRHVRAMERI EAKKAENRWS LRQPKKARGP GVPKSHWDYM LEEMEWMRTD FAEERRWKVV 
EAREFAYQVV EWHLASPEEK KALMVGGRGW GECRNVPIPG HAGKRKEVTV EVEAEDEDVE
MLVGQEGELD GEGEANKVLE SIDEMRVNEK ERENRPEDPR ETVNINQDIG EEVDAEGEAD
ADGEPENGEA DAEGEADADG GPVGDDVVGL SEIDAAQDDT RETSERPSYR RDTVLPNGLV
IHKRFANAYE IAIARGPVLD TPLANATVDL DTLTKSSSAA TPATVPAEPS VSPDEPASFD
QLFPDLAMYS GPAPPENDKK YRRDEGGTYS HRMAHTSRIM DIRPILVSTL QPAKNLIDGE
WDLHDGPYYE EVKGAADIPP NVVAAFNTPF GGKASRPLEH MRVPEVPKPA AHHLRAQLLW
SPEEDKCLLK LVAMYPFNWD LIADSFNTEM ILIPVEKRNP YECWERWYYT FGEGKNKPRQ
DAPPSAPPPA PASATQPGTA TATPVPQSAV TTPGVPPSAN LPSASGRPQQ TGGNSVSSLP
TPTGEALPDG APPPPGMSKR DRMAAKPKYE GTKRSVRHQA IYDAVKRMNR RREAARAKSH
KDNAQRKVIN VHESHSMSFP HVAASTPWEL VEAKYQRDVQ IAQQRQQRAM QEQQRQLAIR
QQQAMMSAQQ QAQMRPPNMP NVPNMPNAQP IRMGPNGQPM PTMAPSQQQL LNAVAAATAA
NRQNANGAVQ GNPNVRPMPV VQGQSPQVQQ QMLLQAQQMA AQQARVLQAQ AQAQAQQGRA
PSMGGNLQPP QLGVSSPFAQ SRTPDLPAEG AGPSGINPTP SPAMQAAAIG AQSSPQIATM
GRAPSNNVPP HLRVPNAGTS SPQISSPMAL PQGIPNGAGM PVQGAQTQGI QIPAAMMNNA
TVQQLLATLA ASGQQMTPEQ LRGLMLRSAH MQAQAQSQVG NPGTPQMGVQ NIQGVQHFAR
SPSLQNAQSQ PRSSPKPGPA NGQGT