Gene CNN00200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00200 
Symbol 
ID3255349 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp84012 
End bp86149 
Gene Length2138 bp 
Protein Length607 aa 
Translation table 
GC content49% 
IMG OID638254435 
Productnucleolus protein, putative 
Protein accessionXP_568528 
Protein GI58262236 
COG category[A] RNA processing and modification
[D] Cell cycle control, cell division, chromosome partitioning
[K] Transcription 
COG ID[COG5147] Myb superfamily proteins, including transcription factors and mRNA splicing factors 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAAGGCA TCGAGCTCGG CATCCAGGAC CTCGACCACA GCTATCAGAT TGACACCCAA 
GCCCTGTCAT CGGACGATGA AGAAGATCGT CAACAAGAGC CCGAACGACC AACCAAGCGA
AAACGCACTA AAGAAGAAAA GGAGAGGAGA AGGTCTGAGA AGAGAGCGAG AAAGGAGAAA
AGAAGGTCGG GAGCCGCCCA GACTGCACAG GTTCAGGCTC AAGCACAAGC TGGAATTGAA
GCAGAAGTGG CCGAAGCACC CACCGACTAT GATGAAGTTG AACAAGTAGT GGAGGATGCT
CCTGTGAGCG AGGGAGACAA GAAGCGAAAG GAGAAGAAGC ACAAGAAAGA CAAAAGCAAG
GGCAAAGGTA AAGGAAAGAA GAAGGAAGGG AGTGAAGAGC AAGAAGAAGA AGACAGGGAG
GCTATAGCTG CATCAGCTGT AGCTACCCTC GCCCAGGCAT TGGTGAGCTC GGAAAGTGGC
AAGGTTGCCC CGTCTGTGGA TAAAGTTCAA ACTCCAAGTC GCCCGACTGC TGCCACTTCT
ACCATCGCTA CCTCACGAAT AGGTCCTACC CAGTACGGCA GCATCAAAAT CAAGAAAGGC
CCCTTGGAAT CATCACCAGC CCCACCCGCA TCGTCACTCA CGCCCGCCCT TCCCGCGACC
CAGCTCATCC CTGTGACTCC ATTCACGGCC GCCTCGACCG ATGCGAGTTC ATTTGCTGTT
AGGGACAAGA TCAATTCTCT CAAGCACCCA AAGTCTTCTA GTAATGCCTC CAAAGCTATA
AGAAGCACGA GGAAGGGCGA GGATACCAAG GAAAGTGATG CGCAGCTTCG ATTGAGGTTC
CAGGACCCCA AGGCACAGGA AGAGTGGTTG GCTAGTACAT CAATTGGGAA GACTGAGCTT
CTGAGATTGG AAAAGGAGGG TAGTAAGTGT AGTTTCTAGT TTAAAATATA CAGATGGCTA
ATTCCACGTT CTAGTTCTGT CGTACAAGAA GGGAAAGTTC ACTGAGGACG AGAAGGTTTC
AATCAAAAAG GCTTTGGAGA ATTATCAAAA GATACATCGA ATAAGCTCTT TCGATCTTGT
TGAGCTCGTC ATGACCAAAA CACTTCAAGC CACGGATAAA GAAACTGTCC GTGAATTTTG
GAAAGATATC GGTATGACCA TTTATTTCAT TCTGTAAAAC ACTATTGATT GTGAAACAGC
CGCTTCTGTC CCCGGTCGCC CGATCCTCAA CGTCCAACCA TTCGTGCGAC GAATGCTCGA
CCCTAAAGCT CATAAAGGCC GCTGGACCCC GGAAGAAGAC GAACTCCTCC TTCGCGCATA
CGCACAACAC CCTCGCGAAT GGACCAAAAT CTCCTCCATC GTTGACCGTA CCGAGGTGGA
TTGTAGGGAT CGTTATTTGA AGGAACTCGT GAATCGTGAT ACCCGAACAG CGGGTAGGTG
GACAAAAGAT GAGGAGGACA AGTTGGAAGA GGTGGTGAAC AGGGTTGCGA AGGGATTGCG
TGCGGAACAG GTGCATGGGG AGAAGAGGAA GGGTCTGGAA GAAGGAGCAG AGCTGGTGGA
ACCATCGGAC GTCCCTTGGG ATATTGTTTC GAAAGAGATG GGCAACACAC GATCAATGAC
ACAGTGTCGT ATCAAGTATC GCGATGCCAT CTGGCCCAGA AAACTGGGTT TGGGTAAAGA
TGATCATGTC GGAAGGACGT TGAAGGTCCT CACAAGGTAT TTTTTTCTCT CGTTCTTTTT
TTCAAGCTTG TTCATTCCTG ATGTGACCAC TTAGACTCAA AAACTTGAAC TATGAGTCCG
AGAAGCACAT CTCTTGGTCA CAAGTCCGTG AAACCCTCGA GAAATACTCC CTCAAGGAAA
TCAGAAATTC GTATACCAAT CTCAAAAAGA GTGTAATGAG CGATCCCCAT GTTGCCAGTC
TCAATTACCC CGGTTTGTCA AACGTCCCAT ACTTTCCCAG ACGCGAAGAC GACTTGAACT
GATGATGCTT GCTGGATCCA GAATTGATCA ATGTCATGTA CGATAAAGCG GTCATGCAAA
GGGGGAGGAA AGTGAGGGCG GATCAGAGGG ATTATCCGAG TAAGGAGACG GTGGAGTCGG
GGGATGAAGC GTATTAACGA GGGTGCGCCA AGGAAGAT
 
Protein sequence
MEGIELGIQD LDHSYQIDTQ ALSSDDEEDR QQEPERPTKR KRTKEEKERR RSEKRARKEK 
RRSGAAQTAQ VQAQAQAGIE AEVAEAPTDY DEVEQVVEDA PVSEGDKKRK EKKHKKDKSK
GKGKGKKKEG SEEQEEEDRE AIAASAVATL AQALVSSESG KVAPSVDKVQ TPSRPTAATS
TIATSRIGPT QYGSIKIKKG PLESSPAPPA SSLTPALPAT QLIPVTPFTA ASTDASSFAV
RDKINSLKHP KSSSNASKAI RSTRKGEDTK ESDAQLRLRF QDPKAQEEWL ASTSIGKTEL
LRLEKEGILS YKKGKFTEDE KVSIKKALEN YQKIHRISSF DLVELVMTKT LQATDKETVR
EFWKDIAASV PGRPILNVQP FVRRMLDPKA HKGRWTPEED ELLLRAYAQH PREWTKISSI
VDRTEVDCRD RYLKELVNRD TRTAGRWTKD EEDKLEEVVN RVAKGLRAEQ VHGEKRKGLE
EGAELVEPSD VPWDIVSKEM GNTRSMTQCR IKYRDAIWPR KLGLGKDDHV GRTLKVLTRL
KNLNYESEKH ISWSQVRETL EKYSLKEIRN SYTNLKKSVM SDPHVASLNY PGLSNVPYFP
RREDDLN