Gene CNG04000 details

Gene Information

Locus tag: CNG04000
Symbol:
ID: 3258806
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 1130528
End bp: 1132729
Gene Length: 2202 bp
Protein Length: 498 aa
Translation table:
GC content: 51%
IMG OID: 638258024
Product: hypothetical protein
Protein accession: XP_572128
Protein GI: 58269944
COG category: [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
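
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (which is consistent with the reported gene length) and that the protein is full-length and terminated by one stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1130528
END_BP = 1132729
PROTEIN_LENGTH_AA = 498


def span_length(start: int, end: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    return end - start + 1


def coding_length(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_length_aa + (1 if include_stop else 0))


gene_length = span_length(START_BP, END_BP)
cds_length = coding_length(PROTEIN_LENGTH_AA)
noncoding = gene_length - cds_length

print(gene_length)  # 2202, matching the reported gene length
print(cds_length)   # 1497
print(noncoding)    # 705
```

Under these assumptions, about 705 bp of the gene span does not encode protein. That fits a spliced gene, where the remainder would be introns and any untranslated flanking sequence.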


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value: 0.0856641
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
CCGGGAAACA CCAGCATGGA AGCCGACGAG CCGCCCAGGT ACCCAGATAT CCAGGGGTAC 
TATCTCAGCA GAGAAATAGG CGGTGGAGGG TTCTCCAAGT AAGCCCCCCT CTCCGTCGAG
GCGCCCCTGT CCTAATGATC TCAGGGTGTA CCGAGCTGTG CACAGGCAAA CAAGAGGGCT
TGCGGCATGT AAAGTGATCA ACCTTTACGT CAACCCCGCC TGGGGCCTTG GCACCCCCAA
CATCAAGGAG CTTCAAAAGG AAGTCCAGGT ACACAAGGCC CTCAAAAACC CGTATATCCT
CGAGTTCCTG CACCACGAAA CAATAAAAAT CGATAATCCC GATCATTATA CGCCGGGACT
GTATATGCTT CTCGAGCTGG CTGTTGGTGG AGATTTGTTT GACAAAATTG GTACGTATAA
TGCTTTGTCA GGGTTGACGA ACCTGACGGT CGTTATCAGC ACCGGACGTT GGCGTGCCCG
AAGATCTTGC CAAGTTTTAC TTTGCCCAGA TGGTCGCTGG CGTGGTAAGT TTTCTTGTAA
GGATACTCTA GCATCAACCA AATTAACTGC TCTGCTAGGA ATTCATCCAC GCCAAAGGTA
TCGCCCACCG AGACCTTAAA CCCGAAAACC TCCTTCTTGC AGCGAACGGT AACCTCAAAA
TCACCGACTT TGGTCTCTGT GCCGTATTCA GACACAAGGG CAAGACGAGA TTACTGAGCG
GGAGATGCGG AAGCCTGCCT TATGTTGCAC CCGAAGTAAG TCGTGAATCA CTCGGTCAAG
GCGTCTGTAA GACAAGCTTA TAGAGCAAGA AAAGCTGACA AGACGAATAG CTTGGTGTGC
CAGCAGGACA AGGGTATGCG GCAGAACCGG TTGACATTTG GGGAATGGGT GTTGTTCTCT
ACACCCTCCT CGTAGGTAGT ACGTCCATCG CACTTGATTC ATCGTCTTGA TTCCCAAGTT
GATGCTGAAC CGGATATAGA TACGCCATGG GATGAACCTT CGGAAAGTAG TCCTGAATTC
TGCGCGTACA GAACGGGCGA GTTGTTCAAT TATGATCCAT GGACAAGAAT ACGCGGTCAG
GCATTGGGTA ATGTTCTTCC ATCCAGATTG CCTCTAAAAG GTATCTCGTG AACTAACCAT
CTCCATGAAA ATAGATATTC TAAAGGGCTT GTTGTGTATC GACCCCCAGC AAAGATTAAC
GATCCCTGGT ATCAAACACC ACCCATGGTG TATGACGTGC GTCTCTCTAC CCTACTCTCT
TCGGTCCAGC CTCAAAATAA TAACTGCACT TGTGCTGATG AAATAAATCT GACTCTGTAC
TTTTCCCAAA CCAGACAAAG TCAACTGCGT CGCGAGCAGC TTTCTGAAGC TCTTACCCAA
GGTATCCGCC AAGAGGGTAT GCTGGCGTAT GCGGACCCCG TTTTCCGTTC AGGCGAATCT
CAAGCCTACG CCGCTTCGTA CGTCCCACCC GCCCAACCCC CCCTCAGAAG CAAATATGTG
ACTGACATTG CTCTTGTTTG GATGATACAG ACGCGGAGCA AGGTTGGCCG GCGATACCGA
TTGGCTTAAC AAGGAAGAAT CGCAATTCAT GCGCGGGACC GGAAACATTG TACGTTTCCT
ACATCACTCT CTTGTTCTCC TTTTCCTTTC TGCGATAGCA AGCTTACCAC CTCTTAACCA
CTTGCACAGA CGCAATCCGG TGATCTGATG ACCATCACCA CCCGGTTCTG GATGGCTCTC
CCGCCCAACG AAGCATTCCA GATCCTCGCC GCTTTCTTGC AGTCTGATGT CGCTGATGCC
AACGGTTCCG TCCGTGCTGT CGCCGCACCA TCAGCACCAG GAGACTCGTT CCCTGGTTCT
CACGGAACAG CAGGCGGAGG CGGCGGCGGC GGCGGCGGCG GCGGAGGGGG TGTGATCAAA
CTTCACAAGG CGGCGGGACA TCAGTATGTG GAGGGCAAAG TACTTGTCTT GCCTAGCGAT
AGTTTGGCGG CCGAGGGCCA GAGTCTGGTG ATCATGCAGA GGGCCAAGGG GTCGATCTTG
CATTGGAGGG CATTGTGGTG GAGTGTGGTG AGAGCGAATG AGCTGCAGGA TTATGTTATT
CGAGGAGACA TGTGAAAAAA GACATCAGCA GGTAATGAGT ATTTCAGTGA AAAGCAGGAC
AGGGCCAGAG TCATGGGATA GATATATGGG AGTATTAATC AC
 
Protein sequence
MEADEPPRYP DIQGYYLSRE IGGGGFSKVY RAVHRQTRGL AACKVINLYV NPAWGLGTPN 
IKELQKEVQV HKALKNPYIL EFLHHETIKI DNPDHYTPGL YMLLELAVGG DLFDKIAPDV
GVPEDLAKFY FAQMVAGVEF IHAKGIAHRD LKPENLLLAA NGNLKITDFG LCAVFRHKGK
TRLLSGRCGS LPYVAPELGV PAGQGYAAEP VDIWGMGVVL YTLLVGNTPW DEPSESSPEF
CAYRTGELFN YDPWTRIRGQ ALDILKGLLC IDPQQRLTIP GIKHHPWCMT QSQLRREQLS
EALTQGIRQE GMLAYADPVF RSGESQAYAA SRGARLAGDT DWLNKEESQF MRGTGNITQS
GDLMTITTRF WMALPPNEAF QILAAFLQSD VADANGSVRA VAAPSAPGDS FPGSHGTAGG
GGGGGGGGGG GVIKLHKAAG HQYVEGKVLV LPSDSLAAEG QSLVIMQRAK GSILHWRALW
WSVVRANELQ DYVIRGDM
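
The sequences above are printed in space-separated blocks of ten residues, so they need to be cleaned before any length or composition check. The following is a minimal sketch of that cleanup together with a GC-percentage calculation. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the DNA input is a toy string rather than the gene sequence itself.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C among all bases in a cleaned DNA sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Last line of the protein sequence above, used as a small demonstration.
protein_tail = clean_sequence("WSVVRANELQ DYVIRGDM")
print(len(protein_tail))  # 18

# Toy DNA input (not taken from the record): 6 of 12 bases are G or C.
toy = clean_sequence("GGCC AATT\nGCAT")
print(gc_percent(toy))  # 50.0
```

Pasting the full gene and protein blocks into `clean_sequence` should give lengths that can be compared against the reported 2202 bp and 498 aa, and `gc_percent` on the cleaned gene sequence can be compared against the reported GC content of 51%.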