Gene CNF03820 details

Gene Information

Locus tag: CNF03820
Symbol:
ID: 3258385
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 1118415
End bp: 1121031
Gene Length: 2617 bp
Protein Length: 755 aa
Translation table:
GC content: 51%
IMG OID: 638257501
Product: hypothetical protein
Protein accession: XP_571340
Protein GI: 58268368
COG category: [K] Transcription
COG ID: [COG5665] CCR4-NOT transcriptional regulation complex, NOT5 subunit
TIGRFAM ID:
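
The Gene Length value agrees with the Start bp and End bp values if the coordinates are read as 1-based and inclusive. That reading is an assumption, since the record does not state its coordinate convention. A minimal Python sketch of the arithmetic, where `gene_length` is a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


length = gene_length(1118415, 1121031)
print(length)  # 2617
```

Under a 0-based, half-open convention the same coordinates would give 2616, which does not match the recorded 2617 bp.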


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GATCTTTTGC TTGTTTTTTT TCTTCGGTAC ACAGCACAAG CAGCGACAGA CTATGGCGTT 
AAGAAAGCTG CAAGGTGAGT AAACTGACCG ATACAATGGA CCAGCATGCT GAATACCAGT
CAGCGGAAAT CGACCGCACC CTCAAATCGG TCGCCACGGG TGTGGAGGTG TTTGAAGCTA
CCTTTGACAA ACTCAACTAT GCCACCAACA CAACGCAGAA GGACAAGCTT GAAAATGATC
TCAAGACCCA AATCAAAAAG CTGCAGCGTA TGCGAGATCA GATCAAAGCT TGGCTCGGTA
ACGGCGACAT CAAAGACAAG ACGGCGTTGC TTGAGAACAG GCGGTTGATC GAAACCCAGA
TGGAAAGATT CAAGGCACTT GAAAAAGAAA CCAAGATGAA GGCTTTCTCC AAGGAAGGGT
TGATCGCTCA GTCTAAGCTG GATCCTGCGG AGAAGGCCAA GCGAGATATG ATTGATTGGA
TCGGGTCAAC TACCGACGAA TTATCAAGAC AAATCGAGCA GACGGAGGCG GAAGTCGAAG
CTCTCCAGGT AGGCAAGAAG AAGAAGCAGG CCGGCGAGAG GCTGGATGAG CTTGAAGAGT
TAAATGAGAG AAGAGAATGG CATATTGGAA GGCTGGAGGT TGTCCAGAGA ATGTTGGAAA
ACGGACAACT AACAGTTGGA GATGTTGAGG ACATCCAAGA AGATGTCAAA TACTTCGTTG
AAGCCAACAT GGTATGTCAT ATCTTCATCT CCCCTAGCTC TCGCCGACCG CTTTATCACA
GGAAGAAGAC TTTGACTTTG ACAACGGTAT CTACGACGAA CTCAATCTTC AAGATGAGGA
AGACTTCCAC GACTATCTCC ACGAACATCC GTCCGCAACC GATGAGCTTG AACCCGAACC
CGAGCCCGTC GTCCCCACGC AGGCACCAAA AACCCCTGCC AAGGAAAAAG AAGACAAGAA
AGCCACCCCT CACAGGTTGT CTAAATCTGA ATCTCGAGAA AAGGAGGAGG AAATCCCTCC
CAGTCCTGTG GTGGTGAAGA AGACGCCCTC AAGAAAGACG ACCTTGGATA AGGACAAGAA
AGACAAGAAG GACAAAGAGA AGGAAGACAA GGAGCGTAAG GAGCGCGAAC GGGACGAAAA
AGAGCGGGAA GAAAGGTCTT CAAGCAGTCA ACCTACCCCT TCCAAACCTG CTCCCCTTCC
TCCCATCAAG TATGCTGCCG CTGCGGCCGC CGCCGTTGGG GGTACCGTTT CTGCTTCTGC
GCCATCTCAA ACAACTCCCT CCGCAACCTC AGAAGGTTCT AGGGACGACA TCGTTGCTTC
TCCGGATGAA ATTGCTGCCG CCCCTACCTC TACGTCCTTC CCACCACCAC CTCCCGGACT
TTCACGTTCA CCTTCCCAGG CCGCTACCCC CCAAACCTCG TCCCCTGGGA CACCGGGCCT
TCAATCCCAT GCGCCAAGCA CTAGCACGGG AGCAGCTGAC ATGGCTACTC CCAGTCAAGC
ACCTGGAAGC GCTGTCCCTC CTCCAGGCTA TCCTGCGCCA CCGTCTTCGT CTTCTGCCGA
GTCTTCTCGT GCTGCTGTTC AAGCTCAAGT TGCCGCTGAA CAAGCCCAGG CCCAGGCCCA
GGCTCAGGCT CAGGCTCAAG CCCAAGCACT CGTCCAGGCA CAAGCTCAAG CACAGGCCCA
AAGCCAGCTC TCTGCTGCTC AAATCCAAGC CCAGTTGCAG GCTGCTCAAG AAGTTCAAGG
TCAAGCTGGA GTCATGGGTA ACTTGATGCA AAGCTTTGAA GTTGCTAAAG AGATTTGTGA
GTATGAATGT TTTGTATGTG GGTGCGGATG GGGTTGTGTA TTTCATTACC CCCCTGGATG
GGGGAAGGGT AAAGGCAAAG GGAGAAGAAA ATGCCCTGGC CCTCATCCGG GCAGCACGAA
GATGGATTGG GAAAGGAGGG CAAGGGAGGG TGAGAGGGGG AGATCCAGTC TGTGTTTGTG
AATAGTTTGC TAATGTGGCT GTAGCCAAGC GACGCTCAGA CGACACGAAC GAGTTACATG
CCGCTTTGGA AGATAGTTTT GCGAACGCTC CGCAACAAAT GGACGCTGAG CCGTAAGCCC
ATCACTTGAT TGTTCTCGAC GTACTTGCTG ACATTCTTCA ACCAGTCCTC GCTATTACCA
CCCTCAAAAC CCCATCAAGA CCCCATCATA CTATCCTCAG TCTCGGCTTC CTATACTCGA
AGACAAATCT ATCTATTCTC GTCTCGAACT TGACCAGTTA TTCTACATCT TCTACTACAT
GACTGGGACA TACGAACAAT GGCTCGCCGC GAGGGAACTG AAAAAGCAAA GTTGGAGATT
CCACAAGCAA TATTTGACTT GGTTCCAGCG AGCGCACAAC CCGCAAGCCA TAACAAGTGA
TTATGAGCAA GGAGGATATT ACTATTTCGA TTGGGAGAAC AGTTGGTGTC AGAGAAGAAA
GAGTGATTTC AGGTTTGAAG TGAGTTTTGT TTTTCTGCGA TGTAATGTGA CATTTAGCTG
ACGCGGCGAT ACAGTACCGA TGGTTATCAG ATCACTAGTT TTCTTCTTTG ACGCGAAGCG
ATAGGATGGA ATTGGGAATG CATATTGTAG TTTTCTG
 
Protein sequence
MALRKLQAEI DRTLKSVATG VEVFEATFDK LNYATNTTQK DKLENDLKTQ IKKLQRMRDQ 
IKAWLGNGDI KDKTALLENR RLIETQMERF KALEKETKMK AFSKEGLIAQ SKLDPAEKAK
RDMIDWIGST TDELSRQIEQ TEAEVEALQV GKKKKQAGER LDELEELNER REWHIGRLEV
VQRMLENGQL TVGDVEDIQE DVKYFVEANM EEDFDFDNGI YDELNLQDEE DFHDYLHEHP
SATDELEPEP EPVVPTQAPK TPAKEKEDKK ATPHRLSKSE SREKEEEIPP SPVVVKKTPS
RKTTLDKDKK DKKDKEKEDK ERKERERDEK EREERSSSSQ PTPSKPAPLP PIKYAAAAAA
AVGGTVSASA PSQTTPSATS EGSRDDIVAS PDEIAAAPTS TSFPPPPPGL SRSPSQAATP
QTSSPGTPGL QSHAPSTSTG AADMATPSQA PGSAVPPPGY PAPPSSSSAE SSRAAVQAQV
AAEQAQAQAQ AQAQAQAQAL VQAQAQAQAQ SQLSAAQIQA QLQAAQEVQG QAGVMGNLMQ
SFEVAKEICE YECFVCGCGW GCVFHYPPGW GKGKGKGRRK CPGPHPGSTK MDWERRAREG
ERGRSSLSKR RSDDTNELHA ALEDSFANAP QQMDAEPPRY YHPQNPIKTP SYYPQSRLPI
LEDKSIYSRL ELDQLFYIFY YMTGTYEQWL AARELKKQSW RFHKQYLTWF QRAHNPQAIT
SDYEQGGYYY FDWENSWCQR RKSDFRFEYR WLSDH
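
Both sequences above are printed in space-separated blocks of ten residues, so the whitespace has to be removed before lengths or base composition can be checked against the Gene Length, Protein Length and GC content fields. The following Python sketch shows one way to do that. The helper names `join_blocks` and `gc_fraction` are hypothetical, and the demonstration uses only the first line of the gene sequence rather than the whole record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C among the bases of a nucleotide sequence."""
    if not dna:
        raise ValueError("empty sequence")
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence, used only as a short demonstration.
first_line = "GATCTTTTGC TTGTTTTTTT TCTTCGGTAC ACAGCACAAG CAGCGACAGA CTATGGCGTT"
dna = join_blocks(first_line)
print(len(dna))  # 60
print(round(gc_fraction(dna), 2))
```

Applied to the complete gene and protein sequences, `len(join_blocks(...))` should be comparable with the recorded 2617 bp and 755 aa, and `gc_fraction` with the recorded 51% GC content, assuming that value is computed over the same gene sequence.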