Gene CNN01940 details


Gene Information

Locus tag: CNN01940
Symbol:
ID: 3255384
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006683
Strand:
Start bp: 561719
End bp: 563999
Gene Length: 2281 bp
Protein Length: 480 aa
Translation table:
GC content: 51%
IMG OID: 638254613
Product: conserved hypothetical protein
Protein accession: XP_568675
Protein GI: 58262530
COG category: [R] General function prediction only
COG ID: [COG0724] RNA-binding proteins (RRM domain)
TIGRFAM ID:
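
The reported gene length is consistent with inclusive start and end coordinates: 563999 - 561719 + 1 = 2281 bp. A minimal Python sketch of that check follows; the helper name `span_length` is hypothetical and not part of the record, and the inclusive-coordinate convention is an assumption inferred from the arithmetic.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates copied from the record above.
gene_length = span_length(561719, 563999)
print(gene_length)  # 2281

# A 480 aa protein needs 480 * 3 = 1440 coding bases (stop codon excluded),
# fewer than the 2281 bp span, which fits a gene flagged as spliced.
coding_bases = 480 * 3
print(gene_length - coding_bases)  # 841
```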


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACTTTACAAA AATGACCGAA GACATTTACA AGGACGATCT CTACGGCGGT GGGTTCTGAC 
TGCATTCCTC TACGTCGTGA GTCATGAACT AACAGCGACA GATCTCGACC TCGAGGATTT
GGATGCTTCT CAACTTGAGG AGCTTGTCGA GCCTCCTGAG CTGGATACTG CACCAACCTC
GACTCCTGCT GCATCCAACG CTGCTGCGGC CCCTTCTCAA CCCGCAATCA CCGCATCATC
GTCAACTTCC GCCCCCGTTG AACCCGGATC ACAACAATAC GACCATCAAC CATCTTATGA
CGCCGCCACG CCTTATCAGC AAGGCCAACA GGACAACAAC TTTGGCCAGC AGCAAGACGG
ACAGGATAGA ATCAAGCCCA GTGATATGCC TGATGAGGGG TTAGTAGATA TTCTGTTCAC
CTTTTCTACA TTATCATTCG TCGTGGAGTG TGCTGGATGA AAGGAGGCCG CGCAGGTTTT
TGTCGCGGCG GAAAAGCGCT TGTAGGATCC TTTAAAAGGC TCTTCAATCT TAGGATTTAC
CAGCATCAGC TAGGCCGTAG AATTAGGCCA TAGGTCACAA TAGGACTGTA AAACCCGTAC
CAGTAACGTA CCCCTTTAGC CTTGTCTTAC TTTTCCTTTT CCCTGTCTAT AGCCTTAGAT
GAGCTGAACT AATCTGTACA CAACATACAT TACCCATACC TTTCGATAAC CCACATATCG
TCTATTTACT GGATGCCATG CACCCCCTCT CGACCCGTCG CCCTTCACCT TTTTTTTTTG
CCTGAACGTA CTATGTTTTT AACCGTACAG GAAGATGTTC ATTGGCGGTC TCAACTGGGA
AACCACTGAA GGTCAGTTGC GACCACTGTC CTTTCCCCCT AAAACTGACA TTGCGACAGC
GGGTCTTTCT GAATACATGG GGCAGTTTGG TGAAATTGAT GCTTGCACCA TTATGCGTGA
TCCTTCCGGT CGTTCAAGAG GTTTTGCATT TTTGACTTAT AGAGACCCCG CCAGTGTCAC
CAAAGTGATG GCGCAGACTC ATCATCTCGA CGGTAAGCAA GTGAGTACTT CCTTCCAACT
CACTTGAATA TCTACCAACG GTGCATACAG ATCGATCCCA AACGCGCCAT CCCCCGCGCC
GAGCATGAGC GTACCGCCAA AGTCTTTGTT GGCGGTCTCG CTCCGTCCGT CACAGGTGAA
TCCCTCAAAT CTTTCCTCTG TCAATTTGGT CAGGTGATGG ATGCTACTGT TATGTTCGAT
AAGGAGACTG GCAGATCTAA GGGGTTTGCA TTTGCTACGT TCCAGGATGA AGAGTCTGTA
GGCAGAGCGA TGGCTGCTAG CGGTGTTGAG CTTGAGGGCA AGCAGGTTAG TCATACTGTA
TTTCTACGCC AGATCCGTTG GTTAAGCAGA AAATGTAGAT TGAGATCAAG AAAGCTCAGC
CAAGAGGTAC TGCTCAGGGA TCCAAATTTG GAGGTAACAT GAATCCCCGC TTTAACCAAG
GCACGGGATT CAGTGGTGGT ATGGGTAGTT TCGGCGGTGG CTTCGACCCC AGTTCGGTGG
CGATGATGTA TCAGAACATG ATGAAAACCG GAGGTAAGGG GAAGACCCTT TTTGACATAT
CCCAGCCACT GACATTGATA AAAATCCTCA GGCAATATGA TGGGCGGCTT CGACCCTAGC
GCCATGGCAA TGATGTACCA AAATATGATG AAATCCATGG GCAACGCTCC TGCCATTAAT
CCCAGTCTTG CTATGCGCAA CAATGCTGGC GGGACCACTG CCGGTGCTGC TGCAGGGGGT
GCTATGCCGA TGGGCATGGG TATGGGTGCC ATGGGGGGTA TGGGGGGTAT GGGAGGCATG
GGCGGTATGG GCGGTATGGG GATGGGTGGT ATGGGAATGG GCGGAATGGG TATGGGCGGA
ATGGGCGGAA TGGGTGGAAT GGGTATGGGA GGCGGGATGA ACCGCGTGAG TCTTTACTTA
CTCTGTGCGG CTACTTACAA ACATACTGAT GGCCGTAACT TCCTAGATGG GCAACACTCG
ACAAATTCCC AACGCTCCCC GCGGCCCTGC GGCGATGCGC GGACCAGGAC AACAGCCCAT
GGGCGGCGCT GGAAATGCTC CCCAAGGTGG TGGACCCGGA GCGCAGAGAT ATTCGACGCA
AGGGAACGCG AGGGCAAGAC CATATTAAGA TTGCGGATAG TTTAAGGAGA GGCCCACAGC
CCGCATTGAT CAAAGAAATG GAGACAGTAG GGGTGTTGTA GGCTTTTAAG CAAAATGCAC
C
 
Protein sequence
MTEDIYKDDL YGDLDLEDLD ASQLEELVEP PELDTAPTST PAASNAAAAP SQPAITASSS 
TSAPVEPGSQ QYDHQPSYDA ATPYQQGQQD NNFGQQQDGQ DRIKPSDMPD EGKMFIGGLN
WETTEAGLSE YMGQFGEIDA CTIMRDPSGR SRGFAFLTYR DPASVTKVMA QTHHLDGKQI
DPKRAIPRAE HERTAKVFVG GLAPSVTGES LKSFLCQFGQ VMDATVMFDK ETGRSKGFAF
ATFQDEESVG RAMAASGVEL EGKQIEIKKA QPRGTAQGSK FGGNMNPRFN QGTGFSGGMG
SFGGGFDPSS VAMMYQNMMK TGGNMMGGFD PSAMAMMYQN MMKSMGNAPA INPSLAMRNN
AGGTTAGAAA GGAMPMGMGM GAMGGMGGMG GMGGMGGMGM GGMGMGGMGM GGMGGMGGMG
MGGGMNRMGN TRQIPNAPRG PAAMRGPGQQ PMGGAGNAPQ GGGPGAQRYS TQGNARARPY
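
Both sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal Python sketch of one way to do that; `clean_sequence` and `gc_fraction` are hypothetical helper names, not part of the record or of any particular library.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    # str.split() with no arguments drops spaces and line breaks alike.
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First displayed line of the gene sequence, copied from the record.
first_line = "ACTTTACAAA AATGACCGAA GACATTTACA AGGACGATCT CTACGGCGGT GGGTTCTGAC "
print(len(clean_sequence(first_line)))  # 60
```

Passing the full gene sequence text to `clean_sequence` and taking its length is one way to compare the displayed sequence against the stated 2281 bp, and `gc_fraction` can be compared against the stated GC content in the same manner.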