Gene CNF03310 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF03310 
Symbol 
ID3258186 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp982179 
End bp985007 
Gene Length2829 bp 
Protein Length708 aa 
Translation table 
GC content50% 
IMG OID638257449 
Productconserved hypothetical protein 
Protein accessionXP_571608 
Protein GI58268904 
COG category[R] General function prediction only 
COG ID[COG0724] RNA-binding proteins (RRM domain) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.472473 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGTTGATTAC CCTCCCCACC TCTACACCTC ACCATGTCCC GCAGGCTAGA CCTCTCCAAA 
TTCGCAGGCT CAGACTCGGA AGGCGAAGAA GATGTCCCCA AAGCAAGTCC ACGTATCTTT
TTTAATGTGC TAGTCGTCGC CAACACTGAC CGTGGATTTG TCACAGCATC AACTGGAATT
GACTGCACTC AAGAGCAAGA CCTTTTCTCA AGGTATCACC AAAAAAACAA AGCGTGATCT
GGAAAAGGAG GCAGAGGAAC GTAAGCGGAT TGAAGAGGAA AAGTACACTG CAAGCCATAC
CTTGAATCCG CACCGTCCAC TGACATGGCC TACCTCAGAG CGGCAGCACT GGTAATGGCA
GAGATTGAAA GAGAGTTTGA AGGATCAGCT GAGGGAAGCT CGTCCGGTCA CACGGGTATG
AGGGGTTTTC GGGGAAGACC CATTGGGCCT GGGGGAGGGT TTGTAAGGGC CGGAGGCGCG
CCGATGGGAG CGCCAACGTC TTTTGCGCCC CCTAGAGGTC CAGCCGCTAT GGGCTACGGT
AGAGTAAGTC TAGATGTTAC TGACCAAGTT AACTGGGCTG ACATATCGCA GCCAAAACCC
ATGCCTGTCA GAGCGCCATC ACCGCCTCCT CCACCCAGTG GACCGAAGCC AAGGGGCAAG
AGAGCGATGG ATTCATTCTT GGAAGAAATC AAGCAGTATG TCATCCTTTC AAAAAGCGAT
CCCTCTTCTC CTCTTTATCA ATACTGATGG CAAATATAGT AACCAGAACG CCCGCGAACA
AAAATTCAGT CAGATCGCAA AGAAGGAGGG ATCTTCAGTA ACTGCCCTTG CAGGTCTGTA
CCGGCAATCT GCTATTGATG AAGACTGTGA ACCATCTTGA ACTGATCGAT TCATGTAAAA
CTAGCTTGGG AAACTGGCGG AAGTGGCTTC GACCATGAGG TACGTCTCAT ATTGTCTTTT
TTTTTTGGTT AGATATTTAG CTAATCCTGG AGTAGAGCAC CAATTTATTC ATTGTGGGTG
TTCCAACATG AAGGCCAGCG AATGTACAAT ATCTAACCTT TTTGAGCAGT CAAATTTGCC
TCAAGAGATC ACGGAAGAGA TTCTCGGTTT GCACTTTGCA AAGCAGGGAC CGGTGGCGAC
TGTCAAAATC ATGTGGCGTG AGTTTCTATA TTATTATTTT CTGTTTTTGT ATATGTCAAG
GAGTTGACGT TGTCTGTAGC CCGAGGCGAC GAAGCATTCA GCCAGGCAAG CGCCCGACGG
GGTCTCACCG GTTTCGTGTC GTACATGGAA CGGAAAGATG CTGAACGGGC TGTGAAAGAG
CTGGACGGGT CGGAATGGAT GGGAAATTCG ATTCGAGTAG GATGGAGCAA GCCTGTTGCA
AAGCCCTTGA AAGCTCTTTT CGGTGAGTCA TTTTTTCATA TCTATCACAG AAAAAGAAAA
GGAAAAAAGG AAAAAGCCTG AACAGCAATG TAGATATCAC AAGTGACAGT CACAAGCGAA
GAAGGTCAAG ATCACGATCA AGGGGCCGCA GCCCTCCTAG AAAGAAATCG CATCGCGCCC
GTTCATACTC GTACTCCTCG TCATCTTCTT ACTCTCGTTC CCCTTCGCCC GAACGGACTT
GCAAACAAAA GTGGCTGGAT AGTATCCCTG AAGAACATGG GAGGTTTATC AAGACTGTTG
CGAACCGCGT AAAGGAACAT GGTAAAGGAT TTGAAGATGT TTTGATGGAA AAGGAAAGGG
AGAACCCAAA GTTTGCCTTT TTGTATGACG ACAAGGCATG TGGCTTTTCT TTTTTTTTCT
TTCCCGAGTA TTGTTGACCA GAGGTACAGC TTCCCGACTA CCACCTTTAC CAATCCACAC
TTTCATCACA TTACCGCATT CCGTCTCCAC CCCCCGAAGC ATTCAACGAC GACGGTTACG
CCTCCATCTA CTCTTCCGAC TCGGCCGAGG ATTCCGAAAG GGAGCGAACG TCCAAGGGTA
AACTTGGCCG ACTTGCGAAA AGGCGGTTTG AGGCCATGTT ACGGGTGATG ACTGGGAAAA
GGGCGGAGAT TGCGCGGGGG ATGGAGTTTG CGCTCCGACG AGCCGAAGCG GCGGATGAGA
TTGCAGATAT CATCTGTCAG TCTGTCCAGG TGGATTCGAC GCCCGTCCCG CGAAAGATTG
CGCGTCTTCA CCTCATCTCT GATATCCTTC ACAACTCTGC GTCGCCTTTG CCCAACGTTT
GGCGATACCG CCTCGCGTTT GAGCATAGGC TACCGCCCGT GCTGGCGCAT TTGAATACGG
TGGAGAAGAG TTTGATGGTG TATTCGGGCA AGATTAGTGC GGATGTGTTT AGAGGGCAAG
TGGGGAATGT ATTGGATATC TGGGAACGAT GGCACGTTGC TTTTTTTTTT TTTGGCTGGT
TGGTTGGCGC GCTTTTTGCT GACTTTTTTT GCAAATAGGA TTGTATTCAA CACCGATACT
GCGGAACTCT TCCGTGCCGT GTTGGTCGGC GACAAGCCTC TCTCTGCGCT CGTCAAGACC
CCCCAGGGCG CATGGGTTGA CAAGGCCAAG TTGGAGGCGG AGGAAGCGGA GAGGAAGAAG
AGGGAAAACG AGCAACAAAA AGCTGAAGAG GAGGAAGATA GGTTTGAGCA GAGTGGGTTT
AAGAGTAGTT TTAAGAGGAT CAATCCGCAG GCTGGGCCTG CGCCTGCGCC TGAATCAGTA
CCGTCAGCTG TCAGATCTGC GTTTGAGGAT GAAGATTTGG ATGGGGAGGT GATGGAAGAT
TTAGACGGGG AGGCGATGGA AGATTTGGAC GGGGAGGCGA TGGAGGATTT GGACGGGGAG
GCAATGTAA
 
Protein sequence
MSRRLDLSKF AGSDSEGEED VPKHQLELTA LKSKTFSQGI TKKTKRDLEK EAEERKRIEE 
EKAAALVMAE IEREFEGSAE GSSSGHTGMR GFRGRPIGPG GGFVRAGGAP MGAPTSFAPP
RGPAAMGYGR PKPMPVRAPS PPPPPSGPKP RGKRAMDSFL EEIKHNQNAR EQKFSQIAKK
EGSSVTALAA WETGGSGFDH ESTNLFISNL PQEITEEILG LHFAKQGPVA TVKIMWPRGD
EAFSQASARR GLTGFVSYME RKDAERAVKE LDGSEWMGNS IRVGWSKPVA KPLKALFDIT
SDSHKRRRSR SRSRGRSPPR KKSHRARSYS YSSSSSYSRS PSPERTCKQK WLDSIPEEHG
RFIKTVANRV KEHGKGFEDV LMEKERENPK FAFLYDDKLP DYHLYQSTLS SHYRIPSPPP
EAFNDDGYAS IYSSDSAEDS ERERTSKGKL GRLAKRRFEA MLRVMTGKRA EIARGMEFAL
RRAEAADEIA DIICQSVQVD STPVPRKIAR LHLISDILHN SASPLPNVWR YRLAFEHRLP
PVLAHLNTVE KSLMVYSGKI SADVFRGQVG NVLDIWERWI VFNTDTAELF RAVLVGDKPL
SALVKTPQGA WVDKAKLEAE EAERKKRENE QQKAEEEEDR FEQSGFKSSF KRINPQAGPA
PAPESVPSAV RSAFEDEDLD GEVMEDLDGE AMEDLDGEAM EDLDGEAM