Gene CNI00050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNI00050 
Symbol 
ID3259794 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006694 
Strand
Start bp14423 
End bp17369 
Gene Length2947 bp 
Protein Length725 aa 
Translation table 
GC content51% 
IMG OID638258489 
ProducttRNA dihydrouridine synthase, putative 
Protein accessionXP_572750 
Protein GI58271188 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0042] tRNA-dihydrouridine synthase 
TIGRFAM ID[TIGR00737] putative TIM-barrel protein, nifR3 family 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTATACACCA TGACAGACGA CCCCGCAGCC ACACCTCGTC CAGAGACTTC TGTCAACGCA 
AGGCCAGCAT TTTCCGGTCA GGCTCCGATC AAAGCCGAGT ACGTCGGATC TCGTCGCAAA
CCGTACCTCA AACTAACCAA AATTCATTCT TTAGATATCT AATCAACACC ACCCCTATCG
TCGAATCTGC CTCTGCCTCA GAGCTCAACA ACATTCACCC CGATGATGCC GCCGAGGGTC
GTACCGATTC CCGTGACTCC CGCGATGGTC GTGACAGACC CGACAACAAA CGGCGTAAAC
CCAACAAGCA AGACAAAAAA GACAAAAAAG GTCAAAATAA GGGCCGCCAC TTCCCCGTCA
TCCGGGAAGC CTCTGTCCGT ATCTGTCGTG CTTGGGAAAC AACAGGTATC TGTGACCGTG
CAGATAAGGG TGATTGCCGA TATGCTCACA GCTGGGAAGG TTACTTTGAG GTGAAACCGA
ATGATATCAG TTACCGTCCT GATTGGTCCT TGGTCGGTGA GGCGCCGTTT GTGGTGGAAG
GGGAAAGGGT GGTGGGCGGT GAGGATGTGG TGGGGAAGAC GCTTGACTTA GATACGGTTT
GCCCGGTGTT GAAGGACTTG GGGTACTGTC CTTTTGGGTG GCGATGTCGG TTCTTGGGTG
CGCACGTTAA GCGTGTGGCT GCTGCTGTAG ACGGGGAGAA GGAAAAGGAG GCGGGCCCTG
AGAAGCGAAT GGGAGAGTGG CAAGTCGAGA ACTGGGTACA AAGCGAAGTG GAGAACGGGT
GGAAACAAAA GGAGACGAAT TGGCCTGAAC ATGAAGTGCT TAACGCTCTT CGCCGTAGTA
CTGTAAGTCC AACTCTACCG GTCCTCCCCA TTCCGCTTGA CATATCTCTC CGACATACGT
TTGGCAGTCA AGGCAAATTA CTGTGCTGAT GAAGTTTCAC AGGCGTCATT CCCGTTCTCT
GAAGCATACC TCAAGAAAGT TGATCCCGAC AAACCTTTTA CCCTTCAAAA CAAGAAACCC
ACCAAACAAC AGCCACACAA ACGCAAAAAC AATGTTCTCG ACGAAGAAGA AGCTGCAAAT
GGACCAACCG GCATTCCCTC CGCTGGGGAT GATGAGGAGA ACGCTATGAA CGCCACAGAA
AACGAACGGA ATGAGGAGAA GGGTAAAGTG TACGGTGAAC CGGAAGCGAT TGACGTGCCA
CTCAGACCAG AGGAGAAGAG GAGGTTAAAC TGGGAAGGTG GAAGATATCT CGCTCCTCTG
ACAACCGTCG GTAATCTTGT AGGTCTCCCT TTCCCTTCTC CTCATTCCTC CTTCTCCTTT
CCCGCTCCGC CCAAACTAAC TTCCTCCCCT TCCAGCCATT CCGCCGCCTC TGTGTTGACT
ACGGCGCCAC CATCACCGTC TCCGAAATGG CTCTCGCCCA ACCCCTCGTC TACGGCGCTA
AAGAAGAATG GGCTCTCGTC CGTCGACACG AGAGCGAAAA GATGTTTGGT GTCCAAGTCG
CCGGTGGGTT CCCGAACCGG ATGGTACCCG CCGCGGAAGT CATTGCGAAT ACTATAGGAA
AGGGTGGGGG GGTGGATTTT GTGGATGTTA ATATGGGTTG CCCGATTGAT TTGGTCTTCA
ACCAAGGTGC GGGTAGCGCC CGTAAGTTTT TTTTCTGTTC GGCCCCCTGA GGAAGGCCGC
AAGAAAACTG ATGTTTTTGA TGATCAGTTA TGGACTCCCC TGGACGATTG GGTAAGCTGT
TGGTGGGCAT GAACAGGGCC CTTGGTGATA GTAAGTCACC TCTTTTCTCC TCCATTTGTT
TTGTAATTTG AATGGATAGG CTGACATATC CGTCTTTCTT CGATTTTAGT CCCTCTGACC
GTCAAATTCG TACGTTTTTC CCTTGCTCAA CATCCCAACC TGTCCAGGCA TGTGGATTTG
CATGACTGAC GCAACATTTT CATATAGAGA ACTGGTGTTG CGCATGGGAA ACCTAATGCT
CACAAGTTGA TTCCTCGTTT CGTCACTGAA TGGGGAGCGG GCGCTTTGAC CGTAAGTCCA
CTGTCATCTT CCTTTTTCCC ATACGCCATA CCGAACCCAA CACTGACACT ACAACCCTTC
TGCCCTCCAG ATTCACGGTC GATCTCGCCA ACAACGCTAC TCCAAACCTG CCGACTGGGA
ATACATTAAG ACTTGCGTCA CCGCCCTGCG CGAGTCCGTT GCCGACGCCA ACCTTCCTCC
CGTTCCCATC TTTGGAAACG GTGATTGTTT CTCTGCTGCT TCGTATTATG AGGAGATGGA
CAGAAGTGGG GTGGATGGAG TGATGGTCGC GAGAGGGGCG TTGATCAAGC CATGGATCTT
TACGGAGATC AAGGAAAGAA GAGAGTGGGA TATTTCTGCA GTGGAGAGGT TGGAGGGTAT
CAAAAAGGTG CGCTTTTGTG TCTTTTTCTT CTTCCAAAAT TTTCCATTTG GTCTCGAAGT
ATTGTATAGG GCTATACAAA CTAACCACAT CTACCTTCAC AGTTCGCCGA ATTCGGTCTC
TCCCATTGGG GTTCCGATAC CCAAGGTGTC AATACCACCC GCCGATTCCT ATGCGAAGCC
CTCTCCTTCC AACACCGATA CATCCCCATT GGCCTCCTCG AACGTCTCCC CGCCAAACTC
AACGAACGAC CCCCAGCCTA CAGGGGTAGA AACGAGCTGG AGACGCTTTT GGCGAGTCCG
TTTGCCGGTG ATTGGGTGAA AATTTCAGAG ATGTTTTTGG GCAAGGTGGA TGAAGGGTTT
TCGTTTGTGC CGAAACATAA GAGTAACGCG TATGGAGGGG AAGAGGCGCA GGGCTAAATG
GCGGGTATTG GCATGTTGGG CGTTTTTGTG GGGATTTTTT CTTTTTGTTC AAGGTCATCC
GGTGGGCGAG CTGTCTTGAT TACGCGACCG AATCTAGTTT CTCGCTACTG TATACTTTAC
GTACCGG
 
Protein sequence
MTDDPAATPR PETSVNARPA FSGQAPIKAE YLINTTPIVE SASASELNNI HPDDAAEGRT 
DSRDSRDGRD RPDNKRRKPN KQDKKDKKGQ NKGRHFPVIR EASVRICRAW ETTGICDRAD
KGDCRYAHSW EGYFEVKPND ISYRPDWSLV GEAPFVVEGE RVVGGEDVVG KTLDLDTVCP
VLKDLGYCPF GWRCRFLGAH VKRVAAAVDG EKEKEAGPEK RMGEWQVENW VQSEVENGWK
QKETNWPEHE VLNALRRSTA SFPFSEAYLK KVDPDKPFTL QNKKPTKQQP HKRKNNVLDE
EEAANGPTGI PSAGDDEENA MNATENERNE EKGKVYGEPE AIDVPLRPEE KRRLNWEGGR
YLAPLTTVGN LPFRRLCVDY GATITVSEMA LAQPLVYGAK EEWALVRRHE SEKMFGVQVA
GGFPNRMVPA AEVIANTIGK GGGVDFVDVN MGCPIDLVFN QGAGSALMDS PGRLGKLLVG
MNRALGDIPL TVKFRTGVAH GKPNAHKLIP RFVTEWGAGA LTIHGRSRQQ RYSKPADWEY
IKTCVTALRE SVADANLPPV PIFGNGDCFS AASYYEEMDR SGVDGVMVAR GALIKPWIFT
EIKERREWDI SAVERLEGIK KFAEFGLSHW GSDTQGVNTT RRFLCEALSF QHRYIPIGLL
ERLPAKLNER PPAYRGRNEL ETLLASPFAG DWVKISEMFL GKVDEGFSFV PKHKSNAYGG
EEAQG