Gene CNN00820 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNN00820 
Symbol 
ID3255510 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006683 
Strand
Start bp264612 
End bp266723 
Gene Length2112 bp 
Protein Length560 aa 
Translation table 
GC content49% 
IMG OID638254498 
Productpseudouridylate synthase, putative 
Protein accessionXP_568627 
Protein GI58262434 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0101] Pseudouridylate synthase 
TIGRFAM ID[TIGR00071] pseudouridylate synthase I 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCCCAC GCGTAACAGC ATTTCTTAGC AATATACTCA GGCAGCCAGC TTTAAAACGC 
ATAATGGAAC ACACAGAACC CATGAAGAGG CCCAGATCGC CATCTCCGCA GCAGGCAATG
GTACCAGAGG CCAAAAGACC CCATATCGAA CCGGCACCGG TACCTGCTGC AGTCCAAGTT
GATGCCGAAG AAGCAATGTT CAATGTCGAA GAAGAAACTC AAGGTGGCAA AGGACGGAGA
GGGAAGAGAG GAAATGAAGG CCAGGCTCGT AAAGAGAAGA AAGAGAAACG AGATGCCAAG
GACCCGAGGG CTCAACGAGC TTGGGAACCC AGCGAGAAGA CCAATGGCGA AAAACGGTTG
CCCAAGAGGA GGTGCGCTGT GTTGATTGGG TGGGTCTTGC AACGTTCTGG GCGTGGTAGC
AGCTTTATGA CTAATGCGAA TACAGCTACT GCGGTACTGG ATACCAGGGT ATGCAAATGT
GCGCTTTTTT CATTGTCTCT GGCCGCTCAT TTGCCAATCC TTACATTATT GTAGACAAGA
CCACACTGAC CGAACTATTG AGGGCGAAGT CTTTGCTGCT CTTGTCAAAG CTGGCGCTGT
CTCTGCCGAC AATGCTATCG ATGCGCGCAA GGTCGACATT GCTCGAGCTG CTCGAACGGA
TGCTGGCGTT CACGCCGCCG GTAATGTTAT CTCCATCAAA ATGATCACAG AACCGCCTCT
TCCCGAAGGC TTCAAAGATG TCGCCGAGTA TGTCAACACT TTCTTACCAG ACCAAATTAG
GATGTGGGGC TGGGTCAGAA CCGTCAAGTC CTTCAACGCC CGAACGTGAG TCTTCGCTAC
CTGTGACTGT ACCATGGATT ACGGCTGATG ATGGCCCTGG TAAAGGGCGG CCGACTCTCG
TATATACGAG TACCTCCTTC CGTCATACTG CCTCATACCT CCCCACAAAG ATGACTCTCT
TGCCAAGCAT CTCGATTTAT CCTCTCCCGA CTGGCGAGAA ATCGTCGGTG AGGGTCCTTG
CTCCTTTGCC GACGCTAGAC TCCCTATGCC CACTTCTGAC GAAGGCGAAG TCGACCCCAA
GGTTCGAGGA GAGTACGAGA GAAAAAGAAA GTGGAGAGTG GATGAAAAGA CTTTGGGCCG
GTTCAGAGAC ATCATTGCCC AGTACAAGGG TACTCAGTGA GTGTTCGTGT AGAACAGATA
CAACTGGCCA GAACTGAGGG AATGTCAGCA ACTTCTACAA CTACACTGTT GGCAAGCCTT
TTAATGACCG AGCAGTCAAG AGGTTTATGA TCAAGCTTGA GGTGAAGGAA CCCAAGGTGT
ATGGAGAGAT TGAATGGATT TCCGTTCAAA TCCACGGACA AAGTTTCATG CTTCATCAAA
TCGTAAGTGT ACTATATTTA TACCAGTAGA CATTATCAGC TCATGGTGCC ACAGCGAAAA
ATGATCTCCA TGGCGATGCT CGCCTGCCGA ACTGGTTCTC CTCCCTCTCT CCTCCCCGAG
ACATTTGGTC CCAAGAAAAT TCACATTCCC AAAGCCCCCC CTCTCGGTCT CTTGCTCGAG
GCTCCTCAGT TTGGCGTTTA CAACGACAGG ATCACCCAGA AGTTGAATGG CATCACCGAA
GACAGGGATC CGGTAAACTT TGGTCTGTAT GCGGATGAGA TCTATGCTTT CAAGGTGAAG
TGGATCTATG AAATGCTGAG GAAGGAGGAG TTAGAGAAGA ACGTGTGAGC CAAAAGCTTG
CATGGTATGA CTGTTTCAAT GCTGATGAAT GGATAGTTTC CACAAGTGGA TCCAAATGAT
GGACAACATC AAGAACGATT CTCTCGGTTA CCTCAAGTAT GTCACTTCAT CCATCTTCGC
CGCAGTGATT CATGCTGACA TTCAATTTAG CACTAAGGGC ATTATCCCGG CAGAAGCCAC
TGCCTTGGTA CTTGAGCAGG AGAGCAAGCG AAAGGAGGGT CAAAAGACTC AGAAGGAAGG
TGTTGAAACC GGAGTCGAGG AGATTGAGAG TGATGACGAG GAGGTTGACC AAGAGGCCTT
GAAGAGGGGT GAATTGGAAG GGTAGTTCAC CGCTGTAAGA TAAGCAATTA CATTATACTC
ATGTATGCAT AT
 
Protein sequence
MIPRVTAFLS NILRQPALKR IMEHTEPMKR PRSPSPQQAM VPEAKRPHIE PAPVPAAVQV 
DAEEAMFNVE EETQGGKGRR GKRGNEGQAR KEKKEKRDAK DPRAQRAWEP SEKTNGEKRL
PKRRCAVLIG YCGTGYQGMQ IQDHTDRTIE GEVFAALVKA GAVSADNAID ARKVDIARAA
RTDAGVHAAG NVISIKMITE PPLPEGFKDV AEYVNTFLPD QIRMWGWVRT VKSFNARTAA
DSRIYEYLLP SYCLIPPHKD DSLAKHLDLS SPDWREIVGE GPCSFADARL PMPTSDEGEV
DPKVRGEYER KRKWRVDEKT LGRFRDIIAQ YKGTHNFYNY TVGKPFNDRA VKRFMIKLEV
KEPKVYGEIE WISVQIHGQS FMLHQIRKMI SMAMLACRTG SPPSLLPETF GPKKIHIPKA
PPLGLLLEAP QFGVYNDRIT QKLNGITEDR DPVNFGLYAD EIYAFKVKWI YEMLRKEELE
KNVFHKWIQM MDNIKNDSLG YLNTKGIIPA EATALVLEQE SKRKEGQKTQ KEGVETGVEE
IESDDEEVDQ EALKRGELEG