Gene CNG00230 details


Gene Information

Locus tag: CNG00230
Symbol:
ID: 3258832
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 54780
End bp: 56250
Gene Length: 1471 bp
Protein Length: 374 aa
Translation table:
GC content: 49%
IMG OID: 638257637
Product: U5 snRNP-specific 40 kDa protein, putative
Protein accession: XP_571754
Protein GI: 58269196
COG category: [R] General function prediction only
COG ID: [COG2319] FOG: WD40 repeat
TIGRFAM ID:
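
The coordinate and length fields in this record are tied together by simple arithmetic. The following is a minimal Python sketch of that check, assuming 1-based inclusive coordinates; the helper name `span_length` is hypothetical and not part of any database tooling. It also compares the span with the bases needed to encode 374 residues, assuming a single stop codon. The remainder would correspond to introns and any untranslated regions, which is consistent with the gene being marked as spliced.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1

# Values taken from the Gene Information fields
gene_length = span_length(54780, 56250)

# 374 codons plus one stop codon (the stop codon is an assumption)
coding_bases = (374 + 1) * 3

# Bases in the span not accounted for by codons (introns, untranslated regions)
noncoding_bases = gene_length - coding_bases

print(gene_length)      # 1471
print(coding_bases)     # 1125
print(noncoding_bases)  # 346
```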


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.718611
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTTGCCCGAG GAAATATTTA TTATTTCTTA TCACAAACTG CCCGCCATGT CCGTCAGGAA 
GTCGCCACCG ACAGCAGGCC CAGGAATGGC CCTCTCGAAA CGTGCTCGAG TGGAGGATGA
GGCCGACGAA AACACCATGG TCATGACCGT TGCTTCTTCA GGAGAAGGAC AGCGGAAAAA
CGCTTTGATA AGGAGTGTCA AGAGGACGAG TAGTCTTGAA GCGCCTATCG TGTCATTGTC
GGCTGCTCAT GGTGTATGTC TTCAAATATT GACGACGTAC GCGGATGAAA CTGATAAAAG
CAGGGCGAGA TTACGGCTTG TGTGTTTGAC CCTTCAGGAC AGACTCTTGC GGCTAGTTCG
GTGGACCGCA GTATTTGTAA GTTAAGATAT TACTTGGATG CCTATCTCTT ACAATCCTGT
AGCTTTGTGG AAGTCCTATC CCCCACACGA CAACTACGGT ATCCTTCCAA ACGTCCATAA
GACCGCTATC CTTGATATCG CCTATTCCCT CGACTCTGAA ACTATCTACT CTGTCAGTCT
ATCATGCGCT TTCACAAATC ATCACTGACA ACATACTAGG GTGCTGCAGA CGGCACTCTC
ATATCTACTG ACTTACGTAC CGGTGAACGC ATTTCCCGCT ACTTTGCACA CTATGGCCCC
TTAAACTCTA TATCCGTCAC CATCTCTGGC GGTCGAGAGC TCGTGTTGAC AGGTGGTGAT
GATGGGATTG CTCGTGTCTG GGATTTTGCA TTGGATGGGA AAGACCCTGT GGCAGAGTTT
GATGATGAGC GAGATTGTCC AGTGACAGCT GTGGAATGGA GTTCAGACGG GAACCAGTGT
TTCGTTGGTG GAGTTGACAA CACCATCAAG GTAGGTTATA CCCCCGCGAG TTATTTCAGT
CCTGAAGCTG ACAGATTGTA GGTATGGGAC CTTCGAACGA ACAAAGTTCT CTACACGCTT
CACGGCCACA CCGATACCAT TGCTTCCCTT TCTCTCTCGC CTAACGGCCA TTACCTCGCC
TCCTATGCTC TCGATTCTGC TCTCATCATC TACGACGTCC GACCCTTTTC TTCCGACCCC
ATGCGCGTGT ACAGATCTCT CACCGGCGCA CCAGCAGGTT TTGAGCAAAC CCTCATACGA
TGTGCGTGGA CAAGACATGA TGGCGGACAA AGAATAGCGG CAGGAGGTGG AGATAGGACC
GTTACTGTTT GGGAAGTTGA GACGGGCAAG GTGCTGTACA AGCTTCCGGG GCATAAGGGA
ACTGTGACTG GCGTGGATTT CCATCCTAGG TATGTCACTA TTAATTCTGT TTCCGATATC
GAGAGGCAAA GTCGCTGATT GGTATGCATT CAGAGAACCA ATCATCTTGA CAGGATCAAA
AGATACAAAC ATGTTACTTG GCGAGCTGGA TGCTCAAGAC TTCTCATAGA CGGGCGGCAG
TACGAAAAAA GTATTAGTGC GATAAGCAAT G
 
Protein sequence
MSVRKSPPTA GPGMALSKRA RVEDEADENT MVMTVASSGE GQRKNALIRS VKRTSSLEAP 
IVSLSAAHGG EITACVFDPS GQTLAASSVD RSISLWKSYP PHDNYGILPN VHKTAILDIA
YSLDSETIYS GAADGTLIST DLRTGERISR YFAHYGPLNS ISVTISGGRE LVLTGGDDGI
ARVWDFALDG KDPVAEFDDE RDCPVTAVEW SSDGNQCFVG GVDNTIKVWD LRTNKVLYTL
HGHTDTIASL SLSPNGHYLA SYALDSALII YDVRPFSSDP MRVYRSLTGA PAGFEQTLIR
CAWTRHDGGQ RIAAGGGDRT VTVWEVETGK VLYKLPGHKG TVTGVDFHPR EPIILTGSKD
TNMLLGELDA QDFS
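
Both sequences are printed in space-separated blocks of ten characters, so the spacing has to be removed before computing lengths or base composition. The sketch below shows one way to do that in Python. The function names `clean_sequence` and `gc_percent` are hypothetical helpers introduced only for illustration, and the samples are single lines copied from the sequences in this record, not the full sequences.

```python
def clean_sequence(blocked: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one string."""
    return "".join(blocked.split()).upper()

def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (0.0 for an empty string)."""
    if not dna:
        return 0.0
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# Last line of the protein sequence, used as a short sample
protein_tail = clean_sequence("TNMLLGELDA QDFS")
print(len(protein_tail))  # 14

# First line of the gene sequence, including its trailing space
dna_head = clean_sequence(
    "CTTGCCCGAG GAAATATTTA TTATTTCTTA TCACAAACTG CCCGCCATGT CCGTCAGGAA "
)
print(len(dna_head))  # 60

# GC percentage of that first line only, not of the whole gene
print(round(gc_percent(dna_head), 1))
```

Applying `clean_sequence` to the complete gene and protein blocks and taking `len` of each result should reproduce the Gene Length and Protein Length fields listed in the record.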