Gene CNB02890 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB02890 
Symbol 
ID3256018 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp852322 
End bp853720 
Gene Length1399 bp 
Protein Length410 aa 
Translation table 
GC content53% 
IMG OID638254939 
Productpurine-specific oxidized base lesion DNA N-glycosylase, putative 
Protein accessionXP_569014 
Protein GI58263208 
COG category[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase 
TIGRFAM ID[TIGR00588] 8-oxoguanine DNA-glycosylase (ogg) 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value0.259256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCCCGGC CGCCGTTCCC CGCCGGGTGG GCGTCCGTGC GCATGGACCC CCGCAACCTC 
TCGCTGGCAA ACACGCTGCC CGTCGGCCAG GCCTTCCTCT GGCACAGGCT CCCCCTCCCC
GCCAGCGATC CCCCCTTCGA AGAGTATTCC AGGGCAGTAC ACTCCCCCCC TCGTGTCGTG
TGTCTCCGCC AGTCGCCCAC GCACATATAC TACACTGCCG TATATCCCTC CGGGTCAGCC
CCAGAGCCAG AGCGCAGCAA TCTCTCCACC AGGCAGTGGC TTGAGGACTA CTTCCAGCTG
GTCAGATATC CAGACTTGGA AGCCTTGTAT CTCGACTGGC GGCGCAGAGA CCCAGAGCTG
TTTGGTAAAG TACACGTCAA TGACAGAGCC ACTGGGATCC GGGTCCTCAG GCAAGATCCA
TGGGAATGTC TCTTAGCGTA CGTAGCCAGT GTCTCCTTTC AGCGCTGACA GGCAGCTTCA
TCACATCCAC CAACAACCAC ATACCACGTA TAACTTCGCT ACTGCATAAA TTCTCGCAAT
CTTTCACAAA GCCGGTGCTT ACTCTCAAAC ATCCCTCGAA TGGTATCTTG ATTCCATACC
ACCTGTTCCC CGCCCCTCAT CAGATACCAA CAAGACTGGA AAAACCCCTG CGTGACATGG
GTTTTGGGTA CCGCGCTCCG TTTATCGAAG CCTCTTTGCA GCTGCTCCGC AACAAATTCG
GGGATAAAGA GGGAGATATA GAAGCTGGGC TCGTAGCGTG GAGAAACGAA GATGTGGACA
TTGTACGTGA GAACCTTATT GCATTGAAAG GCGTCGGAAG AAAAGTCGCA GACTGTGTAA
TGCTCATGTG TCTCGATAAA GTGAGTGACA AGATCCCATG GATGCCGCGT GTCCAGTGCT
CCTGACAGGT GAACAGCCCT CTCTGATTCC CATCGATACC CATATTGCAC ATATTGCCGC
AAGACATCCT GCCTTTCCAT CCCGTCTGAA GAACAAGGCA ATGTCGAAAC AGGTCTATGA
GGAGACGCAG GAATTTTTGC TTGGTCGTTG GGGCCCAATG GGAGGATGGT GTCAGGCCGT
CTTATTTGCG GCTGATCTCC CTCAATCCCG AGGCAATATA AAAGTGAAGA CAAAAGTTGA
ATCTATTGTG AAAACTGTTG TAAAGACCGA GACAAGCCTG GATAATGTTG GTAACGTCAG
GCGGAAACGA GGGGAGAGTG AAGAAGAAAA ACTGCCCGTC TTGAAGCGTA CTCGCAGTGC
GACAAAGCAA GCGCAACAAG TGCTAGTGGG CGTGGATGTA AAGTATGATG AAAATACGGA
CCGGTAGCAT CGGATCACTA GATTACTCGT AGTGTAGAGG ATTGATCGTG CACAAATAGT
TAATAGCGTT TCTATGATG
 
Protein sequence
MSRPPFPAGW ASVRMDPRNL SLANTLPVGQ AFLWHRLPLP ASDPPFEEYS RAVHSPPRVV 
CLRQSPTHIY YTAVYPSGSA PEPERSNLST RQWLEDYFQL VRYPDLEALY LDWRRRDPEL
FGKVHVNDRA TGIRVLRQDP WECLLAFITS TNNHIPRITS LLHKFSQSFT KPVLTLKHPS
NGILIPYHLF PAPHQIPTRL EKPLRDMGFG YRAPFIEASL QLLRNKFGDK EGDIEAGLVA
WRNEDVDIVR ENLIALKGVG RKVADCVMLM CLDKPSLIPI DTHIAHIAAR HPAFPSRLKN
KAMSKQVYEE TQEFLLGRWG PMGGWCQAVL FAADLPQSRG NIKVKTKVES IVKTVVKTET
SLDNVGNVRR KRGESEEEKL PVLKRTRSAT KQAQQVLVGV DVKYDENTDR