Gene CNL04530 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL04530 
Symbol 
ID3255007 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp261141 
End bp262696 
Gene Length1556 bp 
Protein Length461 aa 
Translation table 
GC content49% 
IMG OID638253924 
ProductDNA-3-methyladenine glycosidase, putative 
Protein accessionXP_568002 
Protein GI58261184 
COG category[L] Replication, recombination and repair 
COG ID[COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCATCTA CAAGAAGTGA GACAGCTAAG TCAGATCCTG CTCTAGGTAT CCCGTCTGTA 
ACTAGAGCAG CAAGGACGAA CACGACGAGC ACTGCTACTT CCAAGCGAGC GAAACCAGAT
AGTGGAGCAC TTTCTCTCTC AGCAGGGAAA GGCCACTTGA AGCCGTCCAA AAAAAAGCAG
AAGGTGGAGC CAAGTATAAT GGATATCCAG CCTCCTTCTA CAGCTGTTGA CCTACCTACG
GTTCCTCAAC CTACTCTGCT CCCTCCGACA CTGAATTTCA ACCTCCCCTC AGCCATATCT
CATCTCTCTG CCTTGGACCC GCGGTTTTCA CTCTTTTTCG AGCATTTACC TTGTCGACCA
TTTGTGAACC TGGAAGCTAT TGATCCCTTC AGGACACTTG TAACTTCCAT TATTGGACAA
CAGGTTTCTT GGATGGCGGC CAGAGCTATA AATACAAGGT TCAGGGCTTT GTTCGGGTTT
ACACATGAGA AGGAGGGGTT CCCTTCACCT CAAATGGTCC TTATGCAAGA TGTAACATCA
CTGAAAGGAG TTGGTTTGAG CGGAAGAAAG GCGGAGTACG GTATGTAGAT TGAATATATC
TCAGTATTCA CGATGCTAAT CTGTCGTCCT TCAGTTTTGT CGCTGGCTGA CCACTTTGCC
TCCGGTCAAC TGTCAACTCA ATTGCTGCAA AGTGGAACAG ACGAAGAAAT TTCAAAAGCT
CTCATTGCTG TCCGCGGTAT TGGGCAATGG ACAGTAGACA TGTTTATGAT ATTCTCTCTT
CGTCGCCCAG ACATTTTGGC CGTGGGCGAC TTGGGTGTAC AGAAGGGTCT TCTCAAATGG
GCTTTAGCTG CTCATGGTGC ACTAGAAAAG AAATCTTCTG GTACCAACAC ACCCAAGAAG
ACCAAGGGAA AAGCCAACAA GGGCGAGAAG AAGGAAGTCA AGGAAGAGGT TGATGATAAG
GGGGAAGGTG AGCTGGATAC GAACATAAAA GGAAGTACTC CAGTCAAGCA TGTGACCTCG
AGCGCTTTTC CTCCTACGCC ATTGACTCCT AACGACACAT CCGAGAACCC TCTACCTAAA
ATGGGCGCCT TACATACGCC TGCGGCCCCA TCAGGTGCAA GCGTTGCACA AGGGCCTCGA
ACACCATCCA CACCATCTGC TATGCCCCGC GAAACTGTCG AAGTTCCACC AAAGACACTT
CCGGGCCCGA CTCCGGAGGT GATGCTTACA GCCCCATTAG AACATCCTGA TTGGGACCCC
CATCGTGCCG TTCCGTTACT AGAAGGTCTG TCTGTAGATA TTCTGAAATC CCGCTTGAAC
GGAAAGAAAG TCAAGTGAGT CTTTGCCTCT CAGATGATAC GACATATCCA TCGCTAATTT
CAATTGTGCT CCAGGGGCGG AGCCTATTTG ACACCAAAGG AAATGGAGGC TCTCACTGAA
GGGTGGAGAC CATATAGATC GTTGGCAGTG TTCTACATGT GGCCTGTCGC TGGAGAATGA
TCACCCCAAA GCCTTCCGTG AACGCATAGT AGTATCCAGC GCTCTTCAAA ATGCAT
 
Protein sequence
MPSTRSETAK SDPALGIPSV TRAARTNTTS TATSKRAKPD SGALSLSAGK GHLKPSKKKQ 
KVEPSIMDIQ PPSTAVDLPT VPQPTLLPPT LNFNLPSAIS HLSALDPRFS LFFEHLPCRP
FVNLEAIDPF RTLVTSIIGQ QVSWMAARAI NTRFRALFGF THEKEGFPSP QMVLMQDVTS
LKGVGLSGRK AEYVLSLADH FASGQLSTQL LQSGTDEEIS KALIAVRGIG QWTVDMFMIF
SLRRPDILAV GDLGVQKGLL KWALAAHGAL EKKSSGTNTP KKTKGKANKG EKKEVKEEVD
DKGEGELDTN IKGSTPVKHV TSSAFPPTPL TPNDTSENPL PKMGALHTPA APSGASVAQG
PRTPSTPSAM PRETVEVPPK TLPGPTPEVM LTAPLEHPDW DPHRAVPLLE GLSVDILKSR
LNGKKVKGGA YLTPKEMEAL TEGWRPYRSL AVFYMWPVAG E