Gene Information |
Locus tag | CNL04530 |
Symbol | |
ID | 3255007 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | + |
Start bp | 261141 |
End bp | 262696 |
Gene Length | 1556 bp |
Protein Length | 461 aa |
Translation table | |
GC content | 49% |
IMG OID | 638253924 |
Product | DNA-3-methyladenine glycosidase, putative |
Protein accession | XP_568002 |
Protein GI | 58261184 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0122] 3-methyladenine DNA glycosylase/8-oxoguanine DNA glycosylase |
TIGRFAM ID | |
|
|
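The Gene Length row is consistent with the Start bp and End bp rows if the coordinates are read as 1-based and inclusive. The record does not state that convention, so it is an assumption here. A minimal Python sketch of the arithmetic, using a hypothetical helper name `span_length`:

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Start bp / End bp rows of this record.
gene_length = span_length(261141, 262696)
print(gene_length)  # 1556, matching the Gene Length row

# A 461 aa protein needs at least (461 + 1) * 3 nucleotides including a stop codon.
# The remainder of the span is non-coding, which fits "Is gene spliced | Yes".
min_coding_nt = (461 + 1) * 3
print(gene_length - min_coding_nt)  # 170
```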
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
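Several rows in this record are empty or carry `n/a`, as in the Fosmid Coverage block. The sketch below shows one way such rows could be read into key/value pairs with missing values mapped to `None`. It assumes every data row has the form `Field | value |` with a trailing pipe. `parse_row` is a hypothetical helper, not part of any database tooling.

```python
from typing import Optional, Tuple


def parse_row(line: str) -> Optional[Tuple[str, Optional[str]]]:
    """Parse a 'Field | value |' row.

    Returns None for section titles ('Title |') and separator rows ('| |').
    Empty values and 'n/a' are returned as None.
    """
    parts = [p.strip() for p in line.split("|")]
    # A data row with a trailing pipe splits into exactly three parts.
    if len(parts) != 3 or not parts[0]:
        return None
    key, value = parts[0], parts[1]
    if value in ("", "n/a"):
        return key, None
    return key, value


print(parse_row("Num covering fosmid clones | n/a |"))
print(parse_row("Plasmid clonability | normal |"))
```

Rows without a trailing pipe, such as the long sequence rows, would need separate handling under this assumption.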
Sequence |
Gene sequence | ATGCCATCTA CAAGAAGTGA GACAGCTAAG TCAGATCCTG CTCTAGGTAT CCCGTCTGTA ACTAGAGCAG CAAGGACGAA CACGACGAGC ACTGCTACTT CCAAGCGAGC GAAACCAGAT AGTGGAGCAC TTTCTCTCTC AGCAGGGAAA GGCCACTTGA AGCCGTCCAA AAAAAAGCAG AAGGTGGAGC CAAGTATAAT GGATATCCAG CCTCCTTCTA CAGCTGTTGA CCTACCTACG GTTCCTCAAC CTACTCTGCT CCCTCCGACA CTGAATTTCA ACCTCCCCTC AGCCATATCT CATCTCTCTG CCTTGGACCC GCGGTTTTCA CTCTTTTTCG AGCATTTACC TTGTCGACCA TTTGTGAACC TGGAAGCTAT TGATCCCTTC AGGACACTTG TAACTTCCAT TATTGGACAA CAGGTTTCTT GGATGGCGGC CAGAGCTATA AATACAAGGT TCAGGGCTTT GTTCGGGTTT ACACATGAGA AGGAGGGGTT CCCTTCACCT CAAATGGTCC TTATGCAAGA TGTAACATCA CTGAAAGGAG TTGGTTTGAG CGGAAGAAAG GCGGAGTACG GTATGTAGAT TGAATATATC TCAGTATTCA CGATGCTAAT CTGTCGTCCT TCAGTTTTGT CGCTGGCTGA CCACTTTGCC TCCGGTCAAC TGTCAACTCA ATTGCTGCAA AGTGGAACAG ACGAAGAAAT TTCAAAAGCT CTCATTGCTG TCCGCGGTAT TGGGCAATGG ACAGTAGACA TGTTTATGAT ATTCTCTCTT CGTCGCCCAG ACATTTTGGC CGTGGGCGAC TTGGGTGTAC AGAAGGGTCT TCTCAAATGG GCTTTAGCTG CTCATGGTGC ACTAGAAAAG AAATCTTCTG GTACCAACAC ACCCAAGAAG ACCAAGGGAA AAGCCAACAA GGGCGAGAAG AAGGAAGTCA AGGAAGAGGT TGATGATAAG GGGGAAGGTG AGCTGGATAC GAACATAAAA GGAAGTACTC CAGTCAAGCA TGTGACCTCG AGCGCTTTTC CTCCTACGCC ATTGACTCCT AACGACACAT CCGAGAACCC TCTACCTAAA ATGGGCGCCT TACATACGCC TGCGGCCCCA TCAGGTGCAA GCGTTGCACA AGGGCCTCGA ACACCATCCA CACCATCTGC TATGCCCCGC GAAACTGTCG AAGTTCCACC AAAGACACTT CCGGGCCCGA CTCCGGAGGT GATGCTTACA GCCCCATTAG AACATCCTGA TTGGGACCCC CATCGTGCCG TTCCGTTACT AGAAGGTCTG TCTGTAGATA TTCTGAAATC CCGCTTGAAC GGAAAGAAAG TCAAGTGAGT CTTTGCCTCT CAGATGATAC GACATATCCA TCGCTAATTT CAATTGTGCT CCAGGGGCGG AGCCTATTTG ACACCAAAGG AAATGGAGGC TCTCACTGAA GGGTGGAGAC CATATAGATC GTTGGCAGTG TTCTACATGT GGCCTGTCGC TGGAGAATGA TCACCCCAAA GCCTTCCGTG AACGCATAGT AGTATCCAGC GCTCTTCAAA ATGCAT
|
Protein sequence | MPSTRSETAK SDPALGIPSV TRAARTNTTS TATSKRAKPD SGALSLSAGK GHLKPSKKKQ KVEPSIMDIQ PPSTAVDLPT VPQPTLLPPT LNFNLPSAIS HLSALDPRFS LFFEHLPCRP FVNLEAIDPF RTLVTSIIGQ QVSWMAARAI NTRFRALFGF THEKEGFPSP QMVLMQDVTS LKGVGLSGRK AEYVLSLADH FASGQLSTQL LQSGTDEEIS KALIAVRGIG QWTVDMFMIF SLRRPDILAV GDLGVQKGLL KWALAAHGAL EKKSSGTNTP KKTKGKANKG EKKEVKEEVD DKGEGELDTN IKGSTPVKHV TSSAFPPTPL TPNDTSENPL PKMGALHTPA APSGASVAQG PRTPSTPSAM PRETVEVPPK TLPGPTPEVM LTAPLEHPDW DPHRAVPLLE GLSVDILKSR LNGKKVKGGA YLTPKEMEAL TEGWRPYRSL AVFYMWPVAG E
|
| |
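The sequences above are printed in space-separated blocks of ten characters. The sketch below shows how such a string could be normalised and its GC fraction computed. The function names are hypothetical. The record does not say how its 49% GC figure was rounded or over which span it was computed, so the example uses only the first three blocks of the Gene sequence row.

```python
def normalise_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a non-empty nucleotide string."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First three blocks of the Gene sequence row, copied verbatim.
example = "ATGCCATCTA CAAGAAGTGA GACAGCTAAG"
seq = normalise_sequence(example)
print(len(seq), round(gc_fraction(seq), 2))  # 30 0.43
```

Passing the full Gene sequence value to `normalise_sequence` and then `gc_fraction` would give the fraction for the whole displayed span.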