Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH01280 |
Symbol | |
ID | 3259293 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 814842 |
End bp | 816609 |
Gene Length | 1768 bp |
Protein Length | 463 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258355 |
Product | DNA-(apurinic or apyrimidinic site) lyase, putative |
Protein accession | XP_572318 |
Protein GI | 58270324 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0648] Endonuclease IV |
TIGRFAM ID | [TIGR00587] apurinic endonuclease (APN1) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 0.328575 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCTCAGACG ATCAAGACAC TCCTACCCCA TAGCAATGGC GCGCGTGATA ACATCAGCAA CTCCCAAGCG CGAACGGTCA GCGTCGTCCC CTTTGACAGA GCTAGAGCCC GAAGTTCCTG CTCCCAAGGC CGTAAAGCCT AAGAGAGCCG TTCGATCAAC CAAGCCGAAA AATGAGGATA CCAAAGAGAA TGATAATAAC GAGGACGCAC CTGCTGCTGC AAAGAAGCAG CGTGTTTCCA AGGCCAAAGC TTGGCCCCCA GCTGAACTAG AACCGATGCT TCACCTTCCT CGTCAAGGTT ACCCCGCATT CAAGCTCCCG TGTTCTACAG CCTCTTCTAA CGGAGGTATT GCCCCTCAGA ACGACAAATC ACAACCGATG CTTTTGGGAG CACATGTATC TGCTGCTGGT GGTCCGGCTA CAGCATTACT GAGAGCAGGT CTAGCAGGCG CGAATGGGTT GGCTCTGTTT GTCAAGAGTC AAAGACAGTG GAAGAGTAAG CCGTATGAAG ATGAGACAGT TCAGAGATTC AAAGAGCTCA TGAAGAGCAA GGAAGAAGGT GGTGAGTAAA TTGCCGTATA TGAACCATCG AGTGCTGAAT TACACATCCA AAGGAATGGG CTATGGCCCG GAGAGTATAC TGGTCCACGG CAGTTATCTC ATCAACCTAG GGTGAGTTAA TTGTCGCGTC GCACATGAGC CTTTTGCTAA CACTTCGCCT CTGCAGAAAC CCCGACCCGT AAGCTTTCAG AGATTCAAGC CTATATAGAC ATCTGACATC ACAATAGGGC CAAGTGGAAA GTCTCCTACG AATGTTTCAA AGATGATATT GCACGTTGCC ACCAGCTTGG TATTAAACTC TACAACTGGC AGTATGTGCT TTCCGTCTCT ATGCCTTTCG AATCCTCACC CGAGTAGTCC TGGATCCACA GTTGGCGCTT GTACCAAGGA AGAAAGTTTT GCTCTTATCG CAAAAGCCAT CAACCAAGTA CACAAAGATG TCCCTGAAGT CATCACCGTG ATCGAGAATA TGGTTAGTCC CCAGTCGTTA CTAAAGATCA CTGTCTAATT TAATTCAGGC CAACGCAGGA TCCAACATTG TTGGTACAGC ATGGTCAGAC CTTTCCTCTA TCATTAAACT CGTCGAAGAC AAGTCCCGTG TCCGCGTCTG TATCGACACT TGCCATACTT TTGCTGCTGG TTACGATATC CGAACGCCGG AGACATACGC CGAGACTATG AAAAGGTTTG ACGAGGTGGT TGGAAACAAG TACTTGGCTG GTGTACATCT AAATGATTCT AAGGCGAATC TGGGCGCGAA CAAGGATTTG CACGAGAATA TCGGTCTGTA GGTGGCCCTG TGTCTTGTTT CACGAGTGAT CAAACAAAGC TAACTTAAGT CTTAAGTGGC GAGATCGGTC TCACAGCGTT CAGATGCATC ATGCGCGACC CTCTCATGAC GGGTATACCG CTCGTCCTTG AAACACCCGC GCCAGACGCC CCAACCCCCG CCGAACATCT TTCCATCTGG ACAAAGGAAA TTGCCCTCTT ATACGAGATC CAGGCCATCG AAGATGATGA ATGGGATGTC AAGAAGGGGG AGATCGAAAA GCGGTGGAGG AAAGAGCGGG ATGCGATCAA TCCGCCCAAA GAAAAAAAGA AGCCCGCTGC CAAGGGGAAA GCGAAGAAGG CCAAAAAAGT GGAGGACGAC GGATGCTCCC ATGATGAGGA CTAGAGTCGG GGATAGAATT CCAGCCGGAG AGGTCAGA
|
Protein sequence | MARVITSATP KRERSASSPL TELEPEVPAP KAVKPKRAVR STKPKNEDTK ENDNNEDAPA AAKKQRVSKA KAWPPAELEP MLHLPRQGYP AFKLPCSTAS SNGGIAPQND KSQPMLLGAH VSAAGGPATA LLRAGLAGAN GLALFVKSQR QWKSKPYEDE TVQRFKELMK SKEEGGMGYG PESILVHGSY LINLGNPDPA KWKVSYECFK DDIARCHQLG IKLYNWHPGS TVGACTKEES FALIAKAINQ VHKDVPEVIT VIENMANAGS NIVGTAWSDL SSIIKLVEDK SRVRVCIDTC HTFAAGYDIR TPETYAETMK RFDEVVGNKY LAGVHLNDSK ANLGANKDLH ENIGLGEIGL TAFRCIMRDP LMTGIPLVLE TPAPDAPTPA EHLSIWTKEI ALLYEIQAIE DDEWDVKKGE IEKRWRKERD AINPPKEKKK PAAKGKAKKA KKVEDDGCSH DED
|
| |