Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI04070 |
Symbol | |
ID | 3259476 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | - |
Start bp | 1082604 |
End bp | 1084718 |
Gene Length | 2115 bp |
Protein Length | 654 aa |
Translation table | |
GC content | 49% |
IMG OID | 638258902 |
Product | DNA-(apurinic or apyrimidinic site) lyase, putative |
Protein accession | XP_572926 |
Protein GI | 58271540 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0708] Exonuclease III |
TIGRFAM ID | [TIGR00633] exodeoxyribonuclease III (xth) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.434844 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTAGAT TACCACCCGT AGGTCACCAC AAGTTGCGTC GCCTGCGATG CCATCTCACT CGAAGCATCT ACAGGTTCAG CTCGATGAAG AAGAAGAACA TCGAAGGCCT GCTCGATGAA CTGGATGCCC AGATATTTTG CTTTCAAGGT GAGTTGATAG GCTATTATGT TGAGCAAAGC TGACAGTACA ACAGAACATA AAACGGTCCG CACACGTCTA GAGAAGTCAA TGGCTTGCCC GGGCCCATAC GACGGCTTCT GGACTTTTCC TCGTTCCAAA ACTGGTTATA GCGGAGTCTG CACCTATGTG GATTCTCGCT ATTGCGTGCC TCTCAAGGCG GAAGAAGGTA TTACCGGACT TCTTTTAGGC GACCGGCTGA GTACGATGAA GCCACCATGG ACCGATGCGG AAAGAATAGG CAGTTATCCT GATGTGAATG ATATGGAATG GATGGATGAG CTGGACGGGA CAAAGTTTGA TGTCAAGAAG CTCGACATGG AGGGACGGGC TGTGGTATGC GATTTCGGGT GAGTAATCTC CATATAGCGT TGCAGGGTCA TGGCCCGGCT AATTTAAAGC TAGACTGTTC GTCTTATTCA ACCTTTACTG CCCGAACGAG ACAAATAGTG CTAGGCGACC ATACAAAATG AACTATCTAC ACGCTCTTCA AGAACGTGTA CAGCTCCTTC AGGCTGCTGG GCGCGAAGTC ATGCTCGTGG GAGATATCAA CATTGTACGC CAGCCGATGG ATTCCGGAGA AGGACCTGTG AGGTCTTCAG CGGAGCAACA TTATTCCCAC CCTGCAAGAC GGATACTTGA TGATTGGTGT GCGCCGAAAG GACCGATGAT AGATGTCGTC AGAGAGAGCT GGCCTCAAAG AGATGATATG TTCACCTGTT GGAATCAGAA ACTCGACGCC AGGTGAGTTT GTAATTTTGG TAAGAGTGGA CCCCTCTCAC ATGTTTCACA GGTCGGCAAA CTATGGAAGC CGTATTGACT ATGTACTATG TACACCTGGT CTTCGTCCAT GGATCAAGGG CGGCGATATA CTTCCCAAGG TGTACGGGTC TGATCACTGT CCCGTGTATG TCGACTTGCA TGAGTCCATC GTCACTCCAG AAGGGGAGAC TCTCCACCTT CGCGATATGC TTAATCCCAA AGATCGACCC ACTAGCACGT CTCCTGTATA CCCAAATGAT GTCAAGAGGG AAGCACCTGA GCCGCCAAGG TTCGCAACCA AATTTATGGA CGAGTTCTCA GGGAGACAGA CAAGTCTAAA GAGTTTTTTT GGTGGAGGAT CGAAGCGGGC ACAGGAGAAG ACGAATGGAG CCAGTCTTAG TACAAGTGTG AGCGCGAGTG CTAGTCCGGC TCCAACCCCT ACTGCATCAG AATTATCATC GGCAGTGCAA GCATCCGAAT CTGGTGCTAC AAAAATTGTT TCACTCCCGC AAGCGCCGGA AGAATGCGTT TCCGCCCCAT TCAGCCTTGC TCGAACTGCT TTCAGTTCGT TGGACAATCC CTCCCCTGTT CATAGCCGGG AATTATCTTC CAAAACAAGC GACGGTGCCC CAGTAAAGGC AACCTTATCG TCCAAACAAG ATAAATCTAG CGCAAAGCCC ATAGACATGA CATTAGATGA CGATGAGGAT GATGAGCCAA TCTTGGTATC GTCAAAATCG GAAAACAAGC CCCCACCGAA ACCTACGAGG TCATCGTTGT CTGGTTCGAA ATCAGCCTCC TCACAAGTCA AGCTCTCTTC ATTTTTCAGT CAACCGCATA CCGAAGGCAA AAGAAAATCC CCACCACCAC CATCCTCAGC TCCTTCTATC TCAAAACGGC CCTCATTGGC ACCACTTCCT CAGTCAAATA ATGCTCCGTT TGCTGCCTCT CCTTCCGCCA CAACTGCGGA AGATGAGTGC CAGGGCATGA CAGAAGAAGA GAATCAACTT ATTTCGCAAG CTATTGCAGA AGCAGACGCA GAAAGAGCAG AAAAAAACGC AAAAGCTGCT CCGCAATGGT CCAGCCTTTT TGCCAAAAAG CTGCCACCGT TGTGCACAGT TCATCATAAG CCGTGCAAGG ATTTTGGTAA GTTGCCGCAA TGGATTTTCT GTTAA
|
Protein sequence | MSRLPPVGHH KLRRLRCHLT RSIYRFSSMK KKNIEGLLDE LDAQIFCFQE HKTVRTRLEK SMACPGPYDG FWTFPRSKTG YSGVCTYVDS RYCVPLKAEE GITGLLLGDR LSTMKPPWTD AERIGSYPDV NDMEWMDELD GTKFDVKKLD MEGRAVVCDF GLFVLFNLYC PNETNSARRP YKMNYLHALQ ERVQLLQAAG REVMLVGDIN IVRQPMDSGE GPVRSSAEQH YSHPARRILD DWCAPKGPMI DVVRESWPQR DDMFTCWNQK LDARSANYGS RIDYVLCTPG LRPWIKGGDI LPKVYGSDHC PVYVDLHESI VTPEGETLHL RDMLNPKDRP TSTSPVYPND VKREAPEPPR FATKFMDEFS GRQTSLKSFF GGGSKRAQEK TNGASLSTSV SASASPAPTP TASELSSAVQ ASESGATKIV SLPQAPEECV SAPFSLARTA FSSLDNPSPV HSRELSSKTS DGAPVKATLS SKQDKSSAKP IDMTLDDDED DEPILVSSKS ENKPPPKPTR SSLSGSKSAS SQVKLSSFFS QPHTEGKRKS PPPPSSAPSI SKRPSLAPLP QSNNAPFAAS PSATTAEDEC QGMTEEENQL ISQAIAEADA ERAEKNAKAA PQWSSLFAKK LPPLCTVHHK PCKDFGKLPQ WIFC
|
| |