Gene CNI04070 details


Gene Information

Locus tag: CNI04070
Symbol:
ID: 3259476
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 1082604
End bp: 1084718
Gene Length: 2115 bp
Protein Length: 654 aa
Translation table:
GC content: 49%
IMG OID: 638258902
Product: DNA-(apurinic or apyrimidinic site) lyase, putative
Protein accession: XP_572926
Protein GI: 58271540
COG category: [L] Replication, recombination and repair
COG ID: [COG0708] Exonuclease III
TIGRFAM ID: [TIGR00633] exodeoxyribonuclease III (xth)
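
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon with no untranslated region; neither assumption is stated in the record. Under those assumptions, the difference between the gene length and the coding length is the sequence removed by splicing, which is consistent with the gene being marked as spliced. The function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 1082604
END_BP = 1084718
PROTEIN_LENGTH_AA = 654


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate interval (assumed convention)."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein plus one stop codon."""
    return 3 * (protein_aa + 1)


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
non_coding_bp = gene_bp - cds_bp  # attributed to introns under the stated assumptions

print(gene_bp)        # 2115
print(cds_bp)         # 1965
print(non_coding_bp)  # 150
```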


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.434844
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTAGAT TACCACCCGT AGGTCACCAC AAGTTGCGTC GCCTGCGATG CCATCTCACT 
CGAAGCATCT ACAGGTTCAG CTCGATGAAG AAGAAGAACA TCGAAGGCCT GCTCGATGAA
CTGGATGCCC AGATATTTTG CTTTCAAGGT GAGTTGATAG GCTATTATGT TGAGCAAAGC
TGACAGTACA ACAGAACATA AAACGGTCCG CACACGTCTA GAGAAGTCAA TGGCTTGCCC
GGGCCCATAC GACGGCTTCT GGACTTTTCC TCGTTCCAAA ACTGGTTATA GCGGAGTCTG
CACCTATGTG GATTCTCGCT ATTGCGTGCC TCTCAAGGCG GAAGAAGGTA TTACCGGACT
TCTTTTAGGC GACCGGCTGA GTACGATGAA GCCACCATGG ACCGATGCGG AAAGAATAGG
CAGTTATCCT GATGTGAATG ATATGGAATG GATGGATGAG CTGGACGGGA CAAAGTTTGA
TGTCAAGAAG CTCGACATGG AGGGACGGGC TGTGGTATGC GATTTCGGGT GAGTAATCTC
CATATAGCGT TGCAGGGTCA TGGCCCGGCT AATTTAAAGC TAGACTGTTC GTCTTATTCA
ACCTTTACTG CCCGAACGAG ACAAATAGTG CTAGGCGACC ATACAAAATG AACTATCTAC
ACGCTCTTCA AGAACGTGTA CAGCTCCTTC AGGCTGCTGG GCGCGAAGTC ATGCTCGTGG
GAGATATCAA CATTGTACGC CAGCCGATGG ATTCCGGAGA AGGACCTGTG AGGTCTTCAG
CGGAGCAACA TTATTCCCAC CCTGCAAGAC GGATACTTGA TGATTGGTGT GCGCCGAAAG
GACCGATGAT AGATGTCGTC AGAGAGAGCT GGCCTCAAAG AGATGATATG TTCACCTGTT
GGAATCAGAA ACTCGACGCC AGGTGAGTTT GTAATTTTGG TAAGAGTGGA CCCCTCTCAC
ATGTTTCACA GGTCGGCAAA CTATGGAAGC CGTATTGACT ATGTACTATG TACACCTGGT
CTTCGTCCAT GGATCAAGGG CGGCGATATA CTTCCCAAGG TGTACGGGTC TGATCACTGT
CCCGTGTATG TCGACTTGCA TGAGTCCATC GTCACTCCAG AAGGGGAGAC TCTCCACCTT
CGCGATATGC TTAATCCCAA AGATCGACCC ACTAGCACGT CTCCTGTATA CCCAAATGAT
GTCAAGAGGG AAGCACCTGA GCCGCCAAGG TTCGCAACCA AATTTATGGA CGAGTTCTCA
GGGAGACAGA CAAGTCTAAA GAGTTTTTTT GGTGGAGGAT CGAAGCGGGC ACAGGAGAAG
ACGAATGGAG CCAGTCTTAG TACAAGTGTG AGCGCGAGTG CTAGTCCGGC TCCAACCCCT
ACTGCATCAG AATTATCATC GGCAGTGCAA GCATCCGAAT CTGGTGCTAC AAAAATTGTT
TCACTCCCGC AAGCGCCGGA AGAATGCGTT TCCGCCCCAT TCAGCCTTGC TCGAACTGCT
TTCAGTTCGT TGGACAATCC CTCCCCTGTT CATAGCCGGG AATTATCTTC CAAAACAAGC
GACGGTGCCC CAGTAAAGGC AACCTTATCG TCCAAACAAG ATAAATCTAG CGCAAAGCCC
ATAGACATGA CATTAGATGA CGATGAGGAT GATGAGCCAA TCTTGGTATC GTCAAAATCG
GAAAACAAGC CCCCACCGAA ACCTACGAGG TCATCGTTGT CTGGTTCGAA ATCAGCCTCC
TCACAAGTCA AGCTCTCTTC ATTTTTCAGT CAACCGCATA CCGAAGGCAA AAGAAAATCC
CCACCACCAC CATCCTCAGC TCCTTCTATC TCAAAACGGC CCTCATTGGC ACCACTTCCT
CAGTCAAATA ATGCTCCGTT TGCTGCCTCT CCTTCCGCCA CAACTGCGGA AGATGAGTGC
CAGGGCATGA CAGAAGAAGA GAATCAACTT ATTTCGCAAG CTATTGCAGA AGCAGACGCA
GAAAGAGCAG AAAAAAACGC AAAAGCTGCT CCGCAATGGT CCAGCCTTTT TGCCAAAAAG
CTGCCACCGT TGTGCACAGT TCATCATAAG CCGTGCAAGG ATTTTGGTAA GTTGCCGCAA
TGGATTTTCT GTTAA
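
The gene sequence is printed in blocks of ten bases, so it needs to be joined before any calculation such as the GC content reported in the record. The following is a minimal sketch of that clean-up and of a plain G+C percentage; the helper names are hypothetical, and the record does not say how its own GC figure was computed or rounded.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGTCTAGAT TACCACCCGT AGGTCACCAC AAGTTGCGTC GCCTGCGATG CCATCTCACT"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_content(seq), 1))
```

Passing the full 2115 bp block through the same two functions gives a figure that can be compared with the GC content field.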
 
Protein sequence
MSRLPPVGHH KLRRLRCHLT RSIYRFSSMK KKNIEGLLDE LDAQIFCFQE HKTVRTRLEK 
SMACPGPYDG FWTFPRSKTG YSGVCTYVDS RYCVPLKAEE GITGLLLGDR LSTMKPPWTD
AERIGSYPDV NDMEWMDELD GTKFDVKKLD MEGRAVVCDF GLFVLFNLYC PNETNSARRP
YKMNYLHALQ ERVQLLQAAG REVMLVGDIN IVRQPMDSGE GPVRSSAEQH YSHPARRILD
DWCAPKGPMI DVVRESWPQR DDMFTCWNQK LDARSANYGS RIDYVLCTPG LRPWIKGGDI
LPKVYGSDHC PVYVDLHESI VTPEGETLHL RDMLNPKDRP TSTSPVYPND VKREAPEPPR
FATKFMDEFS GRQTSLKSFF GGGSKRAQEK TNGASLSTSV SASASPAPTP TASELSSAVQ
ASESGATKIV SLPQAPEECV SAPFSLARTA FSSLDNPSPV HSRELSSKTS DGAPVKATLS
SKQDKSSAKP IDMTLDDDED DEPILVSSKS ENKPPPKPTR SSLSGSKSAS SQVKLSSFFS
QPHTEGKRKS PPPPSSAPSI SKRPSLAPLP QSNNAPFAAS PSATTAEDEC QGMTEEENQL
ISQAIAEADA ERAEKNAKAA PQWSSLFAKK LPPLCTVHHK PCKDFGKLPQ WIFC
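
Because the gene is spliced, translating the whole genomic sequence in one frame will not reproduce the protein beyond the first intron. The two ends can still be checked directly. The sketch below assumes the standard genetic code (NCBI translation table 1), since the Translation table field in the record is empty; the `translate` helper is illustrative and not part of any database tooling.

```python
from itertools import product

# Standard genetic code (assumed: NCBI table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide string in frame 1; '*' marks a stop codon."""
    seq = "".join(seq.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First 30 bases and last 36 bases of the gene sequence, copied from the record.
first_30 = "ATGTCTAGAT TACCACCCGT AGGTCACCAC"
last_36 = "GATTTTGGTAAGTTGCCGCAATGGATTTTCTGTTAA"

print(translate(first_30))  # MSRLPPVGHH
print(translate(last_36))   # DFGKLPQWIFC*
```

Both outputs match the start and end of the protein sequence above, with the final `*` corresponding to the TAA stop codon.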