Gene CNG03640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG03640 
Symbol 
ID3258636 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp1020653 
End bp1022226 
Gene Length1574 bp 
Protein Length452 aa 
Translation table 
GC content48% 
IMG OID638257988 
ProductDNA-(apurinic or apyrimidinic site) lyase, putative 
Protein accessionXP_572070 
Protein GI58269828 
COG category[L] Replication, recombination and repair 
COG ID[COG0177] Predicted EndoIII-related endonuclease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value0.433978 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCAAACATCA GAATAATGTC GAGAAGGCCA AATCTCCGCT CAACAAAGTT AGCACTCAAC 
CAGAAGGTCT CTATTATCCC AACAGATGCG GTCAAGGCAG AGCCGCCAAG CTCCAAGCTT
ACGTATAGTT CCCTCCGTCG ACCGACCAGA TCGAGTGCTA CAGTGGAAGA ATTGGCATCG
CCCGTCAAGA AGAACAAATT AAACATCGCC AAGTACGAAT ACAAGGGCTC GATACCTTCC
CCTAGGAAGC GTCCGAGGAT AGATGATGTT GTCAAGGAGG AAGAAATTGA AACAAAGGCA
AGCCCAATAA AGTCTCCGGC AAAAAAACCA TTGCCACAGG TAGCGCTTGC AAAACCTCAT
GCGGCCCCTG CAAAATGGGA AGAACAGTAC CGATTGATTG AAAAGATGAG ACGGGGTATT
GTCGCTCCTG TTGATGATAT GTAAGCGATT CGTGACCTCT CGTGCCGAAC GCATATACTG
ATAGCTTTGT TTAGGGGCTG CGAACGGCCG AGAACCAATA CCGAAGGAGA TCCAAAGGTA
TTTATTTTCA GTTAAAGCCT CATGTCGGTG CTAATCCGAG CATAAGACTT TTCGTTTCCA
CATCCTCATA TCTCTCATGC TCTCCTCTCA AACAAAAGAT GCTGTGACCT CAGCAGCCGT
CACCTCTCTT CACACCTCTC TGCCAGGTGG TCTTTCTGCC GCCTCTCTGG CCGCTGCACC
CTTGGAAACC ATCCAGGAAT GTATCAACAA GGTTGGATTC TGGCGACGAA AGGCAGAATA
CATCCAAGAG GCTGCAAAGA CACTTTTGGA ACAAGAAGGA GATGAGAAAG GAGACGTGCC
AAAGACGGTC GAAGGTTTGT GCAAGTTGAA GGGCGTAGGG CCTAAAATGG CTTTCTTGGC
CCTGCAATGC GCTTGGGATA TGTATGTAAT CTTTCGTTAT CCTTCATCTA TACCTCTCAA
CTGACATAAA ATAAAAAGTA ATGCTGGAAT CGGAGTTGAC GTCCACGTTC ATCGCATCAC
AAATCGCCTC AAATGGCACC GTCCACCTAC ATCCACCCCA GAACAAACCC GACTCAACCT
TCAATCATGG CTTCCCCCCC ATTTACATAA ACCTATCAAC CCCTTGATGG TCGGTTTTGG
TCAAGTGATC TGCCTCCCAG TTGGGCCTAG GTGCGATATC TGTCTGTTAG GCCAAAAGGA
GATATGCCCA AGTCGAGTAA AAGGGGCGAA TGCCAAGGGC AGAAAAGAGG TGACGTATAG
CTTCAAGGAA GAGGAGGATG AACTTGCTAT CGGGCAGTGG CGGTGGGGTC AAGCGAAGGG
AGTTAAGAGT GAGGCCAAGG TTGAGATTGG ATATGAGGGA GGATTAGAGA AAATCAAAGA
TGAGGAACCA GAGAATTCGG TCGAGGTGGA GCAGATGATC AAGGAACCAG GGATGAGACG
ACCTGATGAA GTGTTAGAGG TCTTGGATCA GGTAGATGGC CCCACGGATA TCGGGGCAGA
GCCTGTCATA AAGACCGAAA ATGTCGATTG GTAATCATTG GGTATCATAA TCAGTCTTTG
CATCATTCGT CGGT
 
Protein sequence
MSRRPNLRST KLALNQKVSI IPTDAVKAEP PSSKLTYSSL RRPTRSSATV EELASPVKKN 
KLNIAKYEYK GSIPSPRKRP RIDDVVKEEE IETKASPIKS PAKKPLPQVA LAKPHAAPAK
WEEQYRLIEK MRRGIVAPVD DMGCERPRTN TEGDPKTFRF HILISLMLSS QTKDAVTSAA
VTSLHTSLPG GLSAASLAAA PLETIQECIN KVGFWRRKAE YIQEAAKTLL EQEGDEKGDV
PKTVEGLCKL KGVGPKMAFL ALQCAWDINA GIGVDVHVHR ITNRLKWHRP PTSTPEQTRL
NLQSWLPPHL HKPINPLMVG FGQVICLPVG PRCDICLLGQ KEICPSRVKG ANAKGRKEVT
YSFKEEEDEL AIGQWRWGQA KGVKSEAKVE IGYEGGLEKI KDEEPENSVE VEQMIKEPGM
RRPDEVLEVL DQVDGPTDIG AEPVIKTENV DW