Gene CNA03100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA03100 
Symbol 
ID3253403 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp813738 
End bp818294 
Gene Length4557 bp 
Protein Length1323 aa 
Translation table 
GC content50% 
IMG OID638252641 
Productsingle-stranded DNA specific endodeoxyribonuclease, putative 
Protein accessionXP_566738 
Protein GI58258651 
COG category[L] Replication, recombination and repair 
COG ID[COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) 
TIGRFAM ID[TIGR00600] DNA excision repair protein (rad2) 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCATCT CTGATATTTA CGCGCGACGC GTTATAACGA CTGGAGCGGA CACGGAGATG 
ATCTCCGTCT CAGCCTACTC ATCGTCATCA TCACTAAGTT CTTCGTTGGT CCAGTCAACC
GTTGCCTACT TTGACGAGAG TCCCGCTTGC CTGTTTTACC ACCACGTCCA TAGGTGGAAG
CGAATAATCA ATATGGGTGT CAAAGGCCTT TGGTCTCTGC TCAACCCAGT CGCTCGTCCA
GTCCAGTAAG CTGGGCCACG ACCATGTAAT GGACCTCGTG CTGATGTCTA TCCAGGATCG
AGAGTATGGA AGGGAAGCGA TTGGCGATTG ATAGTTCCAT CTGGCTTTAT CAAGTAAGAA
TTCTGCCATT CTGTGGCAAG ACAATTCATA TGCTAATATT ATTATCATTG CACAGTTCCA
AGCGACAATG AGGGATAAAG ATGGTAGGGT CTTAGTGAAT GCACACGTTC TAGGTAAGTT
CTCAAATGGA TCGCTACAAT GTACGGAATT CTAATCATAG TCATCTACTT ACTTAGGCTT
CCTACGTCGG ATAAATAAAC TGTTGTTCCA CGGGATAAAA CCAGTGTTCG TCTTCGATGG
AGGTGCTCCG GCGCTCAAGC GATCGACTAT CGTGAATAGC AGCTTTATAG CGTACATTTT
CATAGCTGAT ATGAGACCAA TGTATAGGCC GAAAGAAAGC GAAAGAAGAC TGGTGCTGCT
GCCAATCATG CCAAAGTGGC TGAGAAGCTT TTTGCCGCTC AGATGCGTCG CGAGGCAGTG
AAGGCGGCAC AAGTGTTAGT GATTTCTATT TAAGCTAAGA TGGCTTCTGG AAACTGACTG
TTCTCAGCGC ACAAGAACAA AAAGAGGCCA AAGCATCAGC TGAGGCAGCT GCAGAGTACG
CAAGGCGCTA TCCTGATGAA GCGGGAGAAC AAATCGCTGA GGGAGCTGTA TATCTCGAAG
ATTTAGAAGG TCGTGCCGGC CCTTCCCGGC CTCGTTCTCC CGGACCTGCA CAGACCGAGG
GTGTCGATCC AAGTGCAGTA CCGACAGACC CGGAAAAACG AAGAAAATAC TTCAAAAAGC
ATGATCCTTA TCGCTTACCT GCAGCCGAGA TGCCCACAGT ATCGACGTCT GACAGACCAG
ACGCCAGATT AGCTACAGAG GAGGAGCTCA AACAATTCAT AGACGAGGTC CACCCCGAGG
ACATCGACAT AGAATCAGCC GAATTTCGTG CTTTGCCCAC CGAGGTCCAG TACGAAATTA
TCGGTGATTT ACGTATCCGG TCCCGCCAAC AATCTCACCG CCGACTCGCT GATATGCTTC
GCGCCGCTCC TACTCCTTTA GACTTTTCCA AAGCACAGAT AAAGCACCTC TCCCAACGTA
ACGCTTTGAC CCAGCAACTT TTAACGGTCA CGGATATGGT TGGTAAAGCC CACTTGACCA
TTCCTGTGAG AATCGCGGCA GAAAGAAACA GAGAGTATGT ATTAGTGAAG AAAGACGAGA
CTGAAGGTGG AGGATGGGCT CTGGGAATCA GAGAGGGATC GAAGGAGAAG CCAATAGAGG
TTGAGCCTGC GGAACCCAAG ACTGAAAGCG AGCATGAATC GGACATTGAA CCCCTATCAC
CGTGAGTTTA AGTTCTCATG GATTATATTG GAGATTTACC TGACGTACAG AATTTAGGCC
TCCGCAGGCT GCTATGGATC AAGACCTCCG AGAGTATCGT CGTCAACAAG TGCTTGAAGC
TATTGCTGCC CGCTACGCTC CCAAACGTCA GGCACGAGCC CCTCTCGACG TTGCAGTCAA
ACCCTTTGGG CCATCACGAA CGGCATCGTC CAAGCCATTA TTCGATGTCG ATGGCGAGGA
AGAAGAAGGG GAGGAAGAGG TTGTGCCGAC AGCTAACGAT GAAGCTTTGG CCTTGGCTTT
GCAACAAGAA GAGCTGGGAG ACGATGAGAC AGAGGTTGAT GAAGATCTAG CAAAAGCCTT
GGCTCTCAGT CGGAGAGAGG CTGAAAGGAA ATCAAGGAGC GAAAATGAAG GATTCAGAGT
TGTTGACGTC GGAAAAGACG AATTGACGGA GGAAGAGGAC GGAGATATGG AAGAAGTAGA
ACTCGTGCCT AGTGGCACTG TGACACCTGC TCAAGTTGAC ATAGAGGCAG AAGACAGCGA
AGATGAGGAT GAGTTTGAAG AAGTCGACAC GCCATCAACG ACGTTGAGCT CTTCTCGGGT
GTCATTAGGT GCTTCTATCG CCCCAGGCAT GGAAACCCCG GAAATTGTTA CCATTCATCC
GAATTCTACG CGGATGATCG ACCCTATAGT CATGGATGAT GATGACGATG ATGATGATGA
TAAAGGAGAG CCCCTGGTCA TGTCAATGTT TAAAGAGAAG CGGGCTTCTG TCTATCTACC
TCCCCTGGCG TCGGACCTGC CCGAAAGAGC TGCTCAGTCA TCTGGCGAAC AGGCACCGCC
TGCCATCACT TCTCGATCCC AAATCTCACA ATCGACAGCC GTGCCGGTAC GGCCCAAAGC
ATTACAAAAG CTGAATTCAG CGGTCAATGT ACCTTCTCCT TTGAGGAACG TAGCTCAGCC
AAAATCATCG CCAAAATCTT CTCCAATCTC TGTCGATGAT GTTTCCGTCC CTGCTGGTGC
CCCAGACCTA CCCTTTACGG TGGAAGCCAC AGATCCAACT GAAATCAGAG GCGCACCGTC
CCCGGACATG ATCCCGCCTC GTTCACCACC ATCACCTGAG CTGTATGGGT CTGGATCGCT
TGAAAGACCG CATTTTTTGG AACGCCCCTC GCCACGCTCG CCTGTGATGT ACAATGAGAT
GGACGACGAG GACGGCAGCG AAGCGGATGA GAAGGATGAG ACCCGATCCA CGTATTCCTG
GTCACCTTCC CCGACCCCGC CACCCCGTCC TCTCCGGACG ACTGAATCGG ATGTTTCGCT
CAACACTGTT GCATCTCCTT CTCCTGCCTT GATCGCAGCG CCCCGAGATG ACGATGAGGA
TGACGGTGAC CTGGCGCCAG CTGATGTGGC GGCAGAATCC GATGACTATG CACGTTTCGT
GGCGTCGATT AAGAATCGGG ATCTCAACGA GGTTCGCGGG GAGATTGATG ACGAGATACG
GGTGTTGAAC TCTGAGAATA GGGTGGCGAT GAGAGACTCG GATGAGATTA CCCAAAGTAT
GATTGCGCAA ATCCAGGCAA GTCGTTTGTG TTTCTGTGTG GGGTCAAAAG GATATCTGTG
CTAAAAACTA GGCTCAAAAA ACAGACACTT TTGAGGCATT TCGGTATTCC TTACATAACA
GCACCCATGG AAGCGGAAGC GCAATGTGCC AAACTGGCAC AGCTAGGTCT CGTAGACGGT
ATCATCACCG ATGATTCCGA TGTTTTCCTC TTTGGCGGTG TCCAGTGCTT CAAAAACATT
TTCAACGACG CCAAATACGC AGAGTGTTTC CTGCTGGCAG ATGTGGAAAG AGAATTGATG
TTGACGCGCG AAAGGTTGAT TTCACTCGCG TATTTCCTGG GGAGCGATTA TACGTTGGGA
TTGCCCGGGA TTGGGCCTGT TATGGGTCTC GAGATCCTGG CGAATTTTCC GGGGGAGAGG
GGTCTGTATG ATTTTAAAGA ATGGTGGGGA CGAGTGCAAA AAGGGAATGA TACAGAGGAA
GAGAGTGGGA CAAAATGGAG GAAATCGTTT AAAAAGAGAT TCTTGAAGAG TATATACCTC
ACTGCAGACT GGCCTGACCC GCTTGTGGTG AGTTTTTTGA CCAAAAAAGG GGGGGGGAAC
AAACGAGTAT TGATGATTAC CATTGCGAAT AGAGGGAGGC ATACCTATAT CCAACCGTCG
ATGAATCGGA AGAGCCCTTC CATTGGGGAT TCCCCAGACT TTCAGCCCTG CGAACGTGAG
TAACCTGTCC CCGTTTATTG TGCTTTGAAT GCTGAAAAAA ATGCACCCCT ACCCGTTTCA
GTTTTCTCCA CGAAGAGTTA TCTTGGTCTA TATCCAAGGT GGACGACGAA TTGACGCCTA
TCGTGCAACG TATATCCCTA CGGGGCAAAC ATGGGGCTTT GAATAAACAA GGGACGCTTG
ATCCATTTTT CGATATGTCC GCTGGAGCGG GACATTATGC GCCTAGAAAA AGAGGTATGA
ACGTTAGTAA ACGGTTGATG GGGGTTATCA AGCAGTTTAA GGAGGCCGAA ATTAGGATGA
GCAAAGGCGA GGATTTGGAT GTGGACGCGA TACTTGCAGA CGAGGAGAAG GAGAAGAAGC
GTAAAGCAAA GGGGAAACGA AAAATGGACG AAAAGGAGGC TGGAGATGAT GAAAAGGAGG
AGGGGAGAAA CAGCAGTAGC AACAAGAGGA AGAAGACGAG TGCGCGAGGG AGGAGGAGGG
CTGGAACGGC TTCGAGTGTT GGAGATTCGG TGGCGTCAAG CGAAGGAGGG AGTCGGAGTA
CGAGCACGAG TGGACGGGGT AGGGGTGGGT CGAGGGCCCG TGGTCGGGCG AGGGGCAAGG
GCCAAGAGTA AGCCATTTGG TGCAACGTTA CTGTAGCAAT TAGCATGTAG TAGTATT
 
Protein sequence
MTISDIYARR VITTGADTEM ISVSAYSSSS SLSSSLVQST VAYFDESPAC LFYHHVHRWK 
RIINMGVKGL WSLLNPVARP VQIESMEGKR LAIDSSIWLY QFQATMRDKD GRVLVNAHVL
GFLRRINKLL FHGIKPVFVF DGGAPALKRS TIAERKRKKT GAAANHAKVA EKLFAAQMRR
EAVKAAQVAQ EQKEAKASAE AAAEYARRYP DEAGEQIAEG AVYLEDLEGR AGPSRPRSPG
PAQTEGVDPS AVPTDPEKRR KYFKKHDPYR LPAAEMPTVS TSDRPDARLA TEEELKQFID
EVHPEDIDIE SAEFRALPTE VQYEIIGDLR IRSRQQSHRR LADMLRAAPT PLDFSKAQIK
HLSQRNALTQ QLLTVTDMVG KAHLTIPVRI AAERNREYVL VKKDETEGGG WALGIREGSK
EKPIEVEPAE PKTESEHESD IEPLSPPPQA AMDQDLREYR RQQVLEAIAA RYAPKRQARA
PLDVAVKPFG PSRTASSKPL FDVDGEEEEG EEEVVPTAND EALALALQQE ELGDDETEVD
EDLAKALALS RREAERKSRS ENEGFRVVDV GKDELTEEED GDMEEVELVP SGTVTPAQVD
IEAEDSEDED EFEEVDTPST TLSSSRVSLG ASIAPGMETP EIVTIHPNST RMIDPIVMDD
DDDDDDDKGE PLVMSMFKEK RASVYLPPLA SDLPERAAQS SGEQAPPAIT SRSQISQSTA
VPVRPKALQK LNSAVNVPSP LRNVAQPKSS PKSSPISVDD VSVPAGAPDL PFTVEATDPT
EIRGAPSPDM IPPRSPPSPE LYGSGSLERP HFLERPSPRS PVMYNEMDDE DGSEADEKDE
TRSTYSWSPS PTPPPRPLRT TESDVSLNTV ASPSPALIAA PRDDDEDDGD LAPADVAAES
DDYARFVASI KNRDLNEVRG EIDDEIRVLN SENRVAMRDS DEITQSMIAQ IQTLLRHFGI
PYITAPMEAE AQCAKLAQLG LVDGIITDDS DVFLFGGVQC FKNIFNDAKY AECFLLADVE
RELMLTRERL ISLAYFLGSD YTLGLPGIGP VMGLEILANF PGERGLYDFK EWWGRVQKGN
DTEEESGTKW RKSFKKRFLK SIYLTADWPD PLVREAYLYP TVDESEEPFH WGFPRLSALR
TFLHEELSWS ISKVDDELTP IVQRISLRGK HGALNKQGTL DPFFDMSAGA GHYAPRKRGM
NVSKRLMGVI KQFKEAEIRM SKGEDLDVDA ILADEEKEKK RKAKGKRKMD EKEAGDDEKE
EGRNSSSNKR KKTSARGRRR AGTASSVGDS VASSEGGSRS TSTSGRGRGG SRARGRARGK
GQE