Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA03100 |
Symbol | |
ID | 3253403 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 813738 |
End bp | 818294 |
Gene Length | 4557 bp |
Protein Length | 1323 aa |
Translation table | |
GC content | 50% |
IMG OID | 638252641 |
Product | single-stranded DNA specific endodeoxyribonuclease, putative |
Protein accession | XP_566738 |
Protein GI | 58258651 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0258] 5'-3' exonuclease (including N-terminal domain of PolI) |
TIGRFAM ID | [TIGR00600] DNA excision repair protein (rad2) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCATCT CTGATATTTA CGCGCGACGC GTTATAACGA CTGGAGCGGA CACGGAGATG ATCTCCGTCT CAGCCTACTC ATCGTCATCA TCACTAAGTT CTTCGTTGGT CCAGTCAACC GTTGCCTACT TTGACGAGAG TCCCGCTTGC CTGTTTTACC ACCACGTCCA TAGGTGGAAG CGAATAATCA ATATGGGTGT CAAAGGCCTT TGGTCTCTGC TCAACCCAGT CGCTCGTCCA GTCCAGTAAG CTGGGCCACG ACCATGTAAT GGACCTCGTG CTGATGTCTA TCCAGGATCG AGAGTATGGA AGGGAAGCGA TTGGCGATTG ATAGTTCCAT CTGGCTTTAT CAAGTAAGAA TTCTGCCATT CTGTGGCAAG ACAATTCATA TGCTAATATT ATTATCATTG CACAGTTCCA AGCGACAATG AGGGATAAAG ATGGTAGGGT CTTAGTGAAT GCACACGTTC TAGGTAAGTT CTCAAATGGA TCGCTACAAT GTACGGAATT CTAATCATAG TCATCTACTT ACTTAGGCTT CCTACGTCGG ATAAATAAAC TGTTGTTCCA CGGGATAAAA CCAGTGTTCG TCTTCGATGG AGGTGCTCCG GCGCTCAAGC GATCGACTAT CGTGAATAGC AGCTTTATAG CGTACATTTT CATAGCTGAT ATGAGACCAA TGTATAGGCC GAAAGAAAGC GAAAGAAGAC TGGTGCTGCT GCCAATCATG CCAAAGTGGC TGAGAAGCTT TTTGCCGCTC AGATGCGTCG CGAGGCAGTG AAGGCGGCAC AAGTGTTAGT GATTTCTATT TAAGCTAAGA TGGCTTCTGG AAACTGACTG TTCTCAGCGC ACAAGAACAA AAAGAGGCCA AAGCATCAGC TGAGGCAGCT GCAGAGTACG CAAGGCGCTA TCCTGATGAA GCGGGAGAAC AAATCGCTGA GGGAGCTGTA TATCTCGAAG ATTTAGAAGG TCGTGCCGGC CCTTCCCGGC CTCGTTCTCC CGGACCTGCA CAGACCGAGG GTGTCGATCC AAGTGCAGTA CCGACAGACC CGGAAAAACG AAGAAAATAC TTCAAAAAGC ATGATCCTTA TCGCTTACCT GCAGCCGAGA TGCCCACAGT ATCGACGTCT GACAGACCAG ACGCCAGATT AGCTACAGAG GAGGAGCTCA AACAATTCAT AGACGAGGTC CACCCCGAGG ACATCGACAT AGAATCAGCC GAATTTCGTG CTTTGCCCAC CGAGGTCCAG TACGAAATTA TCGGTGATTT ACGTATCCGG TCCCGCCAAC AATCTCACCG CCGACTCGCT GATATGCTTC GCGCCGCTCC TACTCCTTTA GACTTTTCCA AAGCACAGAT AAAGCACCTC TCCCAACGTA ACGCTTTGAC CCAGCAACTT TTAACGGTCA CGGATATGGT TGGTAAAGCC CACTTGACCA TTCCTGTGAG AATCGCGGCA GAAAGAAACA GAGAGTATGT ATTAGTGAAG AAAGACGAGA CTGAAGGTGG AGGATGGGCT CTGGGAATCA GAGAGGGATC GAAGGAGAAG CCAATAGAGG TTGAGCCTGC GGAACCCAAG ACTGAAAGCG AGCATGAATC GGACATTGAA CCCCTATCAC CGTGAGTTTA AGTTCTCATG GATTATATTG GAGATTTACC TGACGTACAG AATTTAGGCC TCCGCAGGCT GCTATGGATC AAGACCTCCG AGAGTATCGT CGTCAACAAG TGCTTGAAGC TATTGCTGCC CGCTACGCTC CCAAACGTCA GGCACGAGCC CCTCTCGACG TTGCAGTCAA ACCCTTTGGG CCATCACGAA CGGCATCGTC CAAGCCATTA TTCGATGTCG ATGGCGAGGA AGAAGAAGGG GAGGAAGAGG TTGTGCCGAC AGCTAACGAT GAAGCTTTGG CCTTGGCTTT GCAACAAGAA GAGCTGGGAG ACGATGAGAC AGAGGTTGAT GAAGATCTAG CAAAAGCCTT GGCTCTCAGT CGGAGAGAGG CTGAAAGGAA ATCAAGGAGC GAAAATGAAG GATTCAGAGT TGTTGACGTC GGAAAAGACG AATTGACGGA GGAAGAGGAC GGAGATATGG AAGAAGTAGA ACTCGTGCCT AGTGGCACTG TGACACCTGC TCAAGTTGAC ATAGAGGCAG AAGACAGCGA AGATGAGGAT GAGTTTGAAG AAGTCGACAC GCCATCAACG ACGTTGAGCT CTTCTCGGGT GTCATTAGGT GCTTCTATCG CCCCAGGCAT GGAAACCCCG GAAATTGTTA CCATTCATCC GAATTCTACG CGGATGATCG ACCCTATAGT CATGGATGAT GATGACGATG ATGATGATGA TAAAGGAGAG CCCCTGGTCA TGTCAATGTT TAAAGAGAAG CGGGCTTCTG TCTATCTACC TCCCCTGGCG TCGGACCTGC CCGAAAGAGC TGCTCAGTCA TCTGGCGAAC AGGCACCGCC TGCCATCACT TCTCGATCCC AAATCTCACA ATCGACAGCC GTGCCGGTAC GGCCCAAAGC ATTACAAAAG CTGAATTCAG CGGTCAATGT ACCTTCTCCT TTGAGGAACG TAGCTCAGCC AAAATCATCG CCAAAATCTT CTCCAATCTC TGTCGATGAT GTTTCCGTCC CTGCTGGTGC CCCAGACCTA CCCTTTACGG TGGAAGCCAC AGATCCAACT GAAATCAGAG GCGCACCGTC CCCGGACATG ATCCCGCCTC GTTCACCACC ATCACCTGAG CTGTATGGGT CTGGATCGCT TGAAAGACCG CATTTTTTGG AACGCCCCTC GCCACGCTCG CCTGTGATGT ACAATGAGAT GGACGACGAG GACGGCAGCG AAGCGGATGA GAAGGATGAG ACCCGATCCA CGTATTCCTG GTCACCTTCC CCGACCCCGC CACCCCGTCC TCTCCGGACG ACTGAATCGG ATGTTTCGCT CAACACTGTT GCATCTCCTT CTCCTGCCTT GATCGCAGCG CCCCGAGATG ACGATGAGGA TGACGGTGAC CTGGCGCCAG CTGATGTGGC GGCAGAATCC GATGACTATG CACGTTTCGT GGCGTCGATT AAGAATCGGG ATCTCAACGA GGTTCGCGGG GAGATTGATG ACGAGATACG GGTGTTGAAC TCTGAGAATA GGGTGGCGAT GAGAGACTCG GATGAGATTA CCCAAAGTAT GATTGCGCAA ATCCAGGCAA GTCGTTTGTG TTTCTGTGTG GGGTCAAAAG GATATCTGTG CTAAAAACTA GGCTCAAAAA ACAGACACTT TTGAGGCATT TCGGTATTCC TTACATAACA GCACCCATGG AAGCGGAAGC GCAATGTGCC AAACTGGCAC AGCTAGGTCT CGTAGACGGT ATCATCACCG ATGATTCCGA TGTTTTCCTC TTTGGCGGTG TCCAGTGCTT CAAAAACATT TTCAACGACG CCAAATACGC AGAGTGTTTC CTGCTGGCAG ATGTGGAAAG AGAATTGATG TTGACGCGCG AAAGGTTGAT TTCACTCGCG TATTTCCTGG GGAGCGATTA TACGTTGGGA TTGCCCGGGA TTGGGCCTGT TATGGGTCTC GAGATCCTGG CGAATTTTCC GGGGGAGAGG GGTCTGTATG ATTTTAAAGA ATGGTGGGGA CGAGTGCAAA AAGGGAATGA TACAGAGGAA GAGAGTGGGA CAAAATGGAG GAAATCGTTT AAAAAGAGAT TCTTGAAGAG TATATACCTC ACTGCAGACT GGCCTGACCC GCTTGTGGTG AGTTTTTTGA CCAAAAAAGG GGGGGGGAAC AAACGAGTAT TGATGATTAC CATTGCGAAT AGAGGGAGGC ATACCTATAT CCAACCGTCG ATGAATCGGA AGAGCCCTTC CATTGGGGAT TCCCCAGACT TTCAGCCCTG CGAACGTGAG TAACCTGTCC CCGTTTATTG TGCTTTGAAT GCTGAAAAAA ATGCACCCCT ACCCGTTTCA GTTTTCTCCA CGAAGAGTTA TCTTGGTCTA TATCCAAGGT GGACGACGAA TTGACGCCTA TCGTGCAACG TATATCCCTA CGGGGCAAAC ATGGGGCTTT GAATAAACAA GGGACGCTTG ATCCATTTTT CGATATGTCC GCTGGAGCGG GACATTATGC GCCTAGAAAA AGAGGTATGA ACGTTAGTAA ACGGTTGATG GGGGTTATCA AGCAGTTTAA GGAGGCCGAA ATTAGGATGA GCAAAGGCGA GGATTTGGAT GTGGACGCGA TACTTGCAGA CGAGGAGAAG GAGAAGAAGC GTAAAGCAAA GGGGAAACGA AAAATGGACG AAAAGGAGGC TGGAGATGAT GAAAAGGAGG AGGGGAGAAA CAGCAGTAGC AACAAGAGGA AGAAGACGAG TGCGCGAGGG AGGAGGAGGG CTGGAACGGC TTCGAGTGTT GGAGATTCGG TGGCGTCAAG CGAAGGAGGG AGTCGGAGTA CGAGCACGAG TGGACGGGGT AGGGGTGGGT CGAGGGCCCG TGGTCGGGCG AGGGGCAAGG GCCAAGAGTA AGCCATTTGG TGCAACGTTA CTGTAGCAAT TAGCATGTAG TAGTATT
|
Protein sequence | MTISDIYARR VITTGADTEM ISVSAYSSSS SLSSSLVQST VAYFDESPAC LFYHHVHRWK RIINMGVKGL WSLLNPVARP VQIESMEGKR LAIDSSIWLY QFQATMRDKD GRVLVNAHVL GFLRRINKLL FHGIKPVFVF DGGAPALKRS TIAERKRKKT GAAANHAKVA EKLFAAQMRR EAVKAAQVAQ EQKEAKASAE AAAEYARRYP DEAGEQIAEG AVYLEDLEGR AGPSRPRSPG PAQTEGVDPS AVPTDPEKRR KYFKKHDPYR LPAAEMPTVS TSDRPDARLA TEEELKQFID EVHPEDIDIE SAEFRALPTE VQYEIIGDLR IRSRQQSHRR LADMLRAAPT PLDFSKAQIK HLSQRNALTQ QLLTVTDMVG KAHLTIPVRI AAERNREYVL VKKDETEGGG WALGIREGSK EKPIEVEPAE PKTESEHESD IEPLSPPPQA AMDQDLREYR RQQVLEAIAA RYAPKRQARA PLDVAVKPFG PSRTASSKPL FDVDGEEEEG EEEVVPTAND EALALALQQE ELGDDETEVD EDLAKALALS RREAERKSRS ENEGFRVVDV GKDELTEEED GDMEEVELVP SGTVTPAQVD IEAEDSEDED EFEEVDTPST TLSSSRVSLG ASIAPGMETP EIVTIHPNST RMIDPIVMDD DDDDDDDKGE PLVMSMFKEK RASVYLPPLA SDLPERAAQS SGEQAPPAIT SRSQISQSTA VPVRPKALQK LNSAVNVPSP LRNVAQPKSS PKSSPISVDD VSVPAGAPDL PFTVEATDPT EIRGAPSPDM IPPRSPPSPE LYGSGSLERP HFLERPSPRS PVMYNEMDDE DGSEADEKDE TRSTYSWSPS PTPPPRPLRT TESDVSLNTV ASPSPALIAA PRDDDEDDGD LAPADVAAES DDYARFVASI KNRDLNEVRG EIDDEIRVLN SENRVAMRDS DEITQSMIAQ IQTLLRHFGI PYITAPMEAE AQCAKLAQLG LVDGIITDDS DVFLFGGVQC FKNIFNDAKY AECFLLADVE RELMLTRERL ISLAYFLGSD YTLGLPGIGP VMGLEILANF PGERGLYDFK EWWGRVQKGN DTEEESGTKW RKSFKKRFLK SIYLTADWPD PLVREAYLYP TVDESEEPFH WGFPRLSALR TFLHEELSWS ISKVDDELTP IVQRISLRGK HGALNKQGTL DPFFDMSAGA GHYAPRKRGM NVSKRLMGVI KQFKEAEIRM SKGEDLDVDA ILADEEKEKK RKAKGKRKMD EKEAGDDEKE EGRNSSSNKR KKTSARGRRR AGTASSVGDS VASSEGGSRS TSTSGRGRGG SRARGRARGK GQE
|
| |