Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB00310 |
Symbol | |
ID | 3255791 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 84020 |
End bp | 88848 |
Gene Length | 4829 bp |
Protein Length | 1376 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254682 |
Product | gamma DNA-directed DNA polymerase, putative |
Protein accession | XP_568772 |
Protein GI | 58262724 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0749] DNA polymerase I - 3'-5' exonuclease and polymerase domains |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 3 |
Plasmid unclonability p-value | 0.00002081 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGCGG CACTGGCCGA CTCTGGCGAA CCGAGTACGT TTGTATGGTG TCCTCCCACT CCGCTGAAAA CTCCCGATTG GCCTGATTAA TTGCATGTCG CAGAACCGGT TATGCTGTCC CGATCTCAGT CTGTTCACAG TTCTTATGAT TAGCTTTTAA TCATTTTCGG TTTATTTGTT TGTCACATCT GATTTTTATC GCACCATATT CATCGTTCCC CGATATCATT CATTTCGACT CATGTTCATA CTGGCTTGTG TTCCGCCGGT CTCACGTCTG TAATACTTTA CATCAACATG CGCAAGGCGC TCGATATTTC CCGGCTTACT CGGCCTGCTC GTATACGATG CCGACCGTCT CTTTTTCTTC GAAACGGGTC ACTCTCATCG TCTGCTTCCC AAAGCAAACC ATCGGACGCA CCTGTGAAGG TTTCGGATGT GAAGGAGAAG GGAGTTGAGA AACCTCTAGT CCCTGCCTTT GGAGCTCGAC GAGCGGAAAT GGAGGATTAT ATATTGGCTA TGGAAATGGC TAAATTGGAA GATGGTTATG ACCAACCTCG AGGTATGTCC AAGAGACCCG GAAGAAGCTC CACTAATAAC CATACATTGT CTTCACAGTC AGGAAGATTC GCAAATCCAA ATTGCCCTCC CTGCATGATC CGCAATCTTT TCTATGGGAT TCGAAGCAAG CCTCGTCATC CAAGGTTACT TCATCCGCAT CACCAACATT GCAACCAAGC AGAAAGGGTA AAGAGAAAGA AGTAGTTGCA AATGAATATG AAACGAGTGA GCCAAATCAG GAAGTCCAAA CTCTGGAGGA TGCCAAACCC ATCGACTCGG AAGGAAGGAA GAGCAGTACG TTTTTCGTAT AGGATGCCCT TTTCCGATTG CTGATACGAA AACAGGTCCA CGGAGGAATC CGGTTGGGGT CCAAATGCTC TCTCCATCTC TCCACTCTCA GCTCTTTCCT GGTCAGCCTT TGCCCAAGCC GCCTCAAGCT CTCTTGGATA TCTCAAAAAG TCATCTCAAA GATAATGATC TTTTTCCAGA AGGAGCCGCC GTCCTCCCCG AAATATCGTT TGATCTGCCA TCTTTACGTG GGAATAACAT ACGAGACCAT TTTCACGCTC TCGGGCAGTA CACGGCAGAA CCACATGCCA GCATGGCCAG GAAATTTGCA GCGACCAGAC TTCCTGCCAA ACCTGATAGG TGGGAGATGG GGCGGTCAGG GTGGACCAAG TACTATTCAG ATGGAAGAAT GGAGGCGGTA GACGATCTGG GAGATGAGAC TTTGGTTTCA TTCGACGTCG AAGTCCTTTA TAAACTCTCT CGCTTCCCGG TCATGGCGAC GGCAGTCACA CCAAATGCCT GGTACTCCTG GTTATCTCCT GTCATCTTTC AGTCTCCACC CGCCGAAATT CCCGAACCGC CCCCTCCATG GGAAGCTAGC ACTCCAACCT ATCACCCTCA CGAACTCATC CCCTTGTTCA ACAATAAGTC TTCAATACCC CGAATTGTCA TTGGTCATAA CGTGGGCTAT GATCGAGCGC GGGTAAAGGA AGAGTATTCC TTAGAAAGGA CGCAGACACG ATGGCTTGAT ACGCTCTCAC TTCATGTATC CACGCGAGGC ATTACATCTG TCCAACGTCC TGCTTGGATG GCGTATAAGA AAAATAAGAA AGCGAAAAAG CTACGTGAGC AAGAAAACCT CTCAATCCTA CAGGAGATGG CGGAGAAAAG CGGTGACGAT ACAATCATGG ACAGTCTACA GGAATTCGGG GCAGCTAGCG AAACCGAGGA AGCCGAGGCC CTGCAGAGTC GATGGGAAGA CGTTACTTCA ATGAACTCTT TGGCGGAAGT GGCAGCGTTA CATTGCGGAT ACCCAGTTGA CAAGTCAGTT CGGGACCGCT TTGGTGATGA CTCAATTAAG CATGCTTCAC AAATTCACAG TGAACTTCAT CAGCTCCTGT CCTATTGTGC CGATGACGTC CGTGTGACTC ACGATGTCTA CGCTAAAGTC TTCCCGCTTT TCCTTGAATC TTGTCCTCAT CCTGCCACGT TGTCAGGTGT ATTGTCCATG GGTAGTTCTT TCCTACCTGT TGATCAAAGC TGGAAGGAGT ATCTCAGAAA TGCAGAGGAG ACGTATAGGG AGATGGATGT GGCTGTTAAG AAAGCGCTGA GGTTGCTGGC CGAGAAATTA AGAGCTGAGG GTGAACCAAA GGAAGGAGAT CCTTGGGCAT CGCAACTTGA CTGGTCGCCA AAAAATGCTA GATGGTCGGA CGAGGACCTT GAAGCAACCG GAAAACACTC TGTGCAACCA CAAGAAAGCG CTCAACCTCA CAAACCAGGG TTCAACTCTG GTGCCTCCTC TCCTACATGG CTCACGCAGA TATCTTCTAA TCACTCCATC CTGAAATCCA ACATGTCACA ACGCTACCTT TTGCCCCTCC TGCTTCGCAT GTCTTTTAAG GGCCATCCAG TTGCATACCT TTCCGAACAT GGCTGGTGCT TCATGGTGCC ACATGACCAA GTTGGCGACT ATTTCGACAC ACACGGTTCT CCACATATGC TTAGCGCAAA AGACAATAGA TTGGAAAAGT TGGAAGAGAG TTATTCGTTC TTCAGGATTG GAAACGCGGG TTCCCCAAAA AAGACAAAGC TCGTAGGACC GTCAATAAAG CCCTTCGTGA AGAGCGGAGA CTTGACAAGC GCGTACCCTG AGCTATTGAT CAAGGTGATG AAGACAGATC TCAGTGATGC CGTGGAAGAC TTGTGGGAAT GCGTGGTGGA TATAGGGAAT TTGAAAGAAA GTGAATGGGG ACAGCAGCTT GACTGGACAC CCGCAACTGA AGGTGAGTGC TGTAATCCCC TATATCGTCT CTTCTGCTAA TTTTCTCTAC TTAAGGTAGT GCTTCGTCCA ATGATGTGCC ACTCTCTTCC TCATCATCTA GTCTGCGACC TTCATCAATC AAGAAATCCA AAACCAACCA TGGCACTTGG CCTAAATGGT ATTGGGATCT CACAGGACCA GTCTCCCGTC TTCCGGTTGG TGAACTCGAC CTTACTTGCA AAAAAACAAT CGCACCTCTT CTCTTGCGAC TTCAATGGCA AGGTTTTCCG CTAGTGCATA GTAAAGAACA TAAGTGGCTG TATCGTCTTC CAAGGAAAGT GTACGAAGAG GAGGATGAAC GTATTGCGAA GGCTCGAGGG CTTCCCGTTT CTTTCAGAGA AGAGGGACCG GATGCTGTCT TTGCCAAGGA TGACGATCAC GTTTACTTCC GACTACCGCA CAAAGATGGA GAGGGCAAGA ATGTCGGCAA CCCCTTGTCC AAGGGCTTTG TCAAAGCCAT CGAGTCCGGA GAACTTGCTT CGGCGGCGGC AGAGAGCGGG GACGACGTCG CTGCAAAAGC TGCAGCGGAC GCAACAAATA TGAACGCATT CTGTAGCTAC TGGATCAGTT CACGAGAGAG GATAATGGAT CAAATGGTTG TATACAGGGA TCAGGAATTT GGTATGATCC TACCTCAGGT CATTACCATG GGCACTGTCA CTCGCCGAGC CGTTGAAGCG ACTTGGTTGA CCGCTTCGAA TGCCAAAAAA AACCGTGTAG GTTCCGAACT CAAAGCTATG GTCCGAGCCC CTCCAGGCTA TTCTATTGTG GGTTCAGACG TCGATTCCGA AGAGCTCTGG ATCTCTAGTG TTATGGGAGA CTCGCAGTTT GGTATGCATG GGGCGACTGC AATCGGATGG ATGACCCTAG AAGGAAACAA ATCGGCAGGA ACAGATCTGC ACTCAAAAAC CGCCAGTATC TTGGGTATCT CTCGAGATGC GGCCAAGGTC TTCAATTATT CGCGAATCTA TGGTGCGGGG AAAAAGCACG CCGTACAACT GTTGTTACAA GGTGACTCGA AGCTTACGAA AGAGACGGCG GGGAAGTTGG CCGACAACTT GTATAAGTCG ACCAAAGGTG CCAAAGCTCT TCGAGCAAGG AACCTTCCTG TCGCATCTGT TCCATCGCTC TGGCACGGAG GTTCAGAAAG TTACCTCTTT AACACCCTCG AAGCTATTGC ACTTAGCGAT CGACCAACCA CCCCAGCGCT TGGCTGTGGT GTAACCAGAG CTCTTCGCAA GTCTTACCTA GAAGAAAGCG CCTCGTATCT TCCTTCGCGA GTGAACTGGG TTGTACAGTC ATCTGGTGTC GATTATCTCC ACCTCCTCAT TGTCTCTATG GAATATCTTA TCAAGAAGTA TGATATCAAA GCTCGTTATC TTATCTCTGT GCACGACGAA GTCCGCTATC TCGCGAAGGA AGAAGACCGC TATCGGACAG CTCTTGCTCT ACAAGTTGCG AATGCTTGGA CCAGAGCCTT ATTCTGCTTC AACCTAGGAA TCGACGATAT GCCGCAGGGC ATCACGTTCT TCTCGGCAGT CGACATTGAT CATGTGTTGA GAAAGGAGGT TTTCCTTACA TGTGAGACAC CCAGCCATCC AAAGGTCATT CCCGCTGGCG AATCACTTGA TATCAATTCG CTTCTTGAGA AGGTTCCGCG GGGGGACCTT GGTACCCCTA TTCCCGACGA TCTTCAGCCA CCAACTGACA TCAAACCCCC AGTTGCTCTA TTCCCTAACA TTCAATCCGC TCAACATCGT CAGTTCCTCC AGGCACAGGC CAGTAAAGGA GGTATGGGCG CGAAGAAGTG GCTGGATAAC TTGCCTCCGG TGCAATATAT CGATGAGGTG AATGAAGAAA ATGGGAAACC TTATCAGAAG AGCCACAAGA AAGCTGTGCT GTCGTCTAGC AAGAAACCTC GATGGTAGAA GGATGTTTA
|
Protein sequence | MSAALADSGE PIRKIRKSKL PSLHDPQSFL WDSKQASSSK VTSSASPTLQ PSRKGKEKEV VANEYETSEP NQEVQTLEDA KPIDSEGRKS SPRRNPVGVQ MLSPSLHSQL FPGQPLPKPP QALLDISKSH LKDNDLFPEG AAVLPEISFD LPSLRGNNIR DHFHALGQYT AEPHASMARK FAATRLPAKP DRWEMGRSGW TKYYSDGRME AVDDLGDETL VSFDVEVLYK LSRFPVMATA VTPNAWYSWL SPVIFQSPPA EIPEPPPPWE ASTPTYHPHE LIPLFNNKSS IPRIVIGHNV GYDRARVKEE YSLERTQTRW LDTLSLHVST RGITSVQRPA WMAYKKNKKA KKLREQENLS ILQEMAEKSG DDTIMDSLQE FGAASETEEA EALQSRWEDV TSMNSLAEVA ALHCGYPVDK SVRDRFGDDS IKHASQIHSE LHQLLSYCAD DVRVTHDVYA KVFPLFLESC PHPATLSGVL SMGSSFLPVD QSWKEYLRNA EETYREMDVA VKKALRLLAE KLRAEGEPKE GDPWASQLDW SPKNARWSDE DLEATGKHSV QPQESAQPHK PGFNSGASSP TWLTQISSNH SILKSNMSQR YLLPLLLRMS FKGHPVAYLS EHGWCFMVPH DQVGDYFDTH GSPHMLSAKD NRLEKLEESY SFFRIGNAGS PKKTKLVGPS IKPFVKSGDL TSAYPELLIK VMKTDLSDAV EDLWECVVDI GNLKESEWGQ QLDWTPATEG SASSNDVPLS SSSSSLRPSS IKKSKTNHGT WPKWYWDLTG PVSRLPVGEL DLTCKKTIAP LLLRLQWQGF PLVHSKEHKW LYRLPRKVYE EEDERIAKAR GLPVSFREEG PDAVFAKDDD HVYFRLPHKD GEGKNVGNPL SKGFVKAIES GELASAAAES GDDVAAKAAA DATNMNAFCS YWISSRERIM DQMVVYRDQE FGMILPQVIT MGTVTRRAVE ATWLTASNAK KNRVGSELKA MVRAPPGYSI VGSDVDSEEL WISSVMGDSQ FGMHGATAIG WMTLEGNKSA GTDLHSKTAS ILGISRDAAK VFNYSRIYGA GKKHAVQLLL QGDSKLTKET AGKLADNLYK STKGAKALRA RNLPVASVPS LWHGGSESYL FNTLEAIALS DRPTTPALGC GVTRALRKSY LEESASYLPS RVNWVVQSSG VDYLHLLIVS MEYLIKKYDI KARYLISVHD EVRYLAKEED RYRTALALQV ANAWTRALFC FNLGIDDMPQ GITFFSAVDI DHVLRKEVFL TCETPSHPKV IPAGESLDIN SLLEKVPRGD LGTPIPDDLQ PPTDIKPPVA LFPNIQSAQH RQFLQAQASK GGMGAKKWLD NLPPVQYIDE VNEENGKPYQ KSHKKAVLSS SKKPRW
|
| |