Gene Information |
Locus tag | CNI01000 |
Symbol | |
ID | 3259446 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 245948 |
End bp | 248965 |
Gene Length | 3018 bp |
Protein Length | 748 aa |
Translation table | |
GC content | 45% |
IMG OID | 638258585 |
Product | DNA helicase, putative |
Protein accession | XP_572723 |
Protein GI | 58271134 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1112] Superfamily I DNA and RNA helicases and helicase subunits |
TIGRFAM ID | [TIGR00376] DNA helicase, putative |
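
As a consistency check on the coordinate fields above, the following minimal sketch recomputes the gene length from Start bp and End bp, assuming the coordinates are 1-based and inclusive (an assumption that matches the listed 3018 bp). The function names `gene_length` and `non_coding_bp` are hypothetical helpers, not part of any database API. The second helper estimates how many bases are not accounted for by the protein, assuming one stop codon is counted; for a spliced gene this remainder would cover introns and any untranslated regions.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def non_coding_bp(gene_len_bp: int, protein_len_aa: int) -> int:
    """Bases not explained by the protein (assumes one stop codon is counted)."""
    coding_bp = (protein_len_aa + 1) * 3  # codons for each residue plus the stop
    return gene_len_bp - coding_bp


# Values taken from the record above.
length = gene_length(245948, 248965)
print(length)                      # 3018
print(non_coding_bp(length, 748))  # 771
```

The 771 bp remainder is consistent with the "Is gene spliced | Yes" field, though the record itself does not say how that remainder is divided between introns and untranslated sequence.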
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.359255 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGACGAGCCG AAAAGTACAA AGTAGCTTCG CCCAATCAGA CAGAATGACC TTGGACATCC TCCAGCCCAA GCCGACTCCG GACGCCTCCG TCCAGTCATT TCTGAATAGG CATAAACAGC TTTTGGAACT GGAGAGAAAG GCAGAGGAGG AGCAGACAAG ACTGCTCAAC TCGAAATGTT CGCCTTCTCT GCTTGAGCAG AGAGGTCTGG CTTTGAATGG TTTAGGGGTT TCTGGTATTA GCATTGGACT AGGAGGGAAA AGGTAACTTC AAGAAACAGA CTTTATAATG AGGAATAAGT GTGATACTGA CGATTTATTA GTTTAATCGA GCTCCATAGG CCTTTGGCAT ACCACACTTC GCCAGCGTTG CCTCCACACA CGTTTCGTCC AGGAGACCCA GTGCGTATAG AAGCGCACGT GGCCACTACA TCTGGCAAGA GCAAAGGAAA GAAAAGAGAA TCAGAAGATG AAGGTGCTGT GGAGGGTGTA GTCTATCGGG TCGGGCCTGA AAAGGTGGTC GTCGCCGTGA ATGAGTCGAA AGAAATAGAT CTGCCAGAAA GATTGAGGCT GTAAGTGCCT TCTTTTTTGT GTGACGGGTA GTATGCTCAT ACGTATTCAG ACTCAAACTT GCCAATTCAG TCACCTTCGA TCGCATGGAA AAAACTTTGG CACACCTTGA ACGTTTGGTG CTCCCTTCAG GAGGAACTCC TTCACCTCCA TCTTTCAACA TGCCGCTTGT ACAAGCCCTT CTCGGTAAAC AACTGCCAAC ATGGAAGGAA ACAATACCTC CTCAAAATAA TATAGAACCT TTGGGGCAAC TGTCATCTGA TGAGGACATG AAGTGGTTTG GCCAACATTT AAATGACAGT CAGAAAGAAA GCATCAAGTT CTGCCTAAAA GCAAATGAGG TCGCTTGTAT TCATGGTCCA CCAGGAGTAA GTTTGGGTTG GTTTCTGTAG CTTCCGTTAT GTATCTAAAA TCACTCCCAT CAACAGACTG GCAAAACACA TACACTTGTC GAACTCATTT TCCAGCTTCT CTCTCGACCT GCTGCCCCAA ACACTACGCT TCCACCCCGT ATCCTCATCA CGACCCCGTC AAATTTGGCT CTCGATAATT TACTTATCCG TCTCCATATT CTGGCACAGC AACCACCTTA CAGCTCTCTC CTCCCACGCA ACTCATTTCT TCGTATGGGG CATCCTACTA GAGTCCACCG AGATCTCGTA AAAGAGACTT TAGATTGGAA AGCAGCAAAC GGTGACCAGG GTGAACTATT GAAAGATGTG GGAAAAGAAA TGCAAGGTCA CCTTGATGCT TTGGGCAAGA AGAGAGGTGA AAAAGGTGCC GTAAAAGGGA AAGAAAGAGG GAAGAAATGG GGAGAAGTCC GAGAGTTGCG TAAAGAGTGC GTTTTGGCTG TCCTATGGGC ATTTTGTAGC TAACACGTCT GTAGATATCG TCAGCGAGAG GGCAAAGTTG TCAAGACGGT GGTGAACGGG GCACAGGTTG GTTTTTTCTT GACTGTCTCA TTGTATATGT ACTTAATGCA GCGTTGTTTA GATTGTCCTT GCAACCTGCC ATAGTGCCGG ATCTCGACAG CTTAACAATA TGATCTTCGA TGTATGTATC ATCGATGAAG CAACACAAGC TGTTGAGGCT GTTTGTTGGG TTCCCATCTT GAAATCCAAG AAGTTAATTC TTGCTGGTGA CCCTCAACAG GTAAGTACTT ATCTCATAAT CTACAAGAAA TAGGGCCTAA TATAACTCGA AATCAAGCTT CCCCCGACTA TCATGAGCAA 
AGAAAATGCC CCGCCCTTGA AAGACCTTCA AGAAGCAATC GATCAGATTA AACTAGGTGA TAGCCCATCC CTGAAATCTC CACGAACTCT GGAAACAACC CTTTTCGAAC GTCTAGAAAA ACTTTACGGT CCCGGGATCA AGCGTGTTCT TCAGGTCCAG TATAGGATGA ACGAACACAT TGCAGTGTTC CCATCAGAAA CCCTATATGA GTCGGCGCTC ATATCCGATG CTTCAGTTGC TAAGCGTACG TTGCTTGAGC TTCCATCAGT GAAAGACAAG ACAAGTGAAG ACGTCAAGGA TGATTTGGAG CCTACGGTGG TCTTCTTTGA TACCGCCGAT TGCGAATTTT ACGAGAGAAC CGAAGGAGAT GGAGAAGCTA CCAAGAGCTC CATTGGGGAA GGCAGTAAGA GTAACGAGAA TGAGGCAGAA ATTGTGGCCA GGTGGGCGAG GAAGCTGGTA TGTTGTTCTT CCTGATCGAC AAATGTCAGC TTATCATACA CAACAGATAT CATTGGGAAT ACCCCCCATT GAAATTGGCA TCGTTACCCC TTACCAAGCT CAAGTCACAC TTATTTCGTC TCTCCTACAC GAAGAGTACC CTGAGATGAC GATTGGAAGT GTTGACGGAT TGCAAGGCCA AGAACGAGAG GTGAGCTGTG TTTGATTTAT CTCTTGAGAC AAAGCTGATC GTTTACAGGC CATCATCCTT TCACTGGTTC GATCAAATCC TTCAGGAGAA GTGGGATTTC TTGGTGAATA CAGGAGATTG AATGTTGCAA TGACAAGAGC AAAGAGACAA CTGGTCAGTA ATTGTGGCAT GACAGTTGCG GTGTCACACT ATCGACGAAG CTGACAGAAT TTTAGTGTGT TGTGGGAGAT TCCAAAACCG TCTCGAAAGG AACCAAATAC CTGAAAAAGT GGATGGATTG GCTTGAGGCA GAAGCGGATG TTAGATGGGC TGGAGAAGAG ACGGTATAGA AACCATTAAG TGCGTGGATC ACTTGGTGTA ATTTGCCATA TGAGCGTAAC ATGGACGGTA AGATTCTGAA TGTCCATAAG CTGTGTCACA TCCACGTTAT ACAGAATGAT TATTATAGAT TACGATGACT TGTATTACAG TACAATTGGT AACAAACATA ATATGTGACA GGTAAATTCT TCTGTCTTAC AACATTAAAT ATTGGCTAAC TACTAAACTC CTCAGATTAG ATTATAGA
|
Protein sequence | MTLDILQPKP TPDASVQSFL NRHKQLLELE RKAEEEQTRL LNSKCSPSLL EQRGLALNGL GVSGISIGLG GKSLIELHRP LAYHTSPALP PHTFRPGDPV RIEAHVATTS GKSKGKKRES EDEGAVEGVV YRVGPEKVVV AVNESKEIDL PERLRLLKLA NSVTFDRMEK TLAHLERLVL PSGGTPSPPS FNMPLVQALL GKQLPTWKET IPPQNNIEPL GQLSSDEDMK WFGQHLNDSQ KESIKFCLKA NEVACIHGPP GTGKTHTLVE LIFQLLSRPA APNTTLPPRI LITTPSNLAL DNLLIRLHIL AQQPPYSSLL PRNSFLRMGH PTRVHRDLVK ETLDWKAANG DQGELLKDVG KEMQGHLDAL GKKRGEKGAV KGKERGKKWG EVRELRKEYR QREGKVVKTV VNGAQIVLAT CHSAGSRQLN NMIFDVCIID EATQAVEAVC WVPILKSKKL ILAGDPQQLP PTIMSKENAP PLKDLQEAID QIKLGDSPSL KSPRTLETTL FERLEKLYGP GIKRVLQVQY RMNEHIAVFP SETLYESALI SDASVAKRTL LELPSVKDKT SEDVKDDLEP TVVFFDTADC EFYERTEGDG EATKSSIGEG SKSNENEAEI VARWARKLIS LGIPPIEIGI VTPYQAQVTL ISSLLHEEYP EMTIGSVDGL QGQEREAIIL SLVRSNPSGE VGFLGEYRRL NVAMTRAKRQ LCVVGDSKTV SKGTKYLKKW MDWLEAEADV RWAGEETV