Gene Information |
Locus tag | CNC05910 |
Symbol | |
ID | 3256407 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 1748374 |
End bp | 1751507 |
Gene Length | 3134 bp |
Protein Length | 949 aa |
Translation table | |
GC content | 52% |
IMG OID | 638255812 |
Product | hypothetical protein |
Protein accession | XP_569816 |
Protein GI | 58265320 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
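The Gene Length row agrees with the Start bp and End bp rows if the coordinates are read as 1-based and inclusive. The sketch below checks that arithmetic and compares it with the number of coding bases implied by the Protein Length row. The function name `gene_length` and the assumption that the stop codon is counted among the coding bases are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values copied from the Gene Information rows above.
start_bp, end_bp = 1748374, 1751507
protein_length_aa = 949

length = gene_length(start_bp, end_bp)

# Assumption: one codon per residue plus one stop codon.
coding_bp = (protein_length_aa + 1) * 3

# Bases in the gene span that are not accounted for by codons
# (introns and any flanking untranslated bases).
non_coding_bp = length - coding_bp

print(length, coding_bp, non_coding_bp)
```

A gene span longer than its implied coding length is what one would expect for a record with "Is gene spliced | Yes".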
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGGTATACCA CCACAAAAAT GAGCGGAACC CACTCTAGAA AGTCTCTCAA GACTAGGTAC GTTGGTGCCG AATTTTTCTT GCCTAAAACA TCCACTGACC TTTACTGTAG CTTCCGACCG TCTCCGCGGT CCATCCGCCC TATCTATACT GGCGGACCCG TTCTTCTTAC CAAGGATGGC CAGTGGATAA TCACTACCAT GGGAGAAGAG GCCTTGGTGA CGGAAGTGCA GACTGGGTTG GCTATCGCTA GAATACGAGG GGCAAGTTTC ATCGCATTTC TTTTCACTCT ATCTGATCCC ATGTACTGAC TTTATTCATC AGGACGGCAC ACCCATTACG TCCCTCTCGC TCTCCTACCA TACTTCACCA CCTACTCTCA TCACATCGCA CATGTCCATG ACTGTCCGGT ACTACCCTCT TCCTGAATCA GCACCCCTCT CATCTACTCC CAAACCCCCT TCATTAACCT ACACCCGTAT CCTCAACAAA GCCCACTCGG CTCCTATCCT CGTCTCAAAA GTTTCCCCCG ACAACACGCT TTTAGCTACT GGATCCTCTG ATGGAATTGT CAAGGTCTGG GACTTGGCCG GTGGATACGT GACACATCTG TTCAGAGGAC ATGGCGGTCC CGTTTCGGCA TTACACTTCA ACTTCCCCAT CATCTCTGGA GAAGAACGAC GTCGTATGGA GCTTCTCACT GGGTCGACCG ACTCCAGGGT ACGGGTTTAT GATCTTCGAG ATGCTAATGC CCGTGTTGTC GGTGGTGGGA ATGCGGTCAA GCCTAAAGCA GTGCTCGAGG GTCACGTCTC TGTCGTTAGA GGGATTGATG TTACCCCCGA CGGCAAATGG GCTGTCACCG GTGGTCGAGA CAAGGTCGTG CTTGTATGGG ATATGCTTTC AGGAGAAACA ACGGCTTTGG CCAAAAAGGG TAAAGGAAAG GCGACAGCGG GCCCCAAGTT GGTGCAAACT ATCATCGCAC AAGAGCAAGT CGAATCTTTG GGTTTATTGC CCCAAGAAGA GCAAGTTTCT GGTGCAGCCG AAGGGAGATG GCTTTGCTAC ACCGGTGGAG ATAAAGGATT GGTGAGGGTC TGGGACGTCC TCAAGGGAAC CCAGGTTGCT ACAATGAAAG GCGTCGAAGG GGTCGATGAA ACAGAGTTGG ACGAGGACGA ACAGCGCGGC GTTCTTTCCG TAATGTACTC CCCAACCTCC TCGTCCCTCG TATCTATCCA CGCCGATCAA AACATCATCT TCCATTCCCT TTCCACTCTC CTTTCAACAC GTCAAATTAT AGGTTTCAAC GACGAAATCG TCGACGTCGC CTTCCTCTCC CATCCCTCCG CACCAACCAC CTCCCCTTCC TCACTTCCGG AAACACCAGA CATTCCCCAC TCCCACATGG CAGTCGCTAC AAACTCCAAC CTCTTGCGAG TCTACTCCAC ATCCTCGTTC AATGCCCGAC TCCTTCCCGG ACACAGCGAT ATGATCCTCT GTCTTGACAT CTCCCCCGAC CACCAATGGC TCGTCACCGG ATCCAAAGAC CATACTGCTC GCGTATGGGC GCCTACCACC TGCGCAGAGG GCGATGGGTA TACATGGAAA TGCATCGCCA TCTGTGAAGG CCACGCGGAA TCGATCGGTG CCGTCGCTTT TGCGCGCAAG CCTTCCGACA ACGGTCATGC CCGGTTCCTC TTCACTGCAA GTCAGGACCG TACCATCAAA ATGTGGGATC TCACTCCCCT TTCCAACTCT CTTTCTCCGT CTCCTATCCG CCCTCGGTCG ATGGCCACCC TCCGTGCCCA 
CGAAAAGGAT ATCAACTCAC TCGATATCGC GCCCAACGAC AAGTTCCTCG TCTCGGGTTC CCAAGACAAG CTTGTCAAGC TTTACGCCAT TGATTTCAAC CCGCCCAAGG TACCTGGAGA AGGAAAAGGT GCGGAAGGAG GTTTCAAGGC TTTGGGTACT TGTGCAGGGC ATAGAAGAGG TGTATGGACC GTGAGGTTTA GCAGGAATGA TAAAGTTGTG GCGAGCGGTA GTGCCGATAG GACCGTCAAG CTTTGGAGTT TGGACGACTT TACTTGTCTC AAGGTTGGTT TTCATTTGGC GTTTTTGTGT GTTGGCCTGG ACTTTCCCTG TTACTTTGCT GACTCTTGTA TATCCAGACT TTTGAGGGTC ACACCAACTC TGTGCTTCGA GTGGACTTTT TGTCTCATGG TCAACAGCTC GTGACTTCTG CGTCTGATGG GTTAGTGAAG CTCTGGAATA TCAAGGAAGA AGAATGTGTA AAGACGTTGG ATAATCACGA AGACAAGGTG AGAGTTGCCT CCTACTCTCA CTTACAACGT ACCATTTCAC TAATCTCTTT TTTATTTTTC CCTTTTCCTT GCAGATCTGG GCGCTTGCAC ATTCATCCGA CGAATCCACC CTCCTCTCCG CCGGCGCCGA TTCCCTCCTC ACCATCTGGC ACGACACCTC TCTCCTCGAA CAATCCGAGG CAAACGCCAC TCTCATCAAG TCCGTCCAGG TCGAACAGGA TTTCATCAAC TACGTCGCTC TCAAAGACTA TCGCCGCGCC ATCTTACTCG CACTCTCCAT GAGTCAGCCC GGACGTCTCT TCAACCTCTT CAGTACCGTC GTCAAGGGCC GTCAGCCCGA TTTGACAGAG GAAGAACAAG GGATCACTGG GTCCAAAGAG ATTGATGAGA TTATGAAGAC CTTGCCTGGA ATAGAGTTGG TGAGGTTGCT AAAGTTTGTG AGAGACTGGA ATGCGAATGC CAAGATGGCA CCGGTGGCGC AGGTGATTTT GCACGCCATC TTCAAATTGA GGAGTGCGGA GGATATCCTT GCGGCGTTTG AGCAGGCCAA CAGGTTGCCC AAGCGTTCAG AGGAGGAAGA GGAGGATGAA GATGAGGACG AGGAGAAGGA GGAGGGAGAG GAAAAGAAGA AGAAGAAGAA GAAGGAACGA CCGTCTCTCG GCGCGCCTAT AAGCATCAAG GATCTTCTCG AAGGGCTTAT CCCATATTCT GAGAGGCATT TCAACAGGGT TGACAAGCTT GTGCAGGAAA GTTACATGTT GGACTATGTG CTTGGCGAGA TGGAAGGTGG ATTGTTTGGT GAGGAGTTAA TGGACCTCCA ATAA
|
Protein sequence | MSGTHSRKSL KTSFRPSPRS IRPIYTGGPV LLTKDGQWII TTMGEEALVT EVQTGLAIAR IRGDGTPITS LSLSYHTSPP TLITSHMSMT VRYYPLPESA PLSSTPKPPS LTYTRILNKA HSAPILVSKV SPDNTLLATG SSDGIVKVWD LAGGYVTHLF RGHGGPVSAL HFNFPIISGE ERRRMELLTG STDSRVRVYD LRDANARVVG GGNAVKPKAV LEGHVSVVRG IDVTPDGKWA VTGGRDKVVL VWDMLSGETT ALAKKGKGKA TAGPKLVQTI IAQEQVESLG LLPQEEQVSG AAEGRWLCYT GGDKGLVRVW DVLKGTQVAT MKGVEGVDET ELDEDEQRGV LSVMYSPTSS SLVSIHADQN IIFHSLSTLL STRQIIGFND EIVDVAFLSH PSAPTTSPSS LPETPDIPHS HMAVATNSNL LRVYSTSSFN ARLLPGHSDM ILCLDISPDH QWLVTGSKDH TARVWAPTTC AEGDGYTWKC IAICEGHAES IGAVAFARKP SDNGHARFLF TASQDRTIKM WDLTPLSNSL SPSPIRPRSM ATLRAHEKDI NSLDIAPNDK FLVSGSQDKL VKLYAIDFNP PKVPGEGKGA EGGFKALGTC AGHRRGVWTV RFSRNDKVVA SGSADRTVKL WSLDDFTCLK TFEGHTNSVL RVDFLSHGQQ LVTSASDGLV KLWNIKEEEC VKTLDNHEDK IWALAHSSDE STLLSAGADS LLTIWHDTSL LEQSEANATL IKSVQVEQDF INYVALKDYR RAILLALSMS QPGRLFNLFS TVVKGRQPDL TEEEQGITGS KEIDEIMKTL PGIELVRLLK FVRDWNANAK MAPVAQVILH AIFKLRSAED ILAAFEQANR LPKRSEEEEE DEDEDEEKEE GEEKKKKKKK ERPSLGAPIS IKDLLEGLIP YSERHFNRVD KLVQESYMLD YVLGEMEGGL FGEELMDLQ
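The sequences above are printed in space-separated blocks of ten characters, so they need to be stripped of whitespace before lengths or base composition can be compared with the Gene Length, Protein Length and GC content rows. The following is a minimal sketch of that clean-up; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and the example uses only the first three blocks of the gene sequence rather than the full record.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First three ten-base blocks of the gene sequence shown above.
prefix = "GGGTATACCA CCACAAAAAT GAGCGGAACC"

print(len(clean_sequence(prefix)), round(100 * gc_fraction(prefix)))
```

Applying the same two functions to the complete gene sequence would give values to compare against the Gene Length and GC content rows.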