Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC03670 |
Symbol | |
ID | 3256520 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 1158687 |
End bp | 1161932 |
Gene Length | 3246 bp |
Protein Length | 875 aa |
Translation table | |
GC content | 53% |
IMG OID | 638255588 |
Product | hypothetical protein |
Protein accession | XP_569593 |
Protein GI | 58264874 |
COG category | [K] Transcription |
COG ID | [COG0571] dsRNA-specific ribonuclease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGCCC CGCTGTTCAC CTCCCCCTCG GGCGCCGAGC TCTCGCTGCA GAACGCTGGG GGCCATCTCT CGCGCTGGCA CGTGGCGGTC GGGCACACCC CCGCCTTCCG CTTCCACCGC AGCGACGCGC CCCCCGCCAC GCTCTGCACC GTCCACCTCC CCGGCCTCGG CCCCGTCACC TCGGACCCCT GCGCCAACAA GGCCCGTGCA AAGCAGTCCG CGTCCTTCCG CGCAGTGCAG CGCCTCCTCG CCAGCCACGA GCTCGACGAC CGCTTCCAGC CCACGCCCTC GCAGCCCCGC ACGAGGGCGA GAGACGACAT CATGGCCACA AAGTCCAGGT GCAACTATCG CTGGGAGAGC AGGAAGCTCT TGATCAAGGA CGAGCTGCCG CAGGGACACG GGAGCGGCGA CTATGCGTAC CGCACCACTC CCCAGTTCTG GAAGGAGTGC CCGCCCTTTG CGCCCGGAGA ACGGGTGCAT GCCGCAGTGC TCCTGCTCTC ACCGACGGCC GAAAAGACGT TTGGCCACCC CGAGTGCAGG CCCATGTGCT TGCTCACCAG CAGGCCCTTG CCGCTGTTCG AAACGCACGA TACGGTCGAG GTGAATCTCG GCGTAGGGGA GCAGCCCACC CACCGCGGCT GCGTGAGGAT GACCAGCGCC GGATCCCTGG AAGGCGGCCT CGACGACGTC ACGCTCGGCC AGACACTGAG GTTCACAGAG CGGCTGATGA GGGCTGAGCT GCAGAGACTC ATCAAGGCAG ATCTTCATCG AATCAAGTGG CTGCTGCTTC CCCTCACGTA CGGGTACTCT TCCGCTCGAC AACCAGGCCC GTTGGCGCGT GAGGATATCG CTTGGGAGCA AGTCGAAGCT GTTGCCAATG GGCCCCTGAC AGTTCCGTTC ACCCTTAAAG ACGAGGACGA CCTCGCCGCG CAATGTTTCG ATGCCATGGC CTCGGCGAAT TCTGAACTGA CCCGCCGAAG CTACATTACT GCTGTCCGTC AAGACCTCAA TCCCGACTCT CCCCATCCGG AGCGACCTGA GAAGACGATC CTGCAAGTGA TCCAGGGCGA GAACTGTCTG TTTCCCCAGC TTGTACACCG CAAGCAGCCC ATCCTCCAAG TCGAGACGGC GTGTCTCGCC AAAAATGGCT CATATATCAC CACCTCTGCC ACGCCTGCCG AGCAGCGCTC AACGCAGTTC CTCATTCCAG AGTATGAAAA TCGCCACTGT ATCCCCGCCA GCATCTTCCG CGCCCTCACC CCCTTTCCCT TTTTCATCCA CCAACTCGAA TGCATGCTCA TCGCGTACGA AATGTCGACC AAACTGTTTG GATCATTGCT CCATCCCGAA CTCGCTCTCC AAGCTCTCAC AGCCCCTTCC TCGCAAAACG TCACCCCATG GACCTATGAG CGCCTCGAGA TCCTTGGCGA TACCCTGCTC AAATTCTTCA TCACCATCCA CGTCTATCTG CACGGTGGAG GCGCAAACTC TCGTGAAGAC TCGTTGAAAG TGTGGCAGGA TAGACACAAG CTTGTCAGTA ACCGTACACT CACCGCCAAC GCAATCAAGC TCGGGCTGGT GGAATTCGTG AGAGATAAAA GGCTTAAAGT GAAAGAGTGG GTACCACGAG ATTGGGAGCT AGAGCTGTTG CCGGGGCAGA TTGCGCCCAA GAAGGCTGTG CAAACAAGTA ACGATGGTCC AGAAATGAGG AAACTGGGAG ATAAGGCAGG TTTTTTTTTC AATTTTTCTA TAGCCAAATC TAACATTTCC ATTTCTATAG CTCCTAGCAG ATATTGTTGA AGCCATCATT GGAGCCTCGT ATGGCAGGGA CAAGAACTTT GACAATGTCA TCTCCACATT GATACATCTC AATGTCCCCC TCGATCTTTT TAGTACTTGG GAAGAGATCA AACATGTGCT TCCACTGTCG GATGATGAAC ATGATACCGA AGAACATGGG GCTGCCGATG CCGAAACCGA AACGGACTCT TTCTCAGCAT GTGAAATCCT GGGGTACAAA TTCAAGCGTA AAAGGAGGTA CGAGGATGTC ATTGTGAGTT GATTCTGCCA TTCTTTTTTA CTGAACCAAT CCATGTGATC TGACATGTTA TCTCACAGAG TATGAGTAAT CAGCCTCATG CGAGGAGGAT CCGCGAAAAC TACAAGATGC TAGGTAATGC AGTCTTGGAT CTTCGTGAGT GTTATTCCCC CCCAAAGGTA TTCTAAACAA AAAAACTGAA GCTTGGGGCA GACGTCATTG AATTCCTGCT GGCAAAGTAC CCTGAAGAAG GACCTGGTTC TTTATCCAAC ATGAAAGTAT GTCCATCCAT TGAATCGTTT GTGATGGGCC GGTCACTGTT GCTAATGCAT AACAACCATG CAGACGTTTC GTACCACAGA GGTATCTTTA TATATATATA TTCCAATCAG ATCGCTGACG AACATGAAAT AGGGACTTCG ATGCGCGCTA GCGACTGAGC TAGGCCTACA AGACCTCCTC AAGGATGGTG ATGAAAGGGC GGAGAGGGAA CTGGGTCGCG CGACATACTT TATGCGCGAC GCAAAGGCTC AGGCGGATGC TGGCCTTTCA GACGAAGAAC AGGGAGGTAT ACACTACTGG GATGGTGTCG CTGTCAACCA TGTGAGTTTG AATTTTGATC CTAGTTTATG AAAAGATGGA GAGGTGGAAA AAACTAAAAC TTGAACGACT GTGCAGTTTA CGGGAAGCAT TATCGAAGTC ATCTACGGTG CTATTTTCCA GGACTCTGGT TTTTCCCTTG AACATACCCA GAGGATATTC GATAACCACA TCGTGCCTTT TATCGAGAAA TATTGCAAAG GTCCCAGTGC AAGCGATCTT CACCCAAAGG GCTTGTTAAC TAGATGGATG CAGGCCAAGG GCTGTGCATA TTGGAAACTA GATGCAGTCG GGCCGAAATT ATGCGAGGGT GTCGGTGAGT GTGTATTCCT TGTGGTTAAA AATACTACTT GGGTCCATTC TTCAAGCTGA AAAGATAAAT AGTTACCTGC CATGATCAAG ATGTGGCGCG TTGCAAGGCT TTCACTACGC ATGTGGCCGT GAGGAACGTA TGTGAAGAAG CTATAAAACG TTTGCGAGAT GAAAACCGAA TCAAAGATAT TTGCGACTGC CCGTCGTTCA AAGAGATCGT CCCCGAGGAC AAGACGTTTG GGAAATGGAG ATGATAAAAA AGTTTCTAGA AGTGTATATG GAATCCCCTA TACTTG
|
Protein sequence | MSAPLFTSPS GAELSLQNAG GHLSRWHVAV GHTPAFRFHR SDAPPATLCT VHLPGLGPVT SDPCANKARA KQSASFRAVQ RLLASHELDD RFQPTPSQPR TRARDDIMAT KSRCNYRWES RKLLIKDELP QGHGSGDYAY RTTPQFWKEC PPFAPGERVH AAVLLLSPTA EKTFGHPECR PMCLLTSRPL PLFETHDTVE VNLGVGEQPT HRGCVRMTSA GSLEGGLDDV TLGQTLRFTE RLMRAELQRL IKADLHRIKW LLLPLTYGYS SARQPGPLAR EDIAWEQVEA VANGPLTVPF TLKDEDDLAA QCFDAMASAN SELTRRSYIT AVRQDLNPDS PHPERPEKTI LQVIQGENCL FPQLVHRKQP ILQVETACLA KNGSYITTSA TPAEQRSTQF LIPEYENRHC IPASIFRALT PFPFFIHQLE CMLIAYEMST KLFGSLLHPE LALQALTAPS SQNVTPWTYE RLEILGDTLL KFFITIHVYL HGGGANSRED SLKVWQDRHK LVSNRTLTAN AIKLGLVEFL LADIVEAIIG ASYGRDKNFD NVISTLIHLN VPLDLFSTWE EIKHVLPLSD DEHDTEEHGA ADAETETDSF SACEILGYKF KRKRRYEDVI SMSNQPHARR IRENYKMLGN AVLDLHVIEF LLAKYPEEGP GSLSNMKTFR TTEGLRCALA TELGLQDLLK DGDERAEREL GRATYFMRDA KAQADAGLSD EEQGGIHYWD GVAVNHFTGS IIEVIYGAIF QDSGFSLEHT QRIFDNHIVP FIEKYCKGPS ASDLHPKGLL TRWMQAKGCA YWKLDAVGPK LCEGVVTCHD QDVARCKAFT THVAVRNVCE EAIKRLRDEN RIKDICDCPS FKEIVPEDKT FGKWR
|
| |