Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA03130 |
Symbol | |
ID | 3253390 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 821702 |
End bp | 824003 |
Gene Length | 2302 bp |
Protein Length | 573 aa |
Translation table | |
GC content | 49% |
IMG OID | 638252644 |
Product | anon-23da protein, putative |
Protein accession | XP_566754 |
Protein GI | 58258683 |
COG category | [R] General function prediction only |
COG ID | [COG0429] Predicted hydrolase of the alpha/beta-hydrolase fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.913558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCATCGCC GATCTCTCCG TCCTTATCTG CGGTATCCTT TCCCTGTAAC TCAACACCTC AGTTTCAGAT CAGTGCACCA TTTTTACAGC TACCGAGAAA TAAACGAAAA AAATGAAAAG CCTATAAAAT ATTCCCACAC ACCCACAATT CACTTTTCTC CTTATTGCAA TATTCATTCA CCTTTACCTG TCCACTCACA TCCCCACTCT TCTACAGATA CCAACGAACA CACGGTCCAT CGTTCAATGG ACCCAGCTTT TCACCCTCCT TTTTCCTTAC CAAACTTATA TCAGACCCTT CCTTTCGGGA CATGCTGCGC TATATTCATA CTGTGGGCGG TACACCACAT ACTCTGTCAA CGTTATGCAC CCATCACAAT TACGGGGCCT ACTCGCAGAG AAGAAGTAGG CGCTCATGGA GAGACGACTG AAGAGCTCGT CCATAAGTGG TGCAAAAGCC TAAAGGAGGG CTTCACAGCT AGCTGGTGGT TGCCAAAGTA CGTGTTGGGA CAGGGGAGCG AGCAAGTGTA GAAGGGACTA ACCAATGGGT ATGAAGTGGC CATGCTCAAA CGATTTATTC GGGATTGGCA GACTTTTCTA TGGATGATCA TGTAACTTAT CAGAGGTAAG TGGGGTGTGC ACGCGACCTG GTGGCAAGAA CGGCGACTTC CCGTGGGATC AAAACGAATC CATTACCGAA CGACACGAGG CTGACATTAC AATAATCAGA CAGCTTCTTC GACTTCCGGA TGGAGGAACA ATGTGGGTTG ATCAGACTTC AGGTGTGATA GTAGAGCCCT GTCCGTGAAC AATACTGACA AGAGGACCAG CGGTGTTGAT GTCTATCCGC CTCTCACTAC TGAGCTGGCA GACGATGCCC CAGTCATCGT CGTCAACCAT GGCCTTACCG GGGGCAGCCA TGAAAGTTAT GTGAGGAACT TGGTCGTTTG GCTCACCAAG CCTATCGCTG AAGGAGGTCT AGGTGGACGA GCAGCTGTAG TCAATGTAAG TAGATCATAT AGAGTTGAAC ACACGGAAAG ACACTAACCG ATCCATAGTT CCGTGGATGT GCCTCAACTC CACTGACTTC TCCTCACCTT TATTGCTCCG GCAACACCAT TGACAATCAC ACGGCTACAA CATATCTCGC CAGTCTATTC CCCGATGCCC CTTTACTCGG AGTCGGCTTC TCCCTTGGCG CAGCGGTTAT GACGAGATAC CTCGGTGAGC AGGGCGATAA AAGTCGTTTG CGAGCCGCTG TTGTCCTCTA CTGTCCTTTG GAGCTAAAGG CGATGAGCGC CAAGTAAGTA CCAGTGCTCC CTCATCAATC ACCTTTTTAT GCTGAATCAA CCATTACAAG ACTAGACTCT GCCCATCTGT TCCCTCGCCT CTATTCCCTT ACAATGGCTC GCAAAATTCT CAAGTCCATC TCTCCCCATT TACTTCCCCC TTCACCGCTA TCATCTCCAT CGTCGCCTTT ACATGTCAAC ATTCCAGAAA TCCTTTCCCT ATCCTCTTCA GTCAAGTATA AATGGACACT TCGTGCCAGT AAAGTGACCG AATTAGTAGT CACAAAAGTC GGCGGCAGTG CGCCTTGCTT CCCCTTTGAA GGCATGGACC AGTTCCTGGA GTGGGCTTGT CCCAGTGGAT GGATAGGTCG CATCAAGAGG TGAGTATAAG GTTTTCCCTC ATGATACTGG GTACGCATAT TGATTGGTGT TCCAGGCCGA CGCTGGCTAT TTCTGCTCTC GATGATCCCA TAGTATCAGG TGGTTAGTCC TTTATGACGC TTGTCTTTTA AATCCTTTTC TTTTTCTTTT TCTTAGCTAA CTTTTTACGA TTCAGACTGC CTTCCCTATT CTGCAGTCCG CGCTTCATCG CACATGATAC TTGCGGGAGT CGCACAAGGC GGACACTTGG GCAGTTTTGA TTCGCCTTCC CCCTTTGGAC CTGATAGACA CCGTCGATGG CATGTTCGAC CTACGATCGA ATTCTTGCGT GGGGTTATCA AGGATTTGCC CAAATCAGCT TCTGAACAAA AGCTGGGTAG GGTTCAAGTC GAAGAGAGGG AAGACGGGTG GTCTTGGGCC GGAGAAGTCG GATGGAAACT TGTAGGTGAG GAAGAAGAGT GCGGTTGGGT GGGAAATGGT ATTTGTGAGA GTGAGATGGA TGGGGCAGGT GCGAGCGTGT AGGCGGGGGA GAGAGCTGCC GCTTTTTGAT TGCCGATGAT ATGGAATGAA AGATAGATTA TTACGAGACG AAATGTTGAT GTAGATGGAT AGCGACGAGT GACATGTAAC TC
|
Protein sequence | MHRRSLRPYL RYPFPVTQHL SFRSVHHFYS YREINEKNEK PIKYSHTPTI HFSPYCNIHS PLPVHSHPHS STDTNEHTVH RSMDPAFHPP FSLPNLYQTL PFGTCCAIFI LWAVHHILCQ RYAPITITGP TRREEVGAHG ETTEELVHKW CKSLKEGFTA SWWLPNGHAQ TIYSGLADFS MDDHVTYQRQ LLRLPDGGTI GVDVYPPLTT ELADDAPVIV VNHGLTGGSH ESYVRNLVVW LTKPIAEGGL GGRAAVVNFR GCASTPLTSP HLYCSGNTID NHTATTYLAS LFPDAPLLGV GFSLGAAVMT RYLGEQGDKS RLRAAVVLYC PLELKAMSAK LDSAHLFPRL YSLTMARKIL KSISPHLLPP SPLSSPSSPL HVNIPEILSL SSSVKYKWTL RASKVTELVV TKVGGSAPCF PFEGMDQFLE WACPSGWIGR IKRPTLAISA LDDPIVSGDC LPYSAVRASS HMILAGVAQG GHLGSFDSPS PFGPDRHRRW HVRPTIEFLR GVIKDLPKSA SEQKLGRVQV EEREDGWSWA GEVGWKLVGE EEECGWVGNG ICESEMDGAG ASV
|
| |