Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC02470 |
Symbol | |
ID | 3256120 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 705805 |
End bp | 707581 |
Gene Length | 1777 bp |
Protein Length | 411 aa |
Translation table | |
GC content | 50% |
IMG OID | 638255467 |
Product | phosphatase, putative |
Protein accession | XP_569538 |
Protein GI | 58264764 |
COG category | [R] General function prediction only |
COG ID | [COG0637] Predicted phosphatase/phosphohexomutase |
TIGRFAM ID | [TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCAACTCCA ACTCCTCGCC CAAGAACAAA ACTCTCCAAT CTCTTTCTTC ACGTTCGTTA CTTGGTCTCT TACCCGTACA TTAAATCCTC CTCACCGCAC TTGACTGACA CGACCCACAT AGTCCCCCAA GTAACTATCC AAAATGTCCG TCTTCACCAA ATCTGCCTTC ACCATGTCCG CTCTCCCCTC TCCCAACACC TCCCAGCCTC CCTCCGCCGC CCCCTCTCGC CGTGGCTCTT TTGCCAACGG CCTTGCCTCT GGTCAGCTCA CACCCGTGAC CGACCCCCAC ATTGTTTCCA TCAACGTCGA GTCGGTATTG TTTGACATGG ACGGCACTTT GATCAACTCT AGTCCCGCCG TCGTCAAAGC CTGGGAACTG TTTGCCGAAA AGTACCCTCT TGATTTGGAT GACATTCTCA GATGTGAGTG ATCAAGACTG TTAGTTTACT CAAGTACGAT TCTTATTTTC CTTGTAGCTG CTCACGGCAT GAGAACCATT GATGTGCTCA AGAAGTGGTG CAAGATCACT GATCCCGAGT TACTTGCCTC TGAGGTCATT CGTTTCGAAA CCGCCATTCT CAACGCCGCT GAGGACATTG CCAAGAATTC AGGCAAGGCT GGTATTGAGG TTCTTCCAGG TGTTGCCAAA CTCCTTGCCG ATTTGGGTGA AGAGGCCGAC AAGCGCGATG GTGAAGAAAA ATGGGCAATT TGCACTAGCT GTACGTGTAA CTGCGCCTGC CTTACGTTCT TTAAATTGAC TAGGTCATAG CAACATACTT TTACGCCGGT AAGGCAATCC CTATTGCTGG GTTGCCCACC CCCAAAGTGT TTGTGACTGC CGATTCCGTC ACTCGAGGGA AACCATTCCC TGACCCTTAT TTACTGGGTG CTTCGGGTTG CAATGCCTCT CCTTTCGAGT GTTAGTCCCT CCTAAATGTG CTTTCTTGAA ATATCACTAA ATCGCTCCGT AGCTCTTGTC GTTGAAGATG CCCCCACTGG TATTCGATCA GGCAAGGCGT CTGGTGCCCT TGTCCTCGCC ACTTGTACCT CCCACGAACG TGAGGAGCTC GAGCGTGAAC GACCCGACTT CCTTGTCGAT GATCTTTCTC ACGTTAAGGC GACTTGGGAT GCCGCCACCA ACACTTTCAA TTTGATCATT GAGCAACCTA TTGACCGATA CACACCCCGT CCAACTCCCG ATGTCACTCC CGTTATCACT CCGGCTATGT CCAGATCAAA TTCTTTCTCG GGCGTTGGCC AGGATCGTCC TAGTGTTCGC ACCTCCCAGG CAATCATGAA GGGAAGTGAT GACCTTACTG GCAACGACTC TGTTGTTGGC TCTCCAGCTG CCAGCAGACC CGGATCTCCT GGCGCCGACG ATAGTATTGA GAAGCGCGCG GAGATGGAGT TCCACAGACG TGCGAGCCAG TCCGGTCAGG CTGGCGTGAC ACTCGATGCT TTCCGACGTG CGCTGGCGGG TAACGCTGCT AAGAGGAGGG CTCAAAGTCA AGGCGAAATG TCTCAGGACG AGTAATATAT GATAATGTAG CGATAGGCTT TTTTTTTTTT TCTTTCTCAT TCTTTGACCT ACTTATACTT AGTGCGATCA GATCAGATCG GATTGGACCA GGATTGAAAT CGATGTGCTC GACTCGAGTA ATGGGTAGTT GTTCCATAGG TCCAACATCA TAGATATATA GACAACAATA CCGCATGCCA GCATGATTGT TACTGTGTCA TGTCGCAAGA GGTTTTTCAA TGCCATGCAA AACACCA
|
Protein sequence | MSVFTKSAFT MSALPSPNTS QPPSAAPSRR GSFANGLASG QLTPVTDPHI VSINVESVLF DMDGTLINSS PAVVKAWELF AEKYPLDLDD ILRSAHGMRT IDVLKKWCKI TDPELLASEV IRFETAILNA AEDIAKNSGK AGIEVLPGVA KLLADLGEEA DKRDGEEKWA ICTSSTYFYA GKAIPIAGLP TPKVFVTADS VTRGKPFPDP YLLGASGCNA SPFESLVVED APTGIRSGKA SGALVLATCT SHEREELERE RPDFLVDDLS HVKATWDAAT NTFNLIIEQP IDRYTPRPTP DVTPVITPAM SRSNSFSGVG QDRPSVRTSQ AIMKGSDDLT GNDSVVGSPA ASRPGSPGAD DSIEKRAEME FHRRASQSGQ AGVTLDAFRR ALAGNAAKRR AQSQGEMSQD E
|
| |