Gene CNC02470 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC02470 
Symbol 
ID3256120 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp705805 
End bp707581 
Gene Length1777 bp 
Protein Length411 aa 
Translation table 
GC content50% 
IMG OID638255467 
Productphosphatase, putative 
Protein accessionXP_569538 
Protein GI58264764 
COG category[R] General function prediction only 
COG ID[COG0637] Predicted phosphatase/phosphohexomutase 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCAACTCCA ACTCCTCGCC CAAGAACAAA ACTCTCCAAT CTCTTTCTTC ACGTTCGTTA 
CTTGGTCTCT TACCCGTACA TTAAATCCTC CTCACCGCAC TTGACTGACA CGACCCACAT
AGTCCCCCAA GTAACTATCC AAAATGTCCG TCTTCACCAA ATCTGCCTTC ACCATGTCCG
CTCTCCCCTC TCCCAACACC TCCCAGCCTC CCTCCGCCGC CCCCTCTCGC CGTGGCTCTT
TTGCCAACGG CCTTGCCTCT GGTCAGCTCA CACCCGTGAC CGACCCCCAC ATTGTTTCCA
TCAACGTCGA GTCGGTATTG TTTGACATGG ACGGCACTTT GATCAACTCT AGTCCCGCCG
TCGTCAAAGC CTGGGAACTG TTTGCCGAAA AGTACCCTCT TGATTTGGAT GACATTCTCA
GATGTGAGTG ATCAAGACTG TTAGTTTACT CAAGTACGAT TCTTATTTTC CTTGTAGCTG
CTCACGGCAT GAGAACCATT GATGTGCTCA AGAAGTGGTG CAAGATCACT GATCCCGAGT
TACTTGCCTC TGAGGTCATT CGTTTCGAAA CCGCCATTCT CAACGCCGCT GAGGACATTG
CCAAGAATTC AGGCAAGGCT GGTATTGAGG TTCTTCCAGG TGTTGCCAAA CTCCTTGCCG
ATTTGGGTGA AGAGGCCGAC AAGCGCGATG GTGAAGAAAA ATGGGCAATT TGCACTAGCT
GTACGTGTAA CTGCGCCTGC CTTACGTTCT TTAAATTGAC TAGGTCATAG CAACATACTT
TTACGCCGGT AAGGCAATCC CTATTGCTGG GTTGCCCACC CCCAAAGTGT TTGTGACTGC
CGATTCCGTC ACTCGAGGGA AACCATTCCC TGACCCTTAT TTACTGGGTG CTTCGGGTTG
CAATGCCTCT CCTTTCGAGT GTTAGTCCCT CCTAAATGTG CTTTCTTGAA ATATCACTAA
ATCGCTCCGT AGCTCTTGTC GTTGAAGATG CCCCCACTGG TATTCGATCA GGCAAGGCGT
CTGGTGCCCT TGTCCTCGCC ACTTGTACCT CCCACGAACG TGAGGAGCTC GAGCGTGAAC
GACCCGACTT CCTTGTCGAT GATCTTTCTC ACGTTAAGGC GACTTGGGAT GCCGCCACCA
ACACTTTCAA TTTGATCATT GAGCAACCTA TTGACCGATA CACACCCCGT CCAACTCCCG
ATGTCACTCC CGTTATCACT CCGGCTATGT CCAGATCAAA TTCTTTCTCG GGCGTTGGCC
AGGATCGTCC TAGTGTTCGC ACCTCCCAGG CAATCATGAA GGGAAGTGAT GACCTTACTG
GCAACGACTC TGTTGTTGGC TCTCCAGCTG CCAGCAGACC CGGATCTCCT GGCGCCGACG
ATAGTATTGA GAAGCGCGCG GAGATGGAGT TCCACAGACG TGCGAGCCAG TCCGGTCAGG
CTGGCGTGAC ACTCGATGCT TTCCGACGTG CGCTGGCGGG TAACGCTGCT AAGAGGAGGG
CTCAAAGTCA AGGCGAAATG TCTCAGGACG AGTAATATAT GATAATGTAG CGATAGGCTT
TTTTTTTTTT TCTTTCTCAT TCTTTGACCT ACTTATACTT AGTGCGATCA GATCAGATCG
GATTGGACCA GGATTGAAAT CGATGTGCTC GACTCGAGTA ATGGGTAGTT GTTCCATAGG
TCCAACATCA TAGATATATA GACAACAATA CCGCATGCCA GCATGATTGT TACTGTGTCA
TGTCGCAAGA GGTTTTTCAA TGCCATGCAA AACACCA
 
Protein sequence
MSVFTKSAFT MSALPSPNTS QPPSAAPSRR GSFANGLASG QLTPVTDPHI VSINVESVLF 
DMDGTLINSS PAVVKAWELF AEKYPLDLDD ILRSAHGMRT IDVLKKWCKI TDPELLASEV
IRFETAILNA AEDIAKNSGK AGIEVLPGVA KLLADLGEEA DKRDGEEKWA ICTSSTYFYA
GKAIPIAGLP TPKVFVTADS VTRGKPFPDP YLLGASGCNA SPFESLVVED APTGIRSGKA
SGALVLATCT SHEREELERE RPDFLVDDLS HVKATWDAAT NTFNLIIEQP IDRYTPRPTP
DVTPVITPAM SRSNSFSGVG QDRPSVRTSQ AIMKGSDDLT GNDSVVGSPA ASRPGSPGAD
DSIEKRAEME FHRRASQSGQ AGVTLDAFRR ALAGNAAKRR AQSQGEMSQD E