Gene CNC04170 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC04170 
Symbol 
ID3256544 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1273965 
End bp1275213 
Gene Length1249 bp 
Protein Length251 aa 
Translation table 
GC content50% 
IMG OID638255638 
Productconserved hypothetical protein 
Protein accessionXP_569663 
Protein GI58265014 
COG category[R] General function prediction only 
COG ID[COG0637] Predicted phosphatase/phosphohexomutase 
TIGRFAM ID[TIGR01509] haloacid dehalogenase superfamily, subfamily IA, variant 3 with third motif having DD or ED 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTGCCC TACTCGCACA GCGTGTCCTG CCTCAAATCG AGTATGCCAT CTTCGACATG 
GATGGTCTTC TTAGTACGTG ATCCCATGCT CACAGTCATG TCCTCTCGCT CATCAATTTG
TGTGAACAGT GTAAGCTGCT TGATTCGCTG TCAAAACGCC GCTCGTCATA GCATACTGAC
CACTGATACG TTTGTTTGAT TCTAGTGACT CCGAGGTTCG TTCGCTCAAC AACATTCCTG
TTGGGTTCCA TTTACTGACT TGCTTACGTC CCAGAGGATC TACACAGAGG TGACCAGTAC
GACGCAATCC GCTTTTGACG TAGAAGATTA ACTAATAAAG TTCTCAAAGA TGCCATTCTC
GGTCGCTATG GTCATACAAT GACCTGGGAC ATCAAAGCCG GTGTTATGGG CAAACCCCAA
CGTATCGCTG CAGAGTACAT TCTCTCGCAT TTCCCCGACA TTCTGGAGAA ACTGACCGTC
GAAGAATTCA TCGCCGAGGG TGTGCAGAGG AGAGAAGAAC TCTTCAAGCG GGTTGAGCCT
ATGAGAGGAG CTGCAGAATT GGTCAAAGGT TTGGTGAGTA CTGAAGCAGC ACTAGGCATA
GACACGGGGG GCTAAATATG AATAAAAGCA TGCCGCTGGA ATCCCTATCG CCCTGGCTAC
GGGCTCTACC ATGCCAAATT TCATTCATAA AACAGTGAGC TAGACACTTT ATCGACTGCG
CAATCCCACG CTTATACCGG ACGGTTGCAG ACACATCTTC CCCACATCTT CTCTCTTTTC
CCGCCGACGT CAATTCTCAC TGCAGACTCT CCCGAAGTCA AGCGTGGTAA ACCCAACCCT
GATATATTCC TTGCGGCCGC CCATTCTCTC GGAAGGGACG TGGGCACTGC TGACGAATGT
ACCGAAGAGC AAAAGGCGGA AAGAAGTCGA GGATTGGTGT TTGAGGATGC CCGGCCAGGT
GTCTTGGCTG GCATCGCGGC AGGGATGAAT GGTGAGTCCA TGCGACTGGA GCATCTTTGT
GGTTATCTAA TCATAAATAG TCATCTGGGT TCCTGATGCC GAATTACTTG CACTTAACCC
GGGAGAGACA TACGGCGCGA CGGAAGTCCT TACTCATCTG GAGGAATGGG ATCCCACTAG
GTGGGGCCTC CCTCCTTTAC CCGGTTTCAA TGTAAGTCAT CACGCTTTTT GAAACATGTG
CGCATAGATA ACCTCTTGAT TATCACAGCA CATTCCTGCC CAACCCTAG
 
Protein sequence
MTALLAQRVL PQIEYAIFDM DGLLNAILGR YGHTMTWDIK AGVMGKPQRI AAEYILSHFP 
DILEKLTVEE FIAEGVQRRE ELFKRVEPMR GAAELVKGLH AAGIPIALAT GSTMPNFIHK
TTHLPHIFSL FPPTSILTAD SPEVKRGKPN PDIFLAAAHS LGRDVGTADE CTEEQKAERS
RGLVFEDARP GVLAGIAAGM NVIWVPDAEL LALNPGETYG ATEVLTHLEE WDPTRWGLPP
LPGFNHIPAQ P