Gene CNB01020 details


Gene Information

Locus tag: CNB01020
Symbol:
ID: 3255859
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 307153
End bp: 309293
Gene Length: 2141 bp
Protein Length: 472 aa
Translation table:
GC content: 49%
IMG OID: 638254753
Product: mandelate racemase/muconate lactonizing enzyme, putative
Protein accession: XP_569108
Protein GI: 58263396
COG category: [M] Cell wall/membrane/envelope biogenesis; [R] General function prediction only
COG ID: [COG4948] L-alanine-DL-glutamate epimerase and related enzymes of enolase superfamily
TIGRFAM ID:
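
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based and inclusive (an assumption, though it reproduces the listed Gene Length). The function name `feature_length` is hypothetical and not part of any database API. The sketch also compares the gene length with the minimum coding length implied by the 472 aa protein; the difference is presumably made up of introns (the gene is marked as spliced) and any untranslated sequence.

```python
def feature_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
gene_length = feature_length(307153, 309293)
print(gene_length)  # 2141

# Minimum coding length implied by the protein:
# 472 codons plus one stop codon (assumed), three bases each.
coding_bp = (472 + 1) * 3
print(coding_bp)  # 1419

# Bases not accounted for by the coding sequence.
print(gene_length - coding_bp)  # 722
```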


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value: 0.629016
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CCCGTTTCTT CCATCCCCAC TATTGCTCCA CAAATAACCA TGTCAGGCCT CAAGATCACA 
GAATTCTCCG TCCACGGTAT GTCATGCACT GCCCTCTCAT GACCACTGGC TAATCATTCG
CATCAGATAT CCGATTTCCC ACCGTAAGCC TGACCTTTTT CTTTTTTTCA ACCACCAAGC
AGCAGCTCAC CTCGTATCAA GAATGTCACT GGTGACGGTA CGGATGCCAT GTAAGCCATT
TCAGTCCAGT CCATAAAGCC CATACTGACA GCACCGCAGG AACAAGGAGT GCGATTATTC
CGCTGCCTAC ATCGTCGTCA AGACGAACTC TGACCTCAAG GGGCAAGGAA TGACTTTTAG
TATGTAAAAA CATCCTATTG CCTGCATAGG AATATTTTCT GACTGTTCTG CAGCCATTGG
TCGTGGAAAC GAAATCGTCT GCTTTGCTAT CGAACAAATA GCTAACCGTA TCGTCGGTTT
GGACCTTGCC CCCATCTTTG CCGACATGGG CAAGTTCTGG GACTTTTGTA AGCCATCGTT
TTCCAACAGG AAAATAGAAA CCAGTCGGCT AACATCTTTT AAAGTGGTGG CCGACCCTCA
GCACCGTTGG CTCGGCCCTG AAAAGGGTGT CATCCATATT GCTACCGCCG CTATTTCCAA
TGCCATCTGG GATATGTACG CCAAGCACGC CGGCAAGCCC TTGTGGAAGC TCATTGTCGA
CTTTACTCCT GAAGAGTATG TCCTGCTGTA CTTTTCACCT TGGATAAAAT AAGCTAACCG
TCAGACAGAT TCGTAAAGGC CACCTCTTTC CGATACATCA CCGACGCCCT CTCCCCGGCC
GAAGCCCTTG AGATCCTCAA GTCCAAGGAG TCTGGAAAGG CTGCCAGGGA AGCCGACGTC
AAAAAGAGGG GATACCCTGC CTACACCACC TCTGTCGGAT GGCTCGGGTA CTCTGACGAA
AAGGTCAGGC GATTGACAAA GGAAAGCCTT GCCCAAGGCT TCAACCATTT CAAGGTCAGT
GTCGACTTGA GAGAGACGGA TACGATCGCC CATTTTAACA TCTCTTAAAC CTCTTTAGCT
CAAAGTCGGC GCCGACCCCG AAGACGATCT TCGAAGGGGA CGACTCATCA GGTCCATCAT
CGATGATCCC GCCAACATGC CTAAAGATAG AAAACCTATC GACCCTGCCT CCATCGCCAA
CAAGAACGCC GGCCCCACAG GCTGTGTACT GATGGTGGAC GCCAACCGTG AGTCACTATA
AACCGGAAAC CTGCCGCAGA GCTAACTAAT CCCTTTATCA GAGGTCTGGG ATGTCCCTCA
GGCCGTTGAG TACATGAAGA AGCTTGAACC CTTGAAGCCT TGGTTCATTG AAGAGCCTAC
TGCCCCCGAT GATGCAGTCG GTCACGCCGC CATTCGAAAG GCCCTTAAAC CCATCAATAT
CGGCGTCGCC ACAGGTGAAC ACGCTCATAA CCGAGTGAGT TAAAAAACTT TTGAACCCAC
CCTCGTAACA ATCCACTAAA ACCGACATAT GACAGATGGT CTTCAAGCAA CTGTTGCAGC
TTGACGCTAT TGACGTTTGT CAAATCGACT CTTGTCGACT GGGCGGTGTC AATGAGATTC
TCTCTGTTTT GCTCATGTCT GCCAAATTCG GGGTACCAGT CTGCCCTCAC GCCGGTGGTG
TAGGATTGTG CGAGTATGTG GTGAGTATAT TCAGCTTCTA TAAAGCGACA AAAAAAGAAA
CTGATCAAAG AGTAACTCTC CAGATCCACT TGTCTCTCAT TGACTACATT TGCGTCTCTG
GTGATATGGA GCGTAACGTC TTGGAATTTG TAGAGTAAGT GGCCCATTCT ACTCCTTGAT
ACAATTGCCC TTCGCTTACC CCCTTTCACA TAGCCATCTG CATGAGCACT TCCTCTACCC
CGTGTCCATC AACTCTGAAG GTCGATACAA TGTACCTACC GATGCCAAGG GCGGATACTC
TATCGAGATG TTTGAAAAGT CAATGGAGGA CTACGCCTTC CCTGGAGGTG CTTACTGGGC
CGCGGTGGCA AGGGGAGAGA ACCCTGCCGT TTCACATTAA TCATATCTGG TTAGAGTATT
ATCATTGTAC GATAATGTAG ATCTGTACGC ATGAACCGAG T
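
The gene sequence above is laid out in blocks of 10 bases, so the spaces and line breaks have to be removed before it can be measured. The following is a minimal sketch of that cleanup and of a GC-content calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and it is an assumption that the GC content field in the record is computed over the full gene sequence in this way. Only the first line of the sequence is used here for illustration.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "CCCGTTTCTT CCATCCCCAC TATTGCTCCA CAAATAACCA TGTCAGGCCT CAAGATCACA"
seq = clean_sequence(first_line)

print(len(seq))  # 60
print(f"{100 * gc_fraction(seq):.1f}% GC")
```

Pasting the full sequence into `clean_sequence` in place of `first_line` allows its length to be compared with the Gene Length field and its GC percentage with the GC content field.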
 
Protein sequence
MSGLKITEFS VHDIRFPTNV TGDGTDAMNK ECDYSAAYIV VKTNSDLKGQ GMTFTIGRGN 
EIVCFAIEQI ANRIVGLDLA PIFADMGKFW DFLVADPQHR WLGPEKGVIH IATAAISNAI
WDMYAKHAGK PLWKLIVDFT PEEFVKATSF RYITDALSPA EALEILKSKE SGKAAREADV
KKRGYPAYTT SVGWLGYSDE KVRRLTKESL AQGFNHFKLK VGADPEDDLR RGRLIRSIID
DPANMPKDRK PIDPASIANK NAGPTGCVLM VDANQVWDVP QAVEYMKKLE PLKPWFIEEP
TAPDDAVGHA AIRKALKPIN IGVATGEHAH NRMVFKQLLQ LDAIDVCQID SCRLGGVNEI
LSVLLMSAKF GVPVCPHAGG VGLCEYVIHL SLIDYICVSG DMERNVLEFV DHLHEHFLYP
VSINSEGRYN VPTDAKGGYS IEMFEKSMED YAFPGGAYWA AVARGENPAV SH