Gene CND02340 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCND02340 
Symbol 
ID3256911 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006686 
Strand
Start bp624868 
End bp627960 
Gene Length3093 bp 
Protein Length814 aa 
Translation table 
GC content51% 
IMG OID638256168 
Productallantoicase, putative 
Protein accessionXP_570443 
Protein GI58266574 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG3194] Ureidoglycolate hydrolase
[COG4266] Allantoicase 
TIGRFAM ID[TIGR02961] allantoicase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.649401 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCTATTGGC CTTAATCTAT TGGCCTGTAC TCCTCCCCCA CATCTCATAT CAATAAGATC 
TACATTATCA ATATAAACAC ACAGCCATGC TCACCCCACA GCAGATCCCA CTTGAAGAGT
TCGACTCCAA TATCAAATCC AATTATGTCG GTACGCCGTC TTTCCTGATC CATCCCTGTT
CGCTGTTCGT TTACATCCAC AGTAGAGGTC TCTTCATCTG CCCTCGGTGG TGAAGTTGTT
GCATGCTCAG ACGATTTCTT TGCTTCTTGT CATAACTTGA TCAAGCCCAC TGTACGTTTC
GTTTCACGCT GGCATCTTAA ATCGTGATTG CAGAATTGCC GTCCATTGCC GACAATGACA
TGGGTCGCCT TGCAATTTTG CCAATTGAAC TGCAAAACGC CTAGAGCGAG CTGACTTGAG
ATGGCTAGCC GTCTGTATCC ATGAAAGGTC AATTCGGGCC GAATGGTGCG CTTTACGACG
GTTGGGAATC AAGGAGACAT AATCCTGCTT ACGATTGGTA CGTCTATAGC TCACCCATGG
AGAGCTTCCC TGCTAACTAT ACGTAGGGTC ATTATCCGCC TCGCCACTCC TCTCACCTCT
TTGCATTATG TTGACATTGA CACTTCTCAT TTCAACGGTA ATGAAGCCCC CCAGTCCCAA
GTGTTCGCCC TTTCGTTGCC CTCCGACGGA TCCACTCCCA TGTGGACCCA GGGTAACCCT
GGCTGGGTCG AAGTTCTGCC AGTGGTGGAT TTAGGACCTA ACAGCAGACA CATCTTTGAA
GTTGGGAAGG CAGGGAAGGA GGGAAAGTGG GGTGCTGTGA TGGTTAACAT GATGCCCGAT
GGTGGAATGG TAGGTCCCCG AGACAATCAG CGCAATCGAT CTAATAAGAG AGACAGGCCA
GATTCAGGGC ATACGGTCTC CCTCAAGGCC CTTCTCTCCC TAAATCTCTT CCGGCCAACT
ATCAGTCACT CCAACCCACC AACCTGTTAT CTCCTCTCAT TGGTGGCCGC ATCATTTCAT
GCTCTGACGC TCAGTTCTCT CCTCCAGGCA ATCTTCTCCT CCCTGGTCGT GGAGTAGACA
TGTCTGATGG ATGGGAGACT AGGCGTAGTC AGGAGAACAG GGGCAAATAC GCCCCCGGCG
GACCTCTGGC CGGCCAAGAA AGGAAGGAGT GGGCGGTGGC GAAGTTGGGT GTCCCAGGTA
TCGTGCGATG GGTTGAAGTG GACACCGCTT TCCACCCTGG TAATTACCCC AAGGTGAGTA
TCGCAATCGC CTTTTCATTT GAGCTGACGT CTCGCAGTAT TGTGCTGTCG AAGCAACGCT
TTCCTCGGAC GAAGACCTCT CATCTGCATC GTGGACCACT ATTGTCCGTA AAACCCCCTG
TGGAGCCCAT CGTCAACACT ATCTTCCCCT TGAGCCCAAC GTCCCTCCCA CTCAGGTATT
TTCCCACATT CGATACTCAG TGTTCCCTGA TGGCGGCACT AAGCGCGTCA GAATCTTCGG
TCACCCCCTC GACCCTACCT CTCCTGATGC CCAGGGGGCC TTAGAAAAAG CCAGGCTTGA
GCCTATGATC ATTCCCGCTC TGCCTCTTAC CCCAGAGGCT TTCAAGCCTT ATGGTCAAGT
TGTCCAAGGT TTTTCATTGC CCACTTCCGC ACCCAAGGGT ATCCATGTCA ACATCGCGAA
CCAAGGCACA GCATTCAAAT TTCATCGCCT TGCTAAACCT GCCGAATCCT ATGCTCCAGG
CCTTCTTCAA AAAGGCGGAA TTCATGTTGG CGCGGTCAAG GCTAGCAGCA AGATGGATAT
CAAGAACGGG AAGAGGATCA AGGTTGAACT TTTGGAGAGA CACAGACATA CATCTCAAGC
CTTTGTGCCC ATGGGAGCCG AACCTGGGAA GGAAGGAAAA CAGGGCGCAT TCGTGGTGGT
TGCAGCTCTG AATGGACCTG ACGATAAGCC CGATCTTAGT ACAGTCCGGG CATTCTTGGC
CACGGCGGCT CAGGGCGTCA ACTATGATGA GGGTATCTGG CGTGAGTATT TCAAAACTTG
TTTGTTGGCA TCAGCTGATG TAACTGTTTC AGATCACTCC TTGTTGACCG TTGGTGGAGT
AAGTGCGACA CAAGGTGATG TGATGACAGC TGCTGATGAA TACCGTAGGA TCTCAATTAT
GCCATCGTTG AAGCACAAAT GTCTGTTCCC AATGAGATCC GAGACTGCGA AAAGGTCGTT
CCGCCATCCG AGCTTCATGT TGAAGTTCCT CCATACCCCT TCACTCATCC CTCTACCGCC
TCCGTCTCTG CCGCACACGT TGTTGGTCAC CAGACCAACA GTGTTCTTCC TTCCCTAGCT
TCCCTTTTGC CTGGAAAGGC TCTTAAGCCC GTCCCCATCA CTCCTGAAAA CTTTGCGCCA
TTTGGTCATT TGATCACAAC AACCCCCTCA TCTTCCCATA CCGACACCGA GCGTGCGCCT
GATGGGTTGA CAGTTAAGCA CAACCGTCTG GCTCCTGTAA TCTCTACTTA TCCCGAGGAG
ACTGGTGCAG TCACTGGTAT TGCTGTATTT AGGGCTACGA AGAAAGTTGG ACTGGAAAGG
GGTAAAGTTT TCGATGTGCG ATACATGGAA AGGCACCCTT ATACCAGTCA GACGTTTGTT
CCCATGGGCA AGGGCGAGGT GAATTTATCC CATTAAATCC AAAAGGCATC AGGCTGACCA
CGACATTGTA GTGGCCCGGT CATGGTGAGG CTGCGCTTCC CCCTGGAGGC GAGTTCCTTG
TCATTGTTGC CCAGAACGGC CCGGACGACC GTCCCGACCC GTCCACCCTT CAATCATTCC
TTCTTCCGGC TCATCAAGGT CTTTCTTACT CACCTGGAAC ATGGCATCAT CCAGTTTTGG
TGCTGGACTC AACGTTGGAT CTCATGTGTG TTGAGACTCA AATTGCCACT GGCGTACATG
ATAGCGATGG AAGGGATTGC GAACTTTTGA GCTGGGAGGG TGAAGAAGTG TTCGGCAGGG
TCGCTGTTCC CGAGTAGATA AACGTGTTGA GAAAGCGTGG GCAAAGGAAG TATACAAGTA
TGTATATATA TATATATCCA TTTAGGGTTC TAG
 
Protein sequence
MLTPQQIPLE EFDSNIKSNY VEVSSSALGG EVVACSDDFF ASCHNLIKPT PSVSMKGQFG 
PNGALYDGWE SRRHNPAYDW VIIRLATPLT SLHYVDIDTS HFNGNEAPQS QVFALSLPSD
GSTPMWTQGN PGWVEVLPVV DLGPNSRHIF EVGKAGKEGK WGAVMVNMMP DGGMARFRAY
GLPQGPSLPK SLPANYQSLQ PTNLLSPLIG GRIISCSDAQ FSPPGNLLLP GRGVDMSDGW
ETRRSQENRG KYAPGGPLAG QERKEWAVAK LGVPGIVRWV EVDTAFHPGN YPKYCAVEAT
LSSDEDLSSA SWTTIVRKTP CGAHRQHYLP LEPNVPPTQV FSHIRYSVFP DGGTKRVRIF
GHPLDPTSPD AQGALEKARL EPMIIPALPL TPEAFKPYGQ VVQGFSLPTS APKGIHVNIA
NQGTAFKFHR LAKPAESYAP GLLQKGGIHV GAVKASSKMD IKNGKRIKVE LLERHRHTSQ
AFVPMGAEPG KEGKQGAFVV VAALNGPDDK PDLSTVRAFL ATAAQGVNYD EGIWHHSLLT
VGGDLNYAIV EAQMSVPNEI RDCEKVVPPS ELHVEVPPYP FTHPSTASVS AAHVVGHQTN
SVLPSLASLL PGKALKPVPI TPENFAPFGH LITTTPSSSH TDTERAPDGL TVKHNRLAPV
ISTYPEETGA VTGIAVFRAT KKVGLERGKV FDVRYMERHP YTSQTFVPMG KGEWPGHGEA
ALPPGGEFLV IVAQNGPDDR PDPSTLQSFL LPAHQGLSYS PGTWHHPVLV LDSTLDLMCV
ETQIATGVHD SDGRDCELLS WEGEEVFGRV AVPE