Gene Information |
Locus tag | CND02340 |
Symbol | |
ID | 3256911 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | - |
Start bp | 624868 |
End bp | 627960 |
Gene Length | 3093 bp |
Protein Length | 814 aa |
Translation table | |
GC content | 51% |
IMG OID | 638256168 |
Product | allantoicase, putative |
Protein accession | XP_570443 |
Protein GI | 58266574 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG3194] Ureidoglycolate hydrolase [COG4266] Allantoicase |
TIGRFAM ID | [TIGR02961] allantoicase |
|
|
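The coordinate and length fields above can be cross-checked with a short script. The sketch below assumes that Start bp and End bp are 1-based and inclusive, which the record does not state explicitly; the helper names `parse_record` and `span_length` are illustrative and not part of any database tooling.

```python
# A few rows copied from the record above, in the same "Key | Value" layout.
RECORD = """\
Locus tag | CND02340
Strand | -
Start bp | 624868
End bp | 627960
Gene Length | 3093 bp
Protein Length | 814 aa
"""


def parse_record(text):
    """Turn 'Key | Value' rows into a dict, skipping rows without a key."""
    fields = {}
    for line in text.splitlines():
        if "|" not in line:
            continue
        key, _, value = line.partition("|")
        key, value = key.strip(), value.strip()
        if key:
            fields[key] = value
    return fields


def span_length(start_bp, end_bp):
    """Length of a span, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


fields = parse_record(RECORD)
computed = span_length(int(fields["Start bp"]), int(fields["End bp"]))
declared = int(fields["Gene Length"].split()[0])  # "3093 bp" -> 3093

print(f"computed span: {computed} bp, declared gene length: {declared} bp")
```

If the two numbers agree, the inclusive-coordinate assumption is at least consistent with the Gene Length field. The strand value does not affect the span length, only the orientation of the gene on the replicon.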
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.649401 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCTATTGGC CTTAATCTAT TGGCCTGTAC TCCTCCCCCA CATCTCATAT CAATAAGATC TACATTATCA ATATAAACAC ACAGCCATGC TCACCCCACA GCAGATCCCA CTTGAAGAGT TCGACTCCAA TATCAAATCC AATTATGTCG GTACGCCGTC TTTCCTGATC CATCCCTGTT CGCTGTTCGT TTACATCCAC AGTAGAGGTC TCTTCATCTG CCCTCGGTGG TGAAGTTGTT GCATGCTCAG ACGATTTCTT TGCTTCTTGT CATAACTTGA TCAAGCCCAC TGTACGTTTC GTTTCACGCT GGCATCTTAA ATCGTGATTG CAGAATTGCC GTCCATTGCC GACAATGACA TGGGTCGCCT TGCAATTTTG CCAATTGAAC TGCAAAACGC CTAGAGCGAG CTGACTTGAG ATGGCTAGCC GTCTGTATCC ATGAAAGGTC AATTCGGGCC GAATGGTGCG CTTTACGACG GTTGGGAATC AAGGAGACAT AATCCTGCTT ACGATTGGTA CGTCTATAGC TCACCCATGG AGAGCTTCCC TGCTAACTAT ACGTAGGGTC ATTATCCGCC TCGCCACTCC TCTCACCTCT TTGCATTATG TTGACATTGA CACTTCTCAT TTCAACGGTA ATGAAGCCCC CCAGTCCCAA GTGTTCGCCC TTTCGTTGCC CTCCGACGGA TCCACTCCCA TGTGGACCCA GGGTAACCCT GGCTGGGTCG AAGTTCTGCC AGTGGTGGAT TTAGGACCTA ACAGCAGACA CATCTTTGAA GTTGGGAAGG CAGGGAAGGA GGGAAAGTGG GGTGCTGTGA TGGTTAACAT GATGCCCGAT GGTGGAATGG TAGGTCCCCG AGACAATCAG CGCAATCGAT CTAATAAGAG AGACAGGCCA GATTCAGGGC ATACGGTCTC CCTCAAGGCC CTTCTCTCCC TAAATCTCTT CCGGCCAACT ATCAGTCACT CCAACCCACC AACCTGTTAT CTCCTCTCAT TGGTGGCCGC ATCATTTCAT GCTCTGACGC TCAGTTCTCT CCTCCAGGCA ATCTTCTCCT CCCTGGTCGT GGAGTAGACA TGTCTGATGG ATGGGAGACT AGGCGTAGTC AGGAGAACAG GGGCAAATAC GCCCCCGGCG GACCTCTGGC CGGCCAAGAA AGGAAGGAGT GGGCGGTGGC GAAGTTGGGT GTCCCAGGTA TCGTGCGATG GGTTGAAGTG GACACCGCTT TCCACCCTGG TAATTACCCC AAGGTGAGTA TCGCAATCGC CTTTTCATTT GAGCTGACGT CTCGCAGTAT TGTGCTGTCG AAGCAACGCT TTCCTCGGAC GAAGACCTCT CATCTGCATC GTGGACCACT ATTGTCCGTA AAACCCCCTG TGGAGCCCAT CGTCAACACT ATCTTCCCCT TGAGCCCAAC GTCCCTCCCA CTCAGGTATT TTCCCACATT CGATACTCAG TGTTCCCTGA TGGCGGCACT AAGCGCGTCA GAATCTTCGG TCACCCCCTC GACCCTACCT CTCCTGATGC CCAGGGGGCC TTAGAAAAAG CCAGGCTTGA GCCTATGATC ATTCCCGCTC TGCCTCTTAC CCCAGAGGCT TTCAAGCCTT ATGGTCAAGT TGTCCAAGGT TTTTCATTGC CCACTTCCGC ACCCAAGGGT ATCCATGTCA ACATCGCGAA CCAAGGCACA GCATTCAAAT TTCATCGCCT TGCTAAACCT GCCGAATCCT ATGCTCCAGG CCTTCTTCAA AAAGGCGGAA TTCATGTTGG CGCGGTCAAG GCTAGCAGCA AGATGGATAT 
CAAGAACGGG AAGAGGATCA AGGTTGAACT TTTGGAGAGA CACAGACATA CATCTCAAGC CTTTGTGCCC ATGGGAGCCG AACCTGGGAA GGAAGGAAAA CAGGGCGCAT TCGTGGTGGT TGCAGCTCTG AATGGACCTG ACGATAAGCC CGATCTTAGT ACAGTCCGGG CATTCTTGGC CACGGCGGCT CAGGGCGTCA ACTATGATGA GGGTATCTGG CGTGAGTATT TCAAAACTTG TTTGTTGGCA TCAGCTGATG TAACTGTTTC AGATCACTCC TTGTTGACCG TTGGTGGAGT AAGTGCGACA CAAGGTGATG TGATGACAGC TGCTGATGAA TACCGTAGGA TCTCAATTAT GCCATCGTTG AAGCACAAAT GTCTGTTCCC AATGAGATCC GAGACTGCGA AAAGGTCGTT CCGCCATCCG AGCTTCATGT TGAAGTTCCT CCATACCCCT TCACTCATCC CTCTACCGCC TCCGTCTCTG CCGCACACGT TGTTGGTCAC CAGACCAACA GTGTTCTTCC TTCCCTAGCT TCCCTTTTGC CTGGAAAGGC TCTTAAGCCC GTCCCCATCA CTCCTGAAAA CTTTGCGCCA TTTGGTCATT TGATCACAAC AACCCCCTCA TCTTCCCATA CCGACACCGA GCGTGCGCCT GATGGGTTGA CAGTTAAGCA CAACCGTCTG GCTCCTGTAA TCTCTACTTA TCCCGAGGAG ACTGGTGCAG TCACTGGTAT TGCTGTATTT AGGGCTACGA AGAAAGTTGG ACTGGAAAGG GGTAAAGTTT TCGATGTGCG ATACATGGAA AGGCACCCTT ATACCAGTCA GACGTTTGTT CCCATGGGCA AGGGCGAGGT GAATTTATCC CATTAAATCC AAAAGGCATC AGGCTGACCA CGACATTGTA GTGGCCCGGT CATGGTGAGG CTGCGCTTCC CCCTGGAGGC GAGTTCCTTG TCATTGTTGC CCAGAACGGC CCGGACGACC GTCCCGACCC GTCCACCCTT CAATCATTCC TTCTTCCGGC TCATCAAGGT CTTTCTTACT CACCTGGAAC ATGGCATCAT CCAGTTTTGG TGCTGGACTC AACGTTGGAT CTCATGTGTG TTGAGACTCA AATTGCCACT GGCGTACATG ATAGCGATGG AAGGGATTGC GAACTTTTGA GCTGGGAGGG TGAAGAAGTG TTCGGCAGGG TCGCTGTTCC CGAGTAGATA AACGTGTTGA GAAAGCGTGG GCAAAGGAAG TATACAAGTA TGTATATATA TATATATCCA TTTAGGGTTC TAG
|
Protein sequence | MLTPQQIPLE EFDSNIKSNY VEVSSSALGG EVVACSDDFF ASCHNLIKPT PSVSMKGQFG PNGALYDGWE SRRHNPAYDW VIIRLATPLT SLHYVDIDTS HFNGNEAPQS QVFALSLPSD GSTPMWTQGN PGWVEVLPVV DLGPNSRHIF EVGKAGKEGK WGAVMVNMMP DGGMARFRAY GLPQGPSLPK SLPANYQSLQ PTNLLSPLIG GRIISCSDAQ FSPPGNLLLP GRGVDMSDGW ETRRSQENRG KYAPGGPLAG QERKEWAVAK LGVPGIVRWV EVDTAFHPGN YPKYCAVEAT LSSDEDLSSA SWTTIVRKTP CGAHRQHYLP LEPNVPPTQV FSHIRYSVFP DGGTKRVRIF GHPLDPTSPD AQGALEKARL EPMIIPALPL TPEAFKPYGQ VVQGFSLPTS APKGIHVNIA NQGTAFKFHR LAKPAESYAP GLLQKGGIHV GAVKASSKMD IKNGKRIKVE LLERHRHTSQ AFVPMGAEPG KEGKQGAFVV VAALNGPDDK PDLSTVRAFL ATAAQGVNYD EGIWHHSLLT VGGDLNYAIV EAQMSVPNEI RDCEKVVPPS ELHVEVPPYP FTHPSTASVS AAHVVGHQTN SVLPSLASLL PGKALKPVPI TPENFAPFGH LITTTPSSSH TDTERAPDGL TVKHNRLAPV ISTYPEETGA VTGIAVFRAT KKVGLERGKV FDVRYMERHP YTSQTFVPMG KGEWPGHGEA ALPPGGEFLV IVAQNGPDDR PDPSTLQSFL LPAHQGLSYS PGTWHHPVLV LDSTLDLMCV ETQIATGVHD SDGRDCELLS WEGEEVFGRV AVPE
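As a rough consistency check between the two sequences above, the start of the coding region can be translated and compared with the first residues of the protein. This is a minimal sketch with several assumptions: the Translation table field is empty in the record, so the standard genetic code is assumed; the fragment is copied by hand from the gene sequence, starting at the first ATG that matches the protein start; and only the first 63 bases are used, because the record marks the gene as spliced and a straight read-through of the whole gene sequence would run into intron sequence. The names `CODON_TABLE`, `FRAGMENT`, and `translate` are illustrative.

```python
# Standard genetic code (ASSUMED; the record leaves "Translation table" empty).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))

# First 63 bases from the start codon, kept in the record's 10-base grouping.
FRAGMENT = (
    "ATGC TCACCCCACA GCAGATCCCA CTTGAAGAGT "
    "TCGACTCCAA TATCAAATCC AATTATGTC"
)


def translate(dna):
    """Translate DNA in frame 0, ignoring spaces and any trailing partial codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


peptide = translate(FRAGMENT)
print(peptide)
```

The printed peptide can be compared with the opening residues of the protein sequence in the record. Because the fragment is read as displayed, without reverse-complementing, a match would suggest that the gene sequence is already shown in coding orientation even though the Strand field is "-".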