Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH00400 |
Symbol | |
ID | 3259263 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 1075771 |
End bp | 1077566 |
Gene Length | 1796 bp |
Protein Length | 359 aa |
Translation table | |
GC content | 49% |
IMG OID | 638258446 |
Product | B2-aldehyde-forming enzyme, putative |
Protein accession | XP_572232 |
Protein GI | 58270152 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4305] Endoglucanase C-terminal domain/subunit and related proteins |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGCAATTTCA TAACAGAAAC CCTCTCCTTT ACATCATTAA CACCATATAC ATCAACATCA CATTCATTTA TACACGCTCT TTACTTGGTT TCCATACATA TCGTAATCCG ACTATCCAGC ACATCAACTA ACTCGATACA CCCACCGACC ATGCTCGCCC TCCTCGCCCT CTTCCAGCTC CTCCCCATTC TTGCGCTCGC TGCCCACAAC CCTCCCCACC GCCGCGCTCA CGAGCGAGCT CGAAAGAATT ACGAGCATAT TCATGGCCGC GATGCTGAAG AATCCCCGTC CATCGTCGAG AGGGATAACC ATTATGACAA TCGAACCATT GTTGAGCGCG ACCACTTTGA CAATAGGACC CTCACTCCCC GAGGGACAAC GTATACCGGC GTTGGAACCT TTTACTACAC AGGTTTGGGC GCTTGTGGTT TGAACTCTCA AGACAGCGAT TACATGGTTG CGTTGAACTC TGCGCAGTAT GGTAGTGGAT GTAAGTACGA GTCCTTGAAT CTCTTTATGC CGGAAAATCA AATTTCCAGA AAAAAGTAAA AAAAGGCATG TGCTGATGGA TCTCACGCAT GCAGACCCCG GTCCTCAGTG CTTCAAGTAC ATTACCATCC AAATGGGCTC TACCACTGTC AGCGGTGTTG AGATTCTTGG TGAGTTAGTT GATATTTGAG GCGTTTCCTT TGCTAACAAA CATACAGACG AATGCCCTAC ATGTGACTAT GGATCCCTTG ACTTGTCTCC CGGCCTTTTC ACCCGGTTCG CTGATTATGA CGCCGGTACT ATTCAAATCA CTTGGTGGTT TGACGACGAT GTGAGTTGGG CTGTCTTTAC AAAACTCCGC GCAAAATTCC TACTGACACA TCTTTTGTAG GCTCCTGCCG CGACTACCAC CTCTGAAACT CCCACTTCTA CTTATGTACC CCCCACCTCC ACCTGGGTTG CTCCTTCTTC CTCATCGACT TCCACTTACG TCTGGATTCC TCCTTCGTCA TCGTCCACCC CTGAGTGGGT CGAGCCTTCC ACCTCACCCA CTCCTTCTTC CACCACCCAG TGGTATTCTT CGCCCGCTGA AACCTCTACT TCGACCACAC CCACCTCCAC CTGGGTGGCT CCTTCTTCCA CTTCCACCTC CGTCTGGGTT GAGAGCTCCA CCTACAGCTC CTCTGCGCCT AGCAGCACCA TCACTTCATC CGCGTACTCT TCCGCTTCTG CGGCCGTCAA CTCTACCAAC CCTTTCGCCA TTGTGAGCAA CGCTTCCAAC AGCAGTGTTT CTGCTACCAT TTCCTCCGAT ACTGGTGTCA GCGGCAGCGC AAACAGCGGG TCATCAGAAG TTTCTGTCGA AGTTACCGGC AACCTTGAGA TGATCAACGC TCTTGCTGCC CAGTACGGTC AGCTCGTGGT CGAGGCCGCC CTCCAGAGTT AGATCTTATC TTCCCCATCC TATCACTCTC TCGAACTTGT TTACGAACAC TGGCTGTGCA GGACAATTGA AATTTTGAAA TATTGACGCG CATAACGAAC CTTATATCAT AGTATCTACC ATTATATCTC TTATTTTTCG TTTCCGTTGC GCTCTCAATT TTATAAACAC GGGCTGTGTG TAGTTCCTTT TACTTTGTCT CTTTTCTCAC AATCTTTTTC CATCTTCTTT CCCATGAACC ACTAACTGGA TGTTTGATGG CAGTGGAAAG GACGTATACC TTTTTTTTAC GTTTGTAACG ATACCCCATG TCCCTTTCGT TATCTACATG TTTTCGTCTG TGTACACGAT GCACGATGTC TACGAT
|
Protein sequence | MLALLALFQL LPILALAAHN PPHRRAHERA RKNYEHIHGR DAEESPSIVE RDNHYDNRTI VERDHFDNRT LTPRGTTYTG VGTFYYTGLG ACGLNSQDSD YMVALNSAQY GSGYPGPQCF KYITIQMGST TVSGVEILDE CPTCDYGSLD LSPGLFTRFA DYDAGTIQIT WWFDDDAPAA TTTSETPTST YVPPTSTWVA PSSSSTSTYV WIPPSSSSTP EWVEPSTSPT PSSTTQWYSS PAETSTSTTP TSTWVAPSST STSVWVESST YSSSAPSSTI TSSAYSSASA AVNSTNPFAI VSNASNSSVS ATISSDTGVS GSANSGSSEV SVEVTGNLEM INALAAQYGQ LVVEAALQS
|
| |