Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE03150 |
Symbol | |
ID | 3257541 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | + |
Start bp | 894511 |
End bp | 896655 |
Gene Length | 2145 bp |
Protein Length | 431 aa |
Translation table | |
GC content | 47% |
IMG OID | 638256898 |
Product | cellulase, putative |
Protein accession | XP_570869 |
Protein GI | 58267426 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTGGTTCAA AGGAAGGAGT AACCGTCAAG ACTGTTAACC TTAAGCTTTT GGATACGCGC GGCACGCAAC TGCGGTCCTA GTCCATTAGA CCCTCCTTTC TCGTGTAACC CCACTCACAG TGACGTTACC GGCCAAAGCT AATCAACCAA CACTTTGTCG GCCCACTTCT TCAGTTCTTC CCATTCCTTC TTGCCACCGT TCACTTCTTT AACCTCATCG TTTCCTTCCC TTCACTTCTT CACTCATCCT CTCCCCTTCA AATCCTCTCA GCTGTCGAAC CACGGCGCCA CAACTACTCT CTAGGTGTTC AATGCGTTCG CTCATCCCAT TTTTCATAGT GCCCTTGGCG GTCGCCACGG CCCTGCCTGC TTTCCAGACT GTGGACCTTG AAAAACGATC TGTTAATGTC GGTTGGCCTT ATGGAAGCGA CAAGATCAGG GGTGTGAACA TTGGCGGAGT GAGACTTTCA ACCTTTTCAT TTCATGGAAC AACTTTATTT ACTGTCCCCA TTTGCAGTGG CTCGTGACTG AACCTTTTAT CACTCCCTCG CTGTTCGAAG CTACAGGAAA CAACGACATC GTCGATGAAT GGACTTTCTG TCAGTATCAG GACTATGATA CTGCTCAAGC TGCCTTGAAA AATCATTGGG ATACCTGGTT CACCGAGGAT GATTTTGCCA AAATTTCTGG TAAATCGCTT TTATATGCAG TGGATCAAAT ATTAACTATT AATATTCAGA GGCCGGTCTG AACCACGTTC GCATTCCTAT CGGATTTTGG GCATACGATG TGCAAGGCGG TGAGCCGTAC ATTCAAGGAC AAGCCGAATA TCTGGATAGA GCTATCGGTT GGGCCCGGAA CCACAATCTC GCCGTCATCA TAGATCTTCA TGGTGCCCCT GGTAAGTTTA TCCTTGGGAT GCACACAGTT CATAGATGAA TCAATCTCCA AGGGTCGCAA AATGGTTATG ATAATTCTGG AAGGAGGGGT GCTGCTGACT GGTAAGTTTA GTAATCTGGC TTGAGAATGA TACTGTAGCT GAAAGTGGAT CTCTTCAAGG GCCACCGATG AGGCCAATGT GGAGAGAACG AAGAATGTGA TCGCTTTGCT CAGCACGAAA TACTCTGATC CCCAGTACTA TGGAGTCGTT ACCGCCCTCG CACTGTTGAA CGGTAAATGA ACATAATGTT TGTACAATGC TGATCAAGTT CGTAGAACCA GCCACTTATC TCAATAACCA GCTTCTCCAG ACTGCTCGTC AGTATTGGTA CGACGCCTAT GGTGCTGCCA GGTATCCATT TGGCAATAGC GACAAATCCG GCCTAGCTCT TGTCCTGCAC GATGGCTTCC AACCTCTCAG CACCTTTGAA AACTATATGA CAGAACCCGA ATATGAAGAT GTCCTGCTCG ATACTCACAA TTATCAGGTC TTTAACGATG AGTATGTCGC TTGGAACTGG GATGAGCATA TCTCAGTAAG TTGTATCCTG CGGTTAATGA ACTCGGCCCT CGGCTCACAC CACGCCAGAA CATCTGTAAT AAGGCTAGCA CTTACAGCGG CTCTCCTTTA TGGCTTGTTG TGGGCGAGTG GACTTTAGCC ACGACTGATT GCGCTAAATA TCTAAATGGC CGAGGTATCG GTTCGCGTTA TGACGGTTCT TACCAAGGTT CATCGTATGT TGGATCTTGT GATGACAAGT CAAATGATGT CAGCAGATTT TCCGAGTGAG TGTTTCGTAG TCAATACCTT TTTTTCCAAG GTGTATACCG GTGCACATAC AAGTTACTGA TTCAACCGTC CAGAGAGTAC AAAGCCTTCA TGCACAGGTT TTGGGAGGTC CAAACGCAAG TGTACGAGCA AAATGGTCAG GGTTGGATCC ATTGGACCTG GAAGACAGAG AGTGCGGCAG ACTGGTCTTA CGAAGCCGGG CTTGATGGCG GATGGATCCC CTGGAACGCA GGCTCTCATG ATATTTCCCT TTCTTCGTTA TGTGGCTAGA TTGCTAGACA CCATTAGGAT TCTTTTCTAC CATATTGAAG CCACTCGCGC CTCGCTCGAG GATATTCAGG ACTTTTTCTC TGTTTTAGTC AAGCCATAGC GCAATGAACG AATTTGATAA GTTGAATAAA ACATGTCTTC TATCG
|
Protein sequence | MRSLIPFFIV PLAVATALPA FQTVDLEKRS VNVGWPYGSD KIRGVNIGGW LVTEPFITPS LFEATGNNDI VDEWTFCQYQ DYDTAQAALK NHWDTWFTED DFAKISEAGL NHVRIPIGFW AYDVQGGEPY IQGQAEYLDR AIGWARNHNL AVIIDLHGAP GSQNGYDNSG RRGAADWATD EANVERTKNV IALLSTKYSD PQYYGVVTAL ALLNEPATYL NNQLLQTARQ YWYDAYGAAR YPFGNSDKSG LALVLHDGFQ PLSTFENYMT EPEYEDVLLD THNYQVFNDE YVAWNWDEHI SNICNKASTY SGSPLWLVVG EWTLATTDCA KYLNGRGIGS RYDGSYQGSS YVGSCDDKSN DVSRFSEEYK AFMHRFWEVQ TQVYEQNGQG WIHWTWKTES AADWSYEAGL DGGWIPWNAG SHDISLSSLC G
|
| |