Gene CNE03150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE03150 
Symbol 
ID3257541 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp894511 
End bp896655 
Gene Length2145 bp 
Protein Length431 aa 
Translation table 
GC content47% 
IMG OID638256898 
Productcellulase, putative 
Protein accessionXP_570869 
Protein GI58267426 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CTTGGTTCAA AGGAAGGAGT AACCGTCAAG ACTGTTAACC TTAAGCTTTT GGATACGCGC 
GGCACGCAAC TGCGGTCCTA GTCCATTAGA CCCTCCTTTC TCGTGTAACC CCACTCACAG
TGACGTTACC GGCCAAAGCT AATCAACCAA CACTTTGTCG GCCCACTTCT TCAGTTCTTC
CCATTCCTTC TTGCCACCGT TCACTTCTTT AACCTCATCG TTTCCTTCCC TTCACTTCTT
CACTCATCCT CTCCCCTTCA AATCCTCTCA GCTGTCGAAC CACGGCGCCA CAACTACTCT
CTAGGTGTTC AATGCGTTCG CTCATCCCAT TTTTCATAGT GCCCTTGGCG GTCGCCACGG
CCCTGCCTGC TTTCCAGACT GTGGACCTTG AAAAACGATC TGTTAATGTC GGTTGGCCTT
ATGGAAGCGA CAAGATCAGG GGTGTGAACA TTGGCGGAGT GAGACTTTCA ACCTTTTCAT
TTCATGGAAC AACTTTATTT ACTGTCCCCA TTTGCAGTGG CTCGTGACTG AACCTTTTAT
CACTCCCTCG CTGTTCGAAG CTACAGGAAA CAACGACATC GTCGATGAAT GGACTTTCTG
TCAGTATCAG GACTATGATA CTGCTCAAGC TGCCTTGAAA AATCATTGGG ATACCTGGTT
CACCGAGGAT GATTTTGCCA AAATTTCTGG TAAATCGCTT TTATATGCAG TGGATCAAAT
ATTAACTATT AATATTCAGA GGCCGGTCTG AACCACGTTC GCATTCCTAT CGGATTTTGG
GCATACGATG TGCAAGGCGG TGAGCCGTAC ATTCAAGGAC AAGCCGAATA TCTGGATAGA
GCTATCGGTT GGGCCCGGAA CCACAATCTC GCCGTCATCA TAGATCTTCA TGGTGCCCCT
GGTAAGTTTA TCCTTGGGAT GCACACAGTT CATAGATGAA TCAATCTCCA AGGGTCGCAA
AATGGTTATG ATAATTCTGG AAGGAGGGGT GCTGCTGACT GGTAAGTTTA GTAATCTGGC
TTGAGAATGA TACTGTAGCT GAAAGTGGAT CTCTTCAAGG GCCACCGATG AGGCCAATGT
GGAGAGAACG AAGAATGTGA TCGCTTTGCT CAGCACGAAA TACTCTGATC CCCAGTACTA
TGGAGTCGTT ACCGCCCTCG CACTGTTGAA CGGTAAATGA ACATAATGTT TGTACAATGC
TGATCAAGTT CGTAGAACCA GCCACTTATC TCAATAACCA GCTTCTCCAG ACTGCTCGTC
AGTATTGGTA CGACGCCTAT GGTGCTGCCA GGTATCCATT TGGCAATAGC GACAAATCCG
GCCTAGCTCT TGTCCTGCAC GATGGCTTCC AACCTCTCAG CACCTTTGAA AACTATATGA
CAGAACCCGA ATATGAAGAT GTCCTGCTCG ATACTCACAA TTATCAGGTC TTTAACGATG
AGTATGTCGC TTGGAACTGG GATGAGCATA TCTCAGTAAG TTGTATCCTG CGGTTAATGA
ACTCGGCCCT CGGCTCACAC CACGCCAGAA CATCTGTAAT AAGGCTAGCA CTTACAGCGG
CTCTCCTTTA TGGCTTGTTG TGGGCGAGTG GACTTTAGCC ACGACTGATT GCGCTAAATA
TCTAAATGGC CGAGGTATCG GTTCGCGTTA TGACGGTTCT TACCAAGGTT CATCGTATGT
TGGATCTTGT GATGACAAGT CAAATGATGT CAGCAGATTT TCCGAGTGAG TGTTTCGTAG
TCAATACCTT TTTTTCCAAG GTGTATACCG GTGCACATAC AAGTTACTGA TTCAACCGTC
CAGAGAGTAC AAAGCCTTCA TGCACAGGTT TTGGGAGGTC CAAACGCAAG TGTACGAGCA
AAATGGTCAG GGTTGGATCC ATTGGACCTG GAAGACAGAG AGTGCGGCAG ACTGGTCTTA
CGAAGCCGGG CTTGATGGCG GATGGATCCC CTGGAACGCA GGCTCTCATG ATATTTCCCT
TTCTTCGTTA TGTGGCTAGA TTGCTAGACA CCATTAGGAT TCTTTTCTAC CATATTGAAG
CCACTCGCGC CTCGCTCGAG GATATTCAGG ACTTTTTCTC TGTTTTAGTC AAGCCATAGC
GCAATGAACG AATTTGATAA GTTGAATAAA ACATGTCTTC TATCG
 
Protein sequence
MRSLIPFFIV PLAVATALPA FQTVDLEKRS VNVGWPYGSD KIRGVNIGGW LVTEPFITPS 
LFEATGNNDI VDEWTFCQYQ DYDTAQAALK NHWDTWFTED DFAKISEAGL NHVRIPIGFW
AYDVQGGEPY IQGQAEYLDR AIGWARNHNL AVIIDLHGAP GSQNGYDNSG RRGAADWATD
EANVERTKNV IALLSTKYSD PQYYGVVTAL ALLNEPATYL NNQLLQTARQ YWYDAYGAAR
YPFGNSDKSG LALVLHDGFQ PLSTFENYMT EPEYEDVLLD THNYQVFNDE YVAWNWDEHI
SNICNKASTY SGSPLWLVVG EWTLATTDCA KYLNGRGIGS RYDGSYQGSS YVGSCDDKSN
DVSRFSEEYK AFMHRFWEVQ TQVYEQNGQG WIHWTWKTES AADWSYEAGL DGGWIPWNAG
SHDISLSSLC G