Gene CNA03300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA03300 
Symbol 
ID3253512 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp862472 
End bp864877 
Gene Length2406 bp 
Protein Length526 aa 
Translation table 
GC content48% 
IMG OID638252661 
Producthypothetical protein 
Protein accessionXP_566653 
Protein GI58258481 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.600103 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTCAGAACAA TCTCCTTCAG CTCACTTCTT TGCACGCTCA CCCTCCTTTT ACATCACCGG 
CGCCGGCGCC ATTCCTACTC CTTTCGATTA CCCCGTACAC GACACAACGG CACACACACC
AAGACGAAGT TATTTGCTCC TGGCGACTTT GTAAAAAGCA CGGTTTCTTC AAACTGCGCA
CGCTTCCCTG CTACTACTTT CTCCTACCGG TCTCGCACAC GCTCTTTGAC ACTGAAAAGA
TAGTCGCTCT TTAGGCTCCT CTTCGCAATA TTTTCATTAG CTTAGCTACG CTCCAGACCT
TTAAACTTCT TCTCTCTCTC TACGACTTGG CTTCCCTCGT CATCCTAAAC CATCCACGTA
CGCACATCGT GGAAGATGTG GACCAATGTT CTGTCGGTTT TCATCTTCGC AACCTTCTTT
GCGCGCTCGA CACGCGCCCT TTCATCATGG TACACAGAGC CAGGTGGCAC ACCTACGGGT
TCGGTAGACG GCGATAAGGT GCGAGGAGTC AATTTGGGTG GATGGTTCAT CTTGGAGAAT
TGGATGATGC CCAGTTTTTT TGAGGAATCA ATTGTCAGAG ATACGTATCT CAATGACGAA
GTAGGTGTCG CGCGTTTCAA GAGGCGTTGG CTGGGTGAAA ATTAATTCTT GTCTTCGCAG
TGGTCTTTCT GTCTAGTTCT GGGACAGGAT GAGTGCCTTG CAAGACTACA GCAACATTGG
GATACTTACA TCACTGAGGA TGACTTCAAG AGATTCGCAA ACTATTCTCT CAACACAGTG
CGGATACCCA TGGGATACTG GGCATGGACA ACACCAGAGG ATTACGAGCC GTAAGCCAAC
TCGAGAATTA TGGCAATGCT GATAATTTGC AGTTATATTC AAGGACAGCT CCCCTATCTT
GAAAGAGCTC TGAACTGGTC CAGCTGGTAC GGTTTAGACG TCATGATGGA TCTCCATGGC
CTTCCTGGGG GAGCGAACGG CCAAGACAAC CAAGGATACA AGGGACCGAT AGAGTTTCAG
CTGAACAGCA CGAACATGGA TAGAGCCATG GAAGCACTCG CAAACATGAC ACAGTATGTG
ACAGCAGAAA AATTCGACGG TGTCGTTAAA GCCATCGAAC TAACGAATGA GGTGGGTCGA
TTCCTCCAAC CTTCCATATC ATCCCTAACG CTGACTCTGT TACCTCGCGT TAGCCTTACA
TCTTAGAATA CAGCTCACGC GGAATGGACT TTTATACTTT GGCCGACTTC TACGTGAAAG
GCTACCAGGT CGTCCGAGCA AACGAAAACA TCATTGACGG AGCCAATGAA GTAATGGTCG
TCATTCATGA CGCTTTCCAA CCACTCTTGA ACTGGAAGTA TTTTTGGGGA GAAGAAAGTC
TGGGCTTGAA CTGGACTAAC TATGCGCTTG ATACCCGTGA GTACTGGGGC CGTGCTGTTG
TTGAATACCC TCTGACGGCC CATAGATATT TATGATGCCT TTGGTGGCGC CGATCAAAAG
TCATACCAAG AGCACTTGGA CACAATATGT GCCCTATCGG CCTCTATCGC TGAAGCCCAG
CAGTATTTCC CCGTCATTGT TGGAGAATTT GCTCTGTGAG TGAACTTGCT TTTCAGGTGA
TTTTCGAGTC TCATATTATA CAGGGGCGTC AACACGTATT GTGTGGATTA TCAGTCTTGT
TGGGGCCTTA CCATGGACGA GGTCATCGCC AATTTTACCT CTACATACGA GGCATCTCTC
TTTCTGCGCC AATTCTGGGA GGTTCAGTCA GATGTGTATG AGCTTGGAGC TGGATGGATA
TTCTGGTCAG TTCACCATGA GCTTGCTGGA CCATGGAGTT GGACACAGTC GGCTGCTCAA
AATTGGATTC CAATGGATCC TTCTGAAAAA ATGTAAGTAA CGATCCAGTT ATCTATCTGG
TTGAAGTGTG ATACTCTCAC TTACACGTCG GCGACCTTTA GCTGGCCCTT CGATTCTGAT
GCCTCGTCCT ACTGTCTGGA CACCTTTAAC CCCTTAGAAG GTGACCAAAA TCTCCCTTAC
TTCCCTTTGT ACGCCAACAA TTACACCAAT ATCGACATTT CCTCAGTTAA GCCTGTGAGA
CTGAACGTCA ACCCCGCTTC CAACTCCACA GTAGCCTCTG CCACTTCGTC TTCTACCACT
TCTAGTTCAC AATCCTCGAC AGCCAGTCCG TCTAGTTCTT CGTCAGAAAG CGGTAGTTTT
TATACAGCTC CTCTACCGTC CTCTTTGGTT TTGCTGCTTA TGGTGGTTGC TGTCAGCCAC
CTATGATTTG TAATTAGTCC CCAAGATCCC TGGCGGGTAC CAGAGACCTT TGACATGTGA
TCCGTATGGC GCTCGCCTTG ACAGCTGGAC AGACAAGATA AAATATTCAC AGTCTTAATA
CAAGGT
 
Protein sequence
MWTNVLSVFI FATFFARSTR ALSSWYTEPG GTPTGSVDGD KVRGVNLGGW FILENWMMPS 
FFEESIVRDT YLNDEWSFCL VLGQDECLAR LQQHWDTYIT EDDFKRFANY SLNTVRIPMG
YWAWTTPEDY EPYIQGQLPY LERALNWSSW YGLDVMMDLH GLPGGANGQD NQGYKGPIEF
QLNSTNMDRA MEALANMTQY VTAEKFDGVV KAIELTNEPY ILEYSSRGMD FYTLADFYVK
GYQVVRANEN IIDGANEVMV VIHDAFQPLL NWKYFWGEES LGLNWTNYAL DTHIYDAFGG
ADQKSYQEHL DTICALSASI AEAQQYFPVI VGEFALGVNT YCVDYQSCWG LTMDEVIANF
TSTYEASLFL RQFWEVQSDV YELGAGWIFW SVHHELAGPW SWTQSAAQNW IPMDPSEKIW
PFDSDASSYC LDTFNPLEGD QNLPYFPLYA NNYTNIDISS VKPVRLNVNP ASNSTVASAT
SSSTTSSSQS STASPSSSSS ESGSFYTAPL PSSLVLLLMV VAVSHL