Gene CNG00690 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG00690 
Symbol 
ID3258788 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp189265 
End bp192157 
Gene Length2893 bp 
Protein Length691 aa 
Translation table 
GC content48% 
IMG OID638257686 
Producttrehalase precursor, putative 
Protein accessionXP_571789 
Protein GI58269266 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1626] Neutral trehalase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.689154 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCTTGA GCGGAAATGG ACTGACCAGT AAGACAGACA TTTGTAGACA AGGTATTGTT 
CATCTTGTGA TACTGGGAGG TATATCTTGC AGCTGATACC CATTGAACAT AGCCAACAGC
GAAGACTCTT AACGAAACTT TGTCCGCATG GGAGGCTCTA GGTGACAATG TCACAGTCGG
AGATGTGGAA ACATTTGTCG AGCAATACTT CGTCCGTGTC TTCCCCTTAA TTTGTTATCC
GAAGTTGGAA TTGATTTATC TCTTTGATGG CAGAAAGGAG AAGGTCTTGA GCTTAGCCAA
GTTGAGCTCG AAAACTTTGT TGAAGACCCT GCTATACTTG ACAACATTAC CGATCCCGTC
TTCAGAGCTT GGGTAAAGAT TGTGAATGGA TACTGGACTC TCCTCGCCAG GTGAGACACG
GCTTCGAAAG GTGCAAAGCG CTAGCTAATG ATTGTGATAC CTTCATCAGG GAGACCAACC
AATCGGCCCT TTGCAATGGA GACTGCGAGT CAAGTTTGAT TCCTCTGAAC CATACTGTTA
TCGTTCCTGG CGGGCGATAC AGGGAAATAT ATTATTGGGA TTCTTTCGTG AGCGGCGCCT
TTCACACATA TCGCATGATT TTGGCTCATC TTTTGTATAG TGGGTGCTGG AAGGTCTTCT
CAAGTCTGAG CTGTACGACT ATGCCTGGGA TTTACTACAG AACTTTATGG ATCTCATTGA
TGTCAGTTCT TTTTGTCTGA ATGAGTGAAG AGACTGTTGT TGATAATGTC CGCAGATCTA
TGGGTATCTT CCCAACGGCG GGAGAAAGTA CTATCTCAAT CGTTCTCAGC CTCCAGTATT
TGTCCAGGTA TGTCGCAATA TATAATACTT TCATTATTAT ACGGATGCTG ATGAAGCGCA
GATGATCGAT GCTTACATCA AGGCCACTAA CAGCATTACT CTTCTTGAGC GAGCTCTCCC
TGTAGCTTCA GTTCGTATGT CCCTTCGTTG ATACGACCGT TCAGCTGACT ATAAGACATC
AGTCCGAGTT AGAATGGTGG GCAAATAACA GAACCTCAAA TTTCACGTCA CCCTTCACCA
ATCAATCCCG CACTATTGCT CAATATTCGG TCACTAACAG CGCTCCTCGA CCAGAAGTAT
GTCCAAAGTC CTGTCCCTTC CAAGCAATCT CTCTCTGATA AGCTACCTAG GGTTATGTTG
AGGACTTCGA AACGGTGATG GGAGCTTCCC CAGCCCTCAA CGAAACTGAA CAAGCCGAGT
TGTACTCGGA GCTCGCTACT GGCGCCGAGT CAGGCTGGGA CTACTCCTCA CGATGGTGCG
AGCAACCACT CCTTAACACA ACAGATAACA ACCCTTCCTT AAGGACCTTG AAAGTCAAGT
CAATCATCCC TGTTGATCTA CTGAGTCTGA TGGCCGGAGA CCATGCCCTC GTGAGTTCAT
CACTGATATA GAGGATGGGA TTATGGCGCT GACATAGGTT TAGCTGGCCA ATTTGTATGA
GCTTTATGCA AACAGTACTG GGGGTGGAGA AGGGACAGGC AATGAGGAAA TGTCAAAGAG
GGATGGGGAA TCTGATGATG CAGCGAGCAA AATTGCATAC CATCGTCAGA TGGCCCAAGA
GTTCAGCGAC TCGATCCTCG ATCTCTGCTG GGACCCAGAA AAGGTGAGCC TTGGAACCCA
TTTGGTACCG AAAAAGTGCT TACAGAGGGG ACTCGCATGC AGTCATGGTT CTACGACTTT
AACGTGACTT CAAACTCTCG CTCCAACATC TTCCACGCGG GTGGCACCTG GCCACTTTGG
CAAAACATTA CTCCATCCGA AATTATGGGC AACGAAAGTG CGGCTCTTTC TTTAGTTTCA
GGATTTAGGT TCCTTTTGGG TCACTACTCA GGGGTCCCAA GTGTGGCTAC TCTGCTGTTT
ACTGGACTGA ACTGGGTACG TTCCGAGGAT TTTTCGCGCG GAAAGAGGGT TGCTGATTGA
TGGGCAGGAT TTCCCTAACG CCTGGCCGCC CCATGCGTGT AAGTCAAGAG AGATGGCCAC
TAGCTGTAAA AGACATCCAT TAACATCGGA ATAAAAACCA CAGATACCGC CATCAAAGCT
TTTGAGACAC TTGGTCGTGT ATTGCCCAAT GCCACTGTCC TTTCCAACTT GACGATCCCC
TTCGATTCAG TGACCGAGAA CCAACTCGGT CTCTCAGAAT CCGAGCTCCA ACCACAACCC
CAATCCACCA TTGGTAACGT CTCTCTGAAC ACCGAGACCT CCCAAGACAA GCCCTGGCCT
CTTGCTCTCT CAATTGAATT TGCGAACAGG TATTTGGGAG CCGCATTCTG TTCATGGTAC
TCCACCGGAG GGCAAATTAG CGGATTATTG ACACAGTTGC CGTTGAGCGA CTTGAATGCT
ACTGGAACCT ATACTTCTGA GCAATCAGGT AAGGGGTTTT CGGAGCTGTG ATTCATATCA
GCTGACAATA AATTGCCAGG CGTGATGTTC GAAAAGGTGG GCTTACCTTC GCATATAGGG
TTATATAGGT TTGGAGCAAA ACTAACACGT CGGTAGTTCA ATGTTACTGA CACAGATGCC
GCTGGAGGAG GTGGTGAGTA TACAGTCCAA GTCGGATTCG GTTGGACAAA CGGAGTAGCC
CTTTGGGCCG CTGGCGAGTA CGGACAGTAC ATCCCTGCAC CCACATGTCC CCTTATTCCG
ATCATCGAAG TCAATGGGAC GGCTGGTTCC AATACCTCTG ATAGCTCGGT ATACAAGTCG
ACGGATAAGG ATGGCGGTCC GACAGCGAGT GACACCACTA CGTCCAAGAG CTTGTTTGTC
GGATACCGAA TCCCGAGAGA GTAGGCTCGC CCGCGAAAGG ACTCTTATGA ATCAATCCTT
TTATAATTTG CTT
 
Protein sequence
MILSGNGLTS KTDICRQGIV HLVILGAKTL NETLSAWEAL GDNVTVGDVE TFVEQYFKGE 
GLELSQVELE NFVEDPAILD NITDPVFRAW VKIVNGYWTL LARETNQSAL CNGDCESSLI
PLNHTVIVPG GRYREIYYWD SFWVLEGLLK SELYDYAWDL LQNFMDLIDI YGYLPNGGRK
YYLNRSQPPV FVQMIDAYIK ATNSITLLER ALPVASSELE WWANNRTSNF TSPFTNQSRT
IAQYSVTNSA PRPEGYVEDF ETVMGASPAL NETEQAELYS ELATGAESGW DYSSRWCEQP
LLNTTDNNPS LRTLKVKSII PVDLLSLMAG DHALLANLYE LYANSTGGGE GTGNEEMSKR
DGESDDAASK IAYHRQMAQE FSDSILDLCW DPEKSWFYDF NVTSNSRSNI FHAGGTWPLW
QNITPSEIMG NESAALSLVS GFRFLLGHYS GVPSVATLLF TGLNWDFPNA WPPHAYTAIK
AFETLGRVLP NATVLSNLTI PFDSVTENQL GLSESELQPQ PQSTIGNVSL NTETSQDKPW
PLALSIEFAN RYLGAAFCSW YSTGGQISGL LTQLPLSDLN ATGTYTSEQS GVMFEKFNVT
DTDAAGGGGE YTVQVGFGWT NGVALWAAGE YGQYIPAPTC PLIPIIEVNG TAGSNTSDSS
VYKSTDKDGG PTASDTTTSK SLFVGYRIPR E