Gene CNH00790 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH00790 
Symbol 
ID3259023 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp963711 
End bp965921 
Gene Length2211 bp 
Protein Length590 aa 
Translation table 
GC content51% 
IMG OID638258403 
ProductEndoglucanase E-4 precursor, putative 
Protein accessionXP_572272 
Protein GI58270232 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.323744 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCTTACTTA CTTATTTATC GCCTTCCACT CTTTCTCCCT CTTCCGTCTC TCTCTTCCTT 
CTCGGCTGAT ATTTCAGCCA TGCTCATCCC CTTCCTTGCC CTCGCTTCTC TGTCCCGAAT
AACCACAGCC CAGCTCACAC CGTCTCCCAC TTATTCTCCT CCAACCTCCT CTGCAGGACT
CACAGCCTCC AGCGAAACAC CCAACACACA ATGGTCCAAT ATCCTTGGTA ACTCCCTTTG
GTTCTACGAT GCCCAAAGGT CAGGTAGATT AGATGAAGGA ACATATGGGA ACAGAGTAGA
CTGGAGGAAT GATAGCGCTT TAGAAGATGG GAGTGATTGG GGTTTGGACC TTGTCGGCGG
ATGGTATGAT GCGGGTGACT ACATCAAGGC GACATTTCCT TTGGTAAATC TGTTCTATTT
ATATTCCCCA TGACCTAGAC TAACCAGAAT AGAGTTTTAC CTTATTTGCG CTCTCCTGGG
GGGCGTTGAC GCATGGCCAA GGATATGGCC TTGCCAACCA AACAGCCTAT CTCGATGGGA
CCTTGCGATG GGGTTTTGAT TGGCTTATGA AGGTAAGGAA AGGCCTACAT GTGGGATTGT
GCTATAGAAC TGATGAGAGC AGGCACACCC ATCGGATGAT GTGCTGTTTA TCCAAGTTGG
TTCTGGGGAT GTCGACAACA ATTACTGGCA AGTCCCTTCA CTTGTACGCA GCCCAAAACA
GATCTAACAA TCACTCAGGG GCGGGGACCA GGACATTCCA AGTCCTCGCC CGGGGTACCC
AATCAACTCT TCTTACCCTG GTACAGATGG CTGGGCCGCT GCCTCTGCCG CCTTTTCACT
AGGTTCCCTC CTTTACACAC CAGGCGTCTC GTACAGACCC ACTTCATCGT CCTCTCCTCC
AACCTCACCT TCATTGGAAA ACTCCACTTA TGCGTCTCAG CTGTTGGCAC ACGCTGAGTC
GCTTTACTCT GTCGCCAACT CTACCACCCC TCGACAAACT TACTACGCGG CTTTAGGTGA
TGAAGTTGCC GCTTACGCCT CTTCCGACTG GCGAGATAAC CTCTGTGCAT CTGCTCTGGC
TCTGGCACTG GCGACAAACA ACTCTGCGTA CTACGCCGAT GCATACAACT ATTATGTCCA
ATATGGGCTG TCAGGCACAC ATGAAGTTTG GAACTGGGAT TCGTCACAGC CGGCAATTTA
TGTCATGTTT GCGGAAATTG CGAGCGCAAA GCCCGAGTTA GCGCAAGGAG CTGGACTCGA
CGTGAACTTG ACTGGATGGC AGACTGAAGT CGAGAACTAC TTTGATGGGC TTATCAAAGA
GGATTTCAGT AATTCTTACT TGACCGAAGG TGAGCAAAAT CCTTCCAAAA AGACCACTAG
CTTACGGGAG TAGGGGGATT ACTCTATTGG GATGGCGACT CTGACGAGGC GTCCTTGAAC
CCTGCCATGG CTGCCGCTAT GCTCATGTTC AAGTACGCAC CCATGGCCTC TTCAACCGAC
AAGACCAACT CTTACAATTC ATTCGCTCAA TCCCAACTCA ACTACCTGCT CGGCTCCAAC
CCCATGTCAG TCCCTTACAT CGTTGGGCAA CACCCGAATT CCCCATCCAA CCCCCACTCT
GCCCCCGCTT CTGGTGGCTT CAACATAAAT AATATCCGTG ACGACCCTCC CACCGAGGCG
CACGTGTTAT ATGGTGCGGT GGTGGGTGGA CCGTTGAGCA GTGATCAATT TTGGGATTGG
AGAGACGATT GGGTGCAGAC GGAGATAGCA TTGGATTATA ACGCGATGAT TCCAACTCTC
GCCTCTATGC AGGTACGTCT TTGACTAGCT TCGCCTGGCA TTTAAACGAG GGCTTGAGAC
TGACTCTATC TGTAGCTTAT GAACAACACT GCCGATCCAC CTTATGTCGA CATCGCTGCA
GGCACATACT CCATCCCCTC TGGCCAACCT TGTGATGCAG CTCTTCCATG CCGCGGTGGC
GGCGGTCTTA GCGGTGGTGA GATTGCAGGG ATTGTTGTGG GTGTTATTGT GGGTGTGGTC
TTGTTGGTGA TTGTGGGCGT TTGGTGGTGG TGGAGGAAGA GGGGAAAGAG ATGGGGTAGT
AAGTGGTAAG TAGGAAGCGA GGATTGCGGC GCAGTTTGGT GGACATTTGG TTTTTTGTTG
TGTCATCATA ATGGATTTAT AAAGTCAAGC CTATGTATCC GTGTTTTTGT A
 
Protein sequence
MLIPFLALAS LSRITTAQLT PSPTYSPPTS SAGLTASSET PNTQWSNILG NSLWFYDAQR 
SGRLDEGTYG NRVDWRNDSA LEDGSDWGLD LVGGWYDAGD YIKATFPLSF TLFALSWGAL
THGQGYGLAN QTAYLDGTLR WGFDWLMKAH PSDDVLFIQV GSGDVDNNYW GGDQDIPSPR
PGYPINSSYP GTDGWAAASA AFSLGSLLYT PGVSYRPTSS SSPPTSPSLE NSTYASQLLA
HAESLYSVAN STTPRQTYYA ALGDEVAAYA SSDWRDNLCA SALALALATN NSAYYADAYN
YYVQYGLSGT HEVWNWDSSQ PAIYVMFAEI ASAKPELAQG AGLDVNLTGW QTEVENYFDG
LIKEDFSNSY LTEGGLLYWD GDSDEASLNP AMAAAMLMFK YAPMASSTDK TNSYNSFAQS
QLNYLLGSNP MSVPYIVGQH PNSPSNPHSA PASGGFNINN IRDDPPTEAH VLYGAVVGGP
LSSDQFWDWR DDWVQTEIAL DYNAMIPTLA SMQLMNNTAD PPYVDIAAGT YSIPSGQPCD
AALPCRGGGG LSGGEIAGIV VGVIVGVVLL VIVGVWWWWR KRGKRWGSKW