Gene CNH02900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH02900 
Symbol 
ID3259218 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp301568 
End bp303528 
Gene Length1961 bp 
Protein Length498 aa 
Translation table 
GC content51% 
IMG OID638258195 
Productcytoplasm protein, putative 
Protein accessionXP_572461 
Protein GI58270610 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2730] Endoglucanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones32 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTCCGGTCTG TTCCTTGAAT TTTGTGCACA GGCCCCTATC AATACCAACC AGAAATCTCA 
CATTAACACT CGCCCACAGC CAAAAGCACA ACCTGTTTAT CCACCCCAAT CCACAAGCAT
TCAGCAATGG GTGTACTCTC AAAGTTCAAA GATCAATTCA CCCAATCCTC CCCTTCTTAC
GACTTTCCCC CTCCACCTCC TTCAATCCCT CTCGGCCCTA ACGCAGTCTT CCGTTACCGC
AAGCAACGAG GTGTCAATTT GGGATCATGG TTTTCTCTGG AGCAGTGGAT CTGCCCTCAC
GTCTTCAAAG GCGCCAAACC CCCTGGCCAA AGTGACTATG ATGTGGCTAG TGGGAAGGAC
GCAAAGAGGA TTTTAGAGGA GCACTGGGAT ACTTGGATCA CCGAGGATGA TATGAAATGG
ATTGCATCTA GGGGGTTTAA CAGTGTTCGA CTACCCGTGA GCCTACTACA GATATGGATA
ATATTTACTT GCTGATGGTA TCTTGGTTGG GCTCAGATTG CATACTATCA CCTCTGCGGT
CCTCTCCCCG AAGTTTTGAA AGGCACTGAT TTTGAATCAT TTCGCCACGT TTTTGAAGGC
GCTTGGGGAC GAATCGAGAG AGCGGTCGAG ATGGCAGGAG CTTATGGTTT AGGAGTACTT
ATAGGTTAGT CATTGAGTTG ACACTTGCCA TCGGATCACT TCTCAAAACA TTAGGCGCTG
ACGTCTTTTC ACCGTACTGC AATTAGACCT GCACGGTGCG GCCGGTGCTC AGAATCCGGA
TGGTGAGTTT TTACAGCAAG GTAGCCATTT CAGTTCAGCA TCATTAACCA TAACTGCCTT
TATACAGCCC ATGCCGGCCT TTCTCGCGGC AAAGTATCAT TCTGGGATAC CCATGCCAAC
CAAGCCTCCA CCTCCCTTGC ACTTCGCTTT CTCGCATCCA AGTTCGCTTC CGTCCCCCAC
ATCGTCGGCC TAGAACTCCT CAACGAGCCT CAAAACAACC GCAAACTACA GTCATGGTAC
AGCAAGACTA TCGACGAAGT CCGTAAAGTC GCTCCGCCGG ATTTTCCCAT CTACTGCTCA
GACGCATGGG ATACGGATCA TTATGCTGGT TGGGTCGGAT CGAGAGGCGA TTTCGTCGTA
CTAGACCATC ACCTTTACAG ATGTTTCACC GATGAGGACA AGTGTCAAAC CGGAACAGAT
CATGCAAACA ATCTCAGATC TGGTTTCAGA GGTAGGTTTG CCCAGCAATG CGAAGCCGCC
AAAGGATCTC TCGTCGTGGG CGAATGGTCT GCCTCCCTGG ACCCTCGGTC CTTCCCTCAA
GGAATGCCTG ACGGGGAGAA AGACGCTCAG AGACGGGCAT TCGTCCACGC CCAGCTCGAG
ATATTCGAGT CCCACGCGGC GGGGTATTGG TTTTGGACTT ACAAGAAAGG TGAAGGATGG
GACGCTGGCT GGTCAGCAAC CAATGCGAGT CAGGCAGAGA TTCTGCCTGG GTGGGTTGGA
AGCAGACAGT TCAAAGGGAC ACCACCCAGT CATATCAAGG ATCAAGAGTT GCAGAACGGA
CATAGTAGGT TTAGCAAATC TCTCCCGCTA ATTGTCAAAA TGCGAAAAGC ACTGATGGGA
CATTCTTTTG AATAATCTAG AATCCCACGC TGATTATTGG GCCGCAAACG GCGGTTCTCC
CAATCCAGAT ATGTACGCTC CCGGATTCTC TCAAGGATGG GACGATGCTC TCATCTTCTT
GAGCACCCAA GGATCACCTA GCGAGATGGG TTTTGTGCAC CAGTGGGCAA TAAGGCGTCA
AGCAGAGTTT GAAAGCCAGG GGCATAAGCT CGGCCGCGCT GCTTGGGAAT GGGAGCACGG
CTTCAAGCAA GGGGTTGAAG CTTGTGCTAG GTGCTGTCTG GCTTGATACA CGTTTCGGCA
AGGAGTTCGC TGTATCATTT AGGCAGCCCG TTGTAGATAG T
 
Protein sequence
MGVLSKFKDQ FTQSSPSYDF PPPPPSIPLG PNAVFRYRKQ RGVNLGSWFS LEQWICPHVF 
KGAKPPGQSD YDVASGKDAK RILEEHWDTW ITEDDMKWIA SRGFNSVRLP IAYYHLCGPL
PEVLKGTDFE SFRHVFEGAW GRIERAVEMA GAYGLGVLID LHGAAGAQNP DAHAGLSRGK
VSFWDTHANQ ASTSLALRFL ASKFASVPHI VGLELLNEPQ NNRKLQSWYS KTIDEVRKVA
PPDFPIYCSD AWDTDHYAGW VGSRGDFVVL DHHLYRCFTD EDKCQTGTDH ANNLRSGFRG
RFAQQCEAAK GSLVVGEWSA SLDPRSFPQG MPDGEKDAQR RAFVHAQLEI FESHAAGYWF
WTYKKGEGWD AGWSATNASQ AEILPGWVGS RQFKGTPPSH IKDQELQNGH KSHADYWAAN
GGSPNPDMYA PGFSQGWDDA LIFLSTQGSP SEMGFVHQWA IRRQAEFESQ GHKLGRAAWE
WEHGFKQGVE ACARCCLA