Gene CNH00650 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNH00650 
Symbol 
ID3259350 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006693 
Strand
Start bp998752 
End bp999847 
Gene Length1096 bp 
Protein Length252 aa 
Translation table 
GC content51% 
IMG OID638258418 
Productconserved hypothetical protein 
Protein accessionXP_572258 
Protein GI58270204 
COG category[R] General function prediction only 
COG ID[COG2110] Predicted phosphatase homologous to the C-terminal domain of histone macroH2A1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.110793 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATATCCG ACGGCGATCT CCAGTCGTCA CTCAAGATTC GCTCGAGATA TAACGGACAC 
TACTCCTGCA TGACGCTTCC ACGTCTGGCA AGCCGGCTCG CAAAGACCAT GACACCCGCC
GACATCCCTA CCCTTTCCCA GCTCTACCGC CATGATCACA CGAATGCACT CAATCCCACC
AAGCCAAAGT ATGAGTTTAC GAAACAGCTC AATGATCGGG TCTCCATTTG GAGAGGAGAT
ATCACCGAGC TCGAGGTGGG TGTTTTTCCT CCGTCTAGAC TAGTCACCTC GCCAACTTTA
CGTCTGTATA GGCCGACATG ATCGTCAACG CTGCCAACTC GTCACTCCTC GGCGGGGGCG
GCGTCGACGG TGCGATCCAC CGGGCTGCAG GCAAGCACCT GCTCGAGGAA TGTAAAAAGC
TGGGCGGTGC CCAGACGGGG GAAACAAAGT TTACCGCCGG CTACAACGTG CGTCCTATAC
CCCCCTTGCA GTCACGCTGA AAAAAGCCCC AGCTATCGAG CAAGAAGATC GCACATACAG
TCGGACCCGT CTACCACTCG CACCCACCCC AACGTGCAGC CCAGCTTTTG AAAAGCTGTT
ACCAATCGTC GTTGGAAGGG TGTAGAGATT CGGGAGGAGG CGTCATTGGG TTTAGCAGTA
TCTCTACCGG CGTCTGTACG TCAAGAGTCT GGATGGGTGC TGGTGCTGAT GCTGATGGCA
AAATCCGCAA TGATAGATGG GTATCCGATC AAGGATGCTA CGCATATCGC ACTCGAGACA
ACTCGTCAGT TCTTGGAACA AGATGACTCT GTACGTCGTC TCTGTTTCCC AAACCGCACT
GACTTACATA CTGGACTGGA CCCCACCATA GATTACAAGA GTAATCTACG TCGTGTTTTC
AAAAAGGGAT GAAGATGTCT ATCGGGAGAT TATCCCACAG TATTTCCCTC CTGATCCCGA
ACATGGGCAT GGAAGTGTGT AAATAAATAA AAAGGTAACG ATAAAAAGAA AAGGTAACGA
TGGCCGTGAT GGGAATGGAA AATGCATATC CTTTAGCACC AAAAGATGGG TACAACCACT
AAAACTCTGA TATACA
 
Protein sequence
MISDGDLQSS LKIRSRYNGH YSCMTLPRLA SRLAKTMTPA DIPTLSQLYR HDHTNALNPT 
KPKYEFTKQL NDRVSIWRGD ITELEADMIV NAANSSLLGG GGVDGAIHRA AGKHLLEECK
KLGGAQTGET KFTAGYNLSS KKIAHTVGPV YHSHPPQRAA QLLKSCYQSS LEGCRDSGGG
VIGFSSISTG VYGYPIKDAT HIALETTRQF LEQDDSITRV IYVVFSKRDE DVYREIIPQY
FPPDPEHGHG SV