Gene CNG04150 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG04150 
Symbol 
ID3258641 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp1170306 
End bp1171910 
Gene Length1605 bp 
Protein Length457 aa 
Translation table 
GC content54% 
IMG OID638258038 
Productconserved hypothetical protein 
Protein accessionXP_572169 
Protein GI58270026 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG5071] 26S proteasome regulatory complex component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.161996 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGTTCTCAA GGCATACGAT GGACATTACA GCAGCCCTCC ACGAAGCCTC TTCTCAACAA 
AACCAAAAAT TGCGCTCTCA AGCATACCTC TCTCTCCTCC AATCCCTCCT GCAACCACCA
CTCACATCCG CATCCCCGCT TATCGCTTTC GGCACGCACT TCACGACGTC TAACACCGTC
ATCATCATCG TCGGTCGTCG TATCCTCGGC GCATACCTTG TCGCCCTTCT ATCAGGGACA
ACAGTGGTGC AAAAGGGGAC TGCCAAGGTC CCCCTGGATG AAGAGGGGCA AGCTGACGAA
GCAGAGTGGG CAGCGCTTGG GAAAGCGGCA TTTGGTGGGG AGAAGGGCGA AGAGGTGAGG
AGGGATGTTG TTGAAGGCGT GCTGGCTGCT GGCTCTAGTG GGTGGTGCGA TGAGCAGGTG
AGTATCATGG CTGGGCGAGC AGTCCACGTC CACGGGCTGA CATAACCGTG GCATAGATAA
CTGTCCTGCG ACATCTACAC TCGCACCTCC TCATGCTTGA AGAAGACTGG GAAGGTGCTG
CTCGAGCACT GATGCCGATG CAACTAGAGG GTGGTTCAAG AGTTGTATCC GACGATGAGA
AGCTCAATGT GTACATGCAA ATCGTCCGCC TCTTCCTCGA GGTAAGCTTC CTCGTCCATT
CTATCGCTGG GCTTTCAACT CACACCTCAA CACCTAACTC AGTGCGGCGA ATGGGGCCAA
GCCCAAACAT ACTTTACCCG CGCTTCCCTC TTACCCCGCC CAACAGATAA GGAGACCCGC
CTATCTATGC GTCTCTCTCA AGCTAAACTA TACGATTTTG CCAACGAATT CGCCAAAGCG
TCTGTCACTT ACCACGAAGT CTCGCACGAC CCTTCCATCG ATCCTTCCGA CCGACTCATT
ATCCTCTCCG CCGCTGTCAC CACCTCTATC CTCGCCCCTT CTGGCCCCCA CCGCTCTCGG
ATCCTCGCTA CGCTCAACCG TGATGACCGG GTACACACCG AGCTGCCCGC CGGGTTGGGC
ACAATGTTGA AGAAGATGCT TCTGGAGTAT ATCGTGAAAC CGGAGGAGAT GAAAGAGTTT
GAGGGGGCAT TAGCACCACA TCAGCGAGCG GTCGTAGAAG GCGGTGGGAC AGTCTTGGAA
AGAGCTGTAC GGGAACACAA CGTTGGTGCA TGTGCCAAAG TATACGACAA CATCTCCTTC
TCCGCCCTCG GTGCGATTCT CAACCTCTCT CCGTCTTCAG CCGAGACGAT CGCTCAGCGT
ATGATTGAGC AATCCCGTCT TCGCGCATGG ATTGACCAGC CTTCCCAACT CATCTTTTTT
GAATCCCGTC CGCAGCTTGA TACCGACGCA GACGCCCAGG GCACGGCGGG CGGGTTAGGG
GTGGAGAAGG AGGAGAAGGA GGTGGAGAAG GTAGGATGGG GTGTAAGGTG GGATGAGAGG
ATTAGAGGCA CGAGTTTGAG AGTGGAAGGG ATTGCAGAGG CGATCTTGGC AAAAGGTTTG
ATCGATGCGT AATCGGGATT GCTCACGCGG AGACTGCACG TAGAGGGAGT GAAATGTAGA
TTAAACAGAA AAAGAACATT GTATGTACGT ATATATCTAC GCTTC
 
Protein sequence
MDITAALHEA SSQQNQKLRS QAYLSLLQSL LQPPLTSASP LIAFGTHFTT SNTVIIIVGR 
RILGAYLVAL LSGTTVVQKG TAKVPLDEEG QADEAEWAAL GKAAFGGEKG EEVRRDVVEG
VLAAGSSGWC DEQITVLRHL HSHLLMLEED WEGAARALMP MQLEGGSRVV SDDEKLNVYM
QIVRLFLECG EWGQAQTYFT RASLLPRPTD KETRLSMRLS QAKLYDFANE FAKASVTYHE
VSHDPSIDPS DRLIILSAAV TTSILAPSGP HRSRILATLN RDDRVHTELP AGLGTMLKKM
LLEYIVKPEE MKEFEGALAP HQRAVVEGGG TVLERAVREH NVGACAKVYD NISFSALGAI
LNLSPSSAET IAQRMIEQSR LRAWIDQPSQ LIFFESRPQL DTDADAQGTA GGLGVEKEEK
EVEKVGWGVR WDERIRGTSL RVEGIAEAIL AKGLIDA