Gene CNC04110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC04110 
Symbol 
ID3256614 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1261907 
End bp1263887 
Gene Length1981 bp 
Protein Length516 aa 
Translation table 
GC content48% 
IMG OID638255632 
ProductAAA family ATPase, putative 
Protein accessionXP_569653 
Protein GI58264994 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0465] ATP-dependent Zn proteases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCGCTACATA ATAATAATAC CTCTGCCACT GCGGTAGGGT CACCAACATA ATTACCGATA 
CTGCATACTC CAAATAACAC GACATTTCAC GATGTTTAGA TCATCCTTAC CTCAGCGGAC
CAGTCATAAT TCTGCCTCAG AAGCTGGGCC CTCGATCCCT ACACCGATAC CGACAGATGG
AGATGCTGTC ACACTTGAAA ATGCCAGTAA CACAACTGGG GAAGCTACTT TATCCTCAAG
TGGCTCATCC GAAGGAGAGT CATTAATATC CAAGATGATG GCGGATAAGT GAGTGTCAGT
GAGCTAAAGT CGGCCTCCAC TCTAACGCCC AATAACAGTC CTTACTTCTC AGCAGGTGCT
GGACTTATGG TAGGCTCGGA ATTAGATGTA GTACCAATTT CCACTGATTT TATTCGGACA
CAAAGGGGAT TGGTGTGGCC CTGACAGCAT TGCGTCGCTC TATAACACTA GGAGCTACCA
TGGCTCAACG AAGAATGTTG GTGACTCTTG AAATCCCTTC CAAAGATCGG TCATACCCAT
GGTTCCTCGA GTGGATGGCC CATCAGTCAG CGGCCCAGAC CAGAGGAAAT GTTAAACCAC
CCGGTCTGTT TGGTTGGGGT CAAGGCATGA GAAGTCACGA GCTAGCTGTG GAAACAAGTT
ACAAGCAGCA CGAAAATGGG GCCAGCGAAG CCATTTTCAA TTTAGTGCCA GGGCCGGGTA
CACATTACTT CAAATATGGA GGGGCCTGGT TTCAAGTATG TGATATCCAA TGATGGGCAT
AAATATGTCA GATTATTGAC GTATAAATAC ATCAGGTAAA ACGCGAGCGG GATTCTAAAC
TCATGGACCT GCACTCCGGA ACGCCTTGGG AGACACTGAC TCTGACAACA CTCTCAACTT
CGCGAGACCT CTTTTCATCT CTCCTTGAGG AGGCACGAAC GCTTGCTGAA GCCTCGACTG
AGGGTAAGAC TGTTGTTTAT ACTGCGTGGG GTGTCGAGTG GCGTCCATTC GGCAAACCAA
GGAGAAGAAG GGAAATGGGC AGTGTAGTCC TGGGTAAAGG AATTGCCGAA GAAATCGAAT
CCGATCTGAA GGGCTTTTTG GGTCGAGGGA AATGGTACGC GGAAAGAGGT GGGTTGTATA
TGACTCCTGA TAATTCTCTT CCTAAACGCT TGTCTAGGTA TTCCCTATCG AAGAGGATAC
CTCCTGCATG GACCTCCGGG ATCTGGTAAA ACGTCGTTCA TCCAAGCTCT CGCAGGGTCC
TTGAACTACA ATATCTGTCT CATGAATCTT AGCGAGAGAG GTCTTACAGA TGATAAACTT
AATCATTTGC TGGGCCTTGT ACCTGAGCGG AGTTTTGTGC TGCTAGAGGA TATTGATTCG
GCGTTTAACA GACGTGTACA AACGAGTGAG GACGGGTGAG TGGGTTATGA GTCATGCGTT
GCGATGGTAT CTTACGACTC TCTAGCTATA AATCTTCTGT CACCTTCTCC GGTCTCTTAA
ATGCTCTTGA CGGTGTCGCA TCATCGGAAG AACGAATCAT CTTTATGACC ACCAATCATT
ACGATCGCCT TGACCCGGCA CTTATTCGAC CAGGTCGAGT TGATATTCAA CAACTTCTGG
ATGATGCTGC TGGTGAACAA GCCAAACGAC TGTTTGTCAA GTTCTATGGC AATTCAGTCA
ATGAGGATGG CACAAAGGGT AGAGTGTTGA GGGAAGGAGA GCTTCCGCTA AATGACGAGG
AAGTAGAATC TTTAGGCAAC TCGGTGCAAC GTATCGTGGA CGATGAGCGG GCTCATGGAA
AGGTCGTTAG CATGGCTAGT CTGCAAGGGC ATTTTATCCG TACCGGAGCG AAAGAGAGCC
TAGATGGGAT CCGTGAGCTG TGTAGGCCAA GGGAAGGGCA AGCATAGGGA CATTTCTATT
GATGCATCAA CACTACTGCA TGATAGACAA TATTACTCGG TTGATTAAGT TACACAATTC
A
 
Protein sequence
MFRSSLPQRT SHNSASEAGP SIPTPIPTDG DAVTLENASN TTGEATLSSS GSSEGESLIS 
KMMADNPYFS AGAGLMGIGV ALTALRRSIT LGATMAQRRM LVTLEIPSKD RSYPWFLEWM
AHQSAAQTRG NVKPPGLFGW GQGMRSHELA VETSYKQHEN GASEAIFNLV PGPGTHYFKY
GGAWFQVKRE RDSKLMDLHS GTPWETLTLT TLSTSRDLFS SLLEEARTLA EASTEGKTVV
YTAWGVEWRP FGKPRRRREM GSVVLGKGIA EEIESDLKGF LGRGKWYAER GIPYRRGYLL
HGPPGSGKTS FIQALAGSLN YNICLMNLSE RGLTDDKLNH LLGLVPERSF VLLEDIDSAF
NRRVQTSEDG YKSSVTFSGL LNALDGVASS EERIIFMTTN HYDRLDPALI RPGRVDIQQL
LDDAAGEQAK RLFVKFYGNS VNEDGTKGRV LREGELPLND EEVESLGNSV QRIVDDERAH
GKVVSMASLQ GHFIRTGAKE SLDGIRELCR PREGQA