Gene CNA06970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA06970 
Symbol 
ID3253228 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1893064 
End bp1896037 
Gene Length2974 bp 
Protein Length812 aa 
Translation table 
GC content52% 
IMG OID638253019 
Productchromatin assembly complex protein, putative 
Protein accessionXP_567014 
Protein GI58259203 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0273376 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATCCATATCC TCTTTCCATA CACTCCAACA ATGAGACCCA AGGTCCTTGA GATCGCGTGA 
GCATCCATCC CTCTCCAACC CCGACCGTAC CCCGCCGAGC TGACCCCGCC ACGTTAACAG
CTGGCACGAA ACACAGGCGG TTTACTCGTG CGATTTCCAG CCGCTCCCGC TCCCCCAGTT
GAAACGTCTC TTGGCTGCGT CCACAACCAG CGAGAGCGAA GAGGACAGGG ACAGGATCGA
AAAGGGCAGC TCTTCGGCAG CTACTGCAGC TGGAGGAAGG CAGTACAGGC TGGCAACTGC
CGGTGGTGAT TCCAAAGTGC GGGTACGTCT TCCATCCTTC TTCCCCCCTC CGACTGGTTT
TGGGACAACT CACCAAAGGT AATAACCGCT GGGCACGTGT AGATATGGAT GGTTTACCCC
AATATCCCTT CCATCCCCCC GTCCACCTAC GCCGCCCTCA CAGGACAAGA ATATACACCA
CACCCACCAC GAGTGGAATA CCTTGCGACG TTGTCGAAAC ACACTGCTCC GGTTAACGTC
GTCAGGTTCA GTCCCAGCGG ACAGATACTT GCTTCGGCTG GTGATGGTGA GTGCAAAAGT
CTCATGCATA TCCCAAAGGC TATAGACTGA GAGGTTCCAC GGTAGACGGA AACGTTATCC
TCTGGGTGCC CAGCGATAGA CCAAGCGTGA CTTTTGGAGA GACTTCAGAT GATTTGCCCG
ACAAGGAGCA TTGGAGATTA CAAAAGATGC TTCAGTATGT CCCTTGTGTC TCCCTTTATC
TGCAATATCA AGCAGAAAAG CTAATCACAG GCCCAGGGTG ACCACAAAGC ATGTATACGA
CTTGTCATGG TCTCCTGATG GAGAGTATCT CATCGCCGGG TCGACCGATA ACACCGCGAC
AATATGGAAG GCTGCCACCG GTGAGTGTTG CATAACGATA TGGTTGAGAA ATGTTTCTGA
TGGGATTTTT TTTTTTTAAA GGCGAATGTG TGTTTGCACT TCGAGAACAT TTGCACAACG
TGCAAGGTGT CGCTTGGGAC CCTCTGAACG AATACATTGC TACTCAAAGC AGTGACCGTG
CGGTACACGT CAATACGTTT ACCACTCGTA ACGGTATTCC CGATGTCCAC CCTGTCTCTC
GTTCAACACG GATGGAGATC CGTCACTCCC GAACCCCTTC CATCTCCTCG GCGTCTAGAC
CCAGTATGGT TCGTAGAGGA TCCACTACTT CCGAAGCTGG TTCAGTGATT ACTACCGCCT
CTGATTTTCC CGAGGCTGCT TTGCCTCCTC ATGCCCCAGT TTTGGCCGGT GTAAGTGCCA
GCGCTACCCC AGCTACACCT TCAGCATCTG TGCCCTCCAC CCCTCAGGTT GCTCCCGCCC
CGATGAACCC TCCAGCCACT TCCAACCGTC CTTGTTCCAG ACGTTCTTCC TTTTCCGGAT
CACAAGCTGC CGCTTCCCCA GCTCTCAGCG CTGCAGCTTT CAGTCACCTC GCACGCAGTG
CCCGGTCACC TTCTCCTATC CCCCCTTTAC CCGCCATCCG TGCACCTCCA GCCTCGACAA
TCAATCAACG TCTTTATGGT GAAGAGGGTG CGACGAGATT CTTTAGGCGA CTGACATTCT
CTCCTGATGG CTCTTTGCTA CTCACTCCTG CCGGGCAAAT TGAGGATCAA GTGTACAAGG
GATCTCCCCT GCTTACCGCT AAGAATATCT CCCAGGATAC ATCCGACCCA TTATCATCGT
CTGTCCCACG GCCGAAAAAC GTTGAGACGG GCAAGCCGAC AGCATACATC TACTCTCGCG
CCAACCTTTC TCGACCGCCG ATTGCCCATT TACCGGGCCA TAAAACTTCT AGTGTTGCTA
TTCGCTTCTC CCCCGTGTTT TATGACCTCC GCCAGAACGG ACAATTATCT GCCGAGCCAA
AGCATGTCAC TTTCGACAAG AATGATACCC AGCCAGTGCA CGTGAGCTTG AACATGCCCC
CACCTCCCGC TCCTTCAGGT TCAAGGGAAA AGGAAAAGGA AAAGGAGGGA GACAAAGTGT
TGGGAAGTGT GTTTGCTTTA CCGTATAGGC TTTTGTACGC GGTGGCATGC CAGGACTCGG
TCCTACTCTA TGATACACAA CAGGCTGGGC CTATAGCCAT CTTCAAGGGA CTACACTATG
CTGGATTTAC TGATGTCGCT TGGTAAGTCA TCAATGATGA TCGATCTCAT GCGTTACAAA
CTAACATTAC AATGCCATGC AGGTCACCGG ACGGACAATG TCTTTTCCTT TCATCCGCAG
ACGGCTACTG CTCCATCGTC ATCTTTGATC TTGGCGAGCT CGGAACTGTT CACCCTACCC
AACAACATCA CCGCCAACTG CAGGCAATCG CCCAGTCCCA CAACAATGGG ATTTCCACCC
CCCTCCCACC ATCACTTACT CATCGCGACT CTATCCATTC GTCACATTCC CAATCGGGCG
CTTCCGCCAC AGGTCACAGT CCCGCAGTCA GTCATGTGGC AAGACAAAGC CCAGCACCGG
GAGTGGCAAG GAGTGATAGA GAAGGTTCAA CAGCCAGTAG CGTGGTTGGT GCCAGCGGGT
CAGTCTCTGC GTCTTTACTG TCAGTTTCCA ATGTTGGAGG TGGTGCCAAA GCGCCCCCAA
GTTCGGCGAG CTCAGTGACA GTGACGGACC AAGTGCTACC CACCCCGACA CCTAGTGATA
CCGAAGGACC AAGTGCTGCT GGAGTTGATT TGGGTATTGC CGTGAGTCAG GAAGAGGATG
CCAAGAAGCG TGAAGGAGCA GGCGAGACGA CACAAGCCGA GGCCCCAAAG AAGAAGAGGA
GGGTTGCGTT GACGCATTTG GGATCGGAGC AATAATGGAA TTACAAACAA ACAAACAAAC
AAAAAGAAAG GTTAAAAAAT GCCAGGTCTC TAATTATCCA TCACGTCAAA GGTTGCTATT
AAATATCACA TCGAGTTTCG TTCGTTATGT TATG
 
Protein sequence
MRPKVLEIAW HETQAVYSCD FQPLPLPQLK RLLAASTTSE SEEDRDRIEK GSSSAATAAG 
GRQYRLATAG GDSKVRIWMV YPNIPSIPPS TYAALTGQEY TPHPPRVEYL ATLSKHTAPV
NVVRFSPSGQ ILASAGDDGN VILWVPSDRP SVTFGETSDD LPDKEHWRLQ KMLQVTTKHV
YDLSWSPDGE YLIAGSTDNT ATIWKAATGE CVFALREHLH NVQGVAWDPL NEYIATQSSD
RAVHVNTFTT RNGIPDVHPV SRSTRMEIRH SRTPSISSAS RPSMVRRGST TSEAGSVITT
ASDFPEAALP PHAPVLAGVS ASATPATPSA SVPSTPQVAP APMNPPATSN RPCSRRSSFS
GSQAAASPAL SAAAFSHLAR SARSPSPIPP LPAIRAPPAS TINQRLYGEE GATRFFRRLT
FSPDGSLLLT PAGQIEDQVY KGSPLLTAKN ISQDTSDPLS SSVPRPKNVE TGKPTAYIYS
RANLSRPPIA HLPGHKTSSV AIRFSPVFYD LRQNGQLSAE PKHVTFDKND TQPVHVSLNM
PPPPAPSGSR EKEKEKEGDK VLGSVFALPY RLLYAVACQD SVLLYDTQQA GPIAIFKGLH
YAGFTDVAWS PDGQCLFLSS ADGYCSIVIF DLGELGTVHP TQQHHRQLQA IAQSHNNGIS
TPLPPSLTHR DSIHSSHSQS GASATGHSPA VSHVARQSPA PGVARSDREG STASSVVGAS
GSVSASLLSV SNVGGGAKAP PSSASSVTVT DQVLPTPTPS DTEGPSAAGV DLGIAVSQEE
DAKKREGAGE TTQAEAPKKK RRVALTHLGS EQ