Gene Information |
Locus tag | CNA06970 |
Symbol | |
ID | 3253228 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | - |
Start bp | 1893064 |
End bp | 1896037 |
Gene Length | 2974 bp |
Protein Length | 812 aa |
Translation table | |
GC content | 52% |
IMG OID | 638253019 |
Product | chromatin assembly complex protein, putative |
Protein accession | XP_567014 |
Protein GI | 58259203 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
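The Start bp, End bp, and Gene Length fields above are consistent with 1-based, inclusive coordinates. That convention is an assumption inferred from the listed numbers, not something the record states. A minimal sketch of the arithmetic, with `gene_length` as a hypothetical helper name:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Values copied from the Gene Information table for CNA06970.
length = gene_length(1893064, 1896037)
print(length)  # 2974, matching the Gene Length field
```

The protein is 812 aa, so the coding portion is at most 812 * 3 + 3 = 2439 bp including a stop codon. That is shorter than the 2974 bp gene span, which fits the record's statement that the gene is spliced.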
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0273376 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATCCATATCC TCTTTCCATA CACTCCAACA ATGAGACCCA AGGTCCTTGA GATCGCGTGA GCATCCATCC CTCTCCAACC CCGACCGTAC CCCGCCGAGC TGACCCCGCC ACGTTAACAG CTGGCACGAA ACACAGGCGG TTTACTCGTG CGATTTCCAG CCGCTCCCGC TCCCCCAGTT GAAACGTCTC TTGGCTGCGT CCACAACCAG CGAGAGCGAA GAGGACAGGG ACAGGATCGA AAAGGGCAGC TCTTCGGCAG CTACTGCAGC TGGAGGAAGG CAGTACAGGC TGGCAACTGC CGGTGGTGAT TCCAAAGTGC GGGTACGTCT TCCATCCTTC TTCCCCCCTC CGACTGGTTT TGGGACAACT CACCAAAGGT AATAACCGCT GGGCACGTGT AGATATGGAT GGTTTACCCC AATATCCCTT CCATCCCCCC GTCCACCTAC GCCGCCCTCA CAGGACAAGA ATATACACCA CACCCACCAC GAGTGGAATA CCTTGCGACG TTGTCGAAAC ACACTGCTCC GGTTAACGTC GTCAGGTTCA GTCCCAGCGG ACAGATACTT GCTTCGGCTG GTGATGGTGA GTGCAAAAGT CTCATGCATA TCCCAAAGGC TATAGACTGA GAGGTTCCAC GGTAGACGGA AACGTTATCC TCTGGGTGCC CAGCGATAGA CCAAGCGTGA CTTTTGGAGA GACTTCAGAT GATTTGCCCG ACAAGGAGCA TTGGAGATTA CAAAAGATGC TTCAGTATGT CCCTTGTGTC TCCCTTTATC TGCAATATCA AGCAGAAAAG CTAATCACAG GCCCAGGGTG ACCACAAAGC ATGTATACGA CTTGTCATGG TCTCCTGATG GAGAGTATCT CATCGCCGGG TCGACCGATA ACACCGCGAC AATATGGAAG GCTGCCACCG GTGAGTGTTG CATAACGATA TGGTTGAGAA ATGTTTCTGA TGGGATTTTT TTTTTTTAAA GGCGAATGTG TGTTTGCACT TCGAGAACAT TTGCACAACG TGCAAGGTGT CGCTTGGGAC CCTCTGAACG AATACATTGC TACTCAAAGC AGTGACCGTG CGGTACACGT CAATACGTTT ACCACTCGTA ACGGTATTCC CGATGTCCAC CCTGTCTCTC GTTCAACACG GATGGAGATC CGTCACTCCC GAACCCCTTC CATCTCCTCG GCGTCTAGAC CCAGTATGGT TCGTAGAGGA TCCACTACTT CCGAAGCTGG TTCAGTGATT ACTACCGCCT CTGATTTTCC CGAGGCTGCT TTGCCTCCTC ATGCCCCAGT TTTGGCCGGT GTAAGTGCCA GCGCTACCCC AGCTACACCT TCAGCATCTG TGCCCTCCAC CCCTCAGGTT GCTCCCGCCC CGATGAACCC TCCAGCCACT TCCAACCGTC CTTGTTCCAG ACGTTCTTCC TTTTCCGGAT CACAAGCTGC CGCTTCCCCA GCTCTCAGCG CTGCAGCTTT CAGTCACCTC GCACGCAGTG CCCGGTCACC TTCTCCTATC CCCCCTTTAC CCGCCATCCG TGCACCTCCA GCCTCGACAA TCAATCAACG TCTTTATGGT GAAGAGGGTG CGACGAGATT CTTTAGGCGA CTGACATTCT CTCCTGATGG CTCTTTGCTA CTCACTCCTG CCGGGCAAAT TGAGGATCAA GTGTACAAGG GATCTCCCCT GCTTACCGCT AAGAATATCT CCCAGGATAC ATCCGACCCA TTATCATCGT CTGTCCCACG GCCGAAAAAC GTTGAGACGG GCAAGCCGAC AGCATACATC TACTCTCGCG 
CCAACCTTTC TCGACCGCCG ATTGCCCATT TACCGGGCCA TAAAACTTCT AGTGTTGCTA TTCGCTTCTC CCCCGTGTTT TATGACCTCC GCCAGAACGG ACAATTATCT GCCGAGCCAA AGCATGTCAC TTTCGACAAG AATGATACCC AGCCAGTGCA CGTGAGCTTG AACATGCCCC CACCTCCCGC TCCTTCAGGT TCAAGGGAAA AGGAAAAGGA AAAGGAGGGA GACAAAGTGT TGGGAAGTGT GTTTGCTTTA CCGTATAGGC TTTTGTACGC GGTGGCATGC CAGGACTCGG TCCTACTCTA TGATACACAA CAGGCTGGGC CTATAGCCAT CTTCAAGGGA CTACACTATG CTGGATTTAC TGATGTCGCT TGGTAAGTCA TCAATGATGA TCGATCTCAT GCGTTACAAA CTAACATTAC AATGCCATGC AGGTCACCGG ACGGACAATG TCTTTTCCTT TCATCCGCAG ACGGCTACTG CTCCATCGTC ATCTTTGATC TTGGCGAGCT CGGAACTGTT CACCCTACCC AACAACATCA CCGCCAACTG CAGGCAATCG CCCAGTCCCA CAACAATGGG ATTTCCACCC CCCTCCCACC ATCACTTACT CATCGCGACT CTATCCATTC GTCACATTCC CAATCGGGCG CTTCCGCCAC AGGTCACAGT CCCGCAGTCA GTCATGTGGC AAGACAAAGC CCAGCACCGG GAGTGGCAAG GAGTGATAGA GAAGGTTCAA CAGCCAGTAG CGTGGTTGGT GCCAGCGGGT CAGTCTCTGC GTCTTTACTG TCAGTTTCCA ATGTTGGAGG TGGTGCCAAA GCGCCCCCAA GTTCGGCGAG CTCAGTGACA GTGACGGACC AAGTGCTACC CACCCCGACA CCTAGTGATA CCGAAGGACC AAGTGCTGCT GGAGTTGATT TGGGTATTGC CGTGAGTCAG GAAGAGGATG CCAAGAAGCG TGAAGGAGCA GGCGAGACGA CACAAGCCGA GGCCCCAAAG AAGAAGAGGA GGGTTGCGTT GACGCATTTG GGATCGGAGC AATAATGGAA TTACAAACAA ACAAACAAAC AAAAAGAAAG GTTAAAAAAT GCCAGGTCTC TAATTATCCA TCACGTCAAA GGTTGCTATT AAATATCACA TCGAGTTTCG TTCGTTATGT TATG
|
Protein sequence | MRPKVLEIAW HETQAVYSCD FQPLPLPQLK RLLAASTTSE SEEDRDRIEK GSSSAATAAG GRQYRLATAG GDSKVRIWMV YPNIPSIPPS TYAALTGQEY TPHPPRVEYL ATLSKHTAPV NVVRFSPSGQ ILASAGDDGN VILWVPSDRP SVTFGETSDD LPDKEHWRLQ KMLQVTTKHV YDLSWSPDGE YLIAGSTDNT ATIWKAATGE CVFALREHLH NVQGVAWDPL NEYIATQSSD RAVHVNTFTT RNGIPDVHPV SRSTRMEIRH SRTPSISSAS RPSMVRRGST TSEAGSVITT ASDFPEAALP PHAPVLAGVS ASATPATPSA SVPSTPQVAP APMNPPATSN RPCSRRSSFS GSQAAASPAL SAAAFSHLAR SARSPSPIPP LPAIRAPPAS TINQRLYGEE GATRFFRRLT FSPDGSLLLT PAGQIEDQVY KGSPLLTAKN ISQDTSDPLS SSVPRPKNVE TGKPTAYIYS RANLSRPPIA HLPGHKTSSV AIRFSPVFYD LRQNGQLSAE PKHVTFDKND TQPVHVSLNM PPPPAPSGSR EKEKEKEGDK VLGSVFALPY RLLYAVACQD SVLLYDTQQA GPIAIFKGLH YAGFTDVAWS PDGQCLFLSS ADGYCSIVIF DLGELGTVHP TQQHHRQLQA IAQSHNNGIS TPLPPSLTHR DSIHSSHSQS GASATGHSPA VSHVARQSPA PGVARSDREG STASSVVGAS GSVSASLLSV SNVGGGAKAP PSSASSVTVT DQVLPTPTPS DTEGPSAAGV DLGIAVSQEE DAKKREGAGE TTQAEAPKKK RRVALTHLGS EQ
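The sequences above are displayed in space-separated blocks of 10, so the grouping has to be stripped before counting residues or bases. The sketch below shows one way to do that, along with a simple GC percentage. The helper names `strip_grouping` and `gc_percent` are hypothetical, and the GC formula is the plain (G + C) / length definition, which may differ from how the database derived its 52% figure.

```python
def strip_grouping(seq: str) -> str:
    """Remove spaces and line breaks used to display a sequence in blocks."""
    return "".join(seq.split())


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (grouping is ignored)."""
    bases = strip_grouping(dna).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# First three blocks of the protein sequence in this record.
protein_head = strip_grouping("MRPKVLEIAW HETQAVYSCD FQPLPLPQLK")
print(len(protein_head))  # 30

# First block of the gene sequence in this record.
print(gc_percent("ATCCATATCC"))  # 40.0
```

Applied to the full protein sequence field, `len(strip_grouping(...))` should give the 812 aa listed under Protein Length.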