Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA04270 |
Symbol | |
ID | 3253370 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1147503 |
End bp | 1151144 |
Gene Length | 3642 bp |
Protein Length | 895 aa |
Translation table | |
GC content | 51% |
IMG OID | 638252747 |
Product | HMG1, putative |
Protein accession | XP_566774 |
Protein GI | 58258723 |
COG category | [B] Chromatin structure and dynamics |
COG ID | [COG5648] Chromatin-associated proteins containing the HMG domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCACGTCCCA TACTCCGTAC CTGCGCCTGC TGACGCACTA CGCGACTGTC CCCGCCATAG ATCATAGATC CTGTAGCCTG TAATCTCCCC TGATCATCCA CACGGACCTC GCAATCCGCA CCGCTCTTCC GCCACTCCAT CATTCCCGCA TCTGCATCCA CGGCGACCGC GTTGCCCCTT CTCTCCCTCA CCGCCCCTCT TCCGCCGTAG CCAAGCGATC TAATCGCGCA TTATCTCATT ATCAGTACCT ATCATGTGAG TCCTCTATAT ATCACTTGAC GCGCCTTCTT ACGTCGCGTC GCGTATGGCC ATCCTATGTA TGCAGTAACG CTGATGTTTT CTTGATGTAG TGCTGAGCGC GTTGCTGCTC CTCGATTAAT ATGCAAAGCA CCAACGCGCC TGTTAACGGT GGTGTTTCCC AACAGTGGCA GGTAAGCCTT CCCCTTCATA CCTCGGTCTC TACTGTTGAT CTAATTTTGA ATGTAATCGC CGACGTGCTT TTGTTTCCAT TTGGTCTCTT TATTCACTGA TTATGCCTCT TTTTTGTCAT GGCCTAATTC ATCACCCTCT CGACGCAGTC GGCCATGCAG GGCATGTACT CAGGTGCCCA GCAGAGGCCC ACATCCAATT CGAAGCCCTA TAACGCGCCG GATGCTAATG GTATGCCCGC CAAGAAGGAC GATGCGAATC AATCCATGTA CTCTGTCAAC ACAGAAAATG GCTATCCCGC GTATCAGCAA CAAGGGTATG GCACGCTTGG AGGCTTAGGA CAATATGGAT ACAATAGTGC TGGACTGGGG TGGGTGCCAT GTGACTAAAT CTGGAAGTAA CCTTGTTGAC ATGGACGTAC TGCAGCGGCC TGTCAGCAGC ATATGGTGGA GCAACATCAT CAACAGCTGG TGCGACCAAT GGACGCAGAA CATCTGGCCA GAGGATGATG ACCCCGCCTC CTCACCCCTC TTCAGTTGCA ACAAACACCT CTCCATCTGC TCGATATTAC ACTGGTCAAG AAGGTGAGGC AGGCCGTCAT GCTTCGCCCT ATCAGCCCGC TTCCGCACCC CCTCAGATCC TCTCTCATCA GTTCAACGCC ATGACCAATT CACCTCAAAC AGCCCAAGGT TACCCCCAAT ATCAGTACAA TCAACAGTCG CATCAACAAA ATCCAACCCA ATGGGCGGGA TATCCTACGT ACGGTCAGTC ACAGGCCCAA GCTCGTAATA ATATGTCCGG ACGATTGAGC ACGACACCGG CCATCCCACA AAACAGTGCG CTGGGCGACA GTTCAGCTCG ACAGCAGCCT ACCTCGATGT ACGATTACAC CACCCACGCC CTTCAACAAT GGCCCGCCTC TGATCTGTCA AAGGTGCCGA GCGCTCACCA GAACTCATGG CAGAACGCCC AGTCCTTGTC AAGACTGACA CAGGCTCAAT CACCAGCCCT TGGTCAAACG CAACCATCAT CCGCTTCTCA ACCCACTTCA TCGACATCAT CCACACAAAA TGCGTCACCC GCTGTCCCTC AAAGCGGCTG GCAAGGATAC CCCTCTATCC CTAGCATACC TCCGCATACA CTAGGTGGAG GACATTTGAT GAGTGGGCAA CCGTACGGCT GGGGACACAC CCAACAATGG GGTGGTTATT ATGGCGCTCA ACAACCATCC CAGGCGCCTC AAGGGACCCC GGCTCATCGA CCTCCGTTGA ATAACTTTCC AACTGGCAGC ACCAGCTCTG CGCCTGCAGA GTCTCTCCCT ACTGGGCGAA AGAGGATGAG CAAGGATAAA GGCAAAACGG CCAAGGAAAA GGAGGAAAAC AGAAGTCATT TCGAAGATTA TCATGGGTTG GGTAAGCGTT CTGCAGAAGC CGTGGTTGAG GAGATACCTG GGGAAGATAA GAAAGAGAGC AAGAAGAAGA AGGGTAAGGA AGAAAAGGAG AAGCCGGTTC CCAAAGCCAA GTCTCATCTG CATCCACCAA AACAAGCACC CTCCTCGTGG CAACTCTTTT TCGCCGATGA ACTTGCCAAA GCCAAAGCAG CTGAACCCGA GTCAAGATCC CCAGGCGGTA CTTCACATCC ACAAAAACTT AATGTCGCAC AGATAGCTAA AGAGGCTGGC GTGGCATATG CGAACCTCAG TGAGGAAAGG AAAAAATATT ATGCTGAGCG CGTCAAGGAG CATAGAGAGA TTTACGCGAA GGAGTTGGCG GCTTGGCAAG CAACACTAAC TCCTGAGGAC ATTCGTGCTG AGAACGCGTT CAGAGCCCAG CAAAGGAAAG AAGGAAAGTC GCGCAAGGGA AATATCAAAG ATCCGAACGC GCCAAAGAAA CCCCTCAGTG CCTATTTTTT ATTTTTGAAG GCTATTAGAG AGAACAGCGA TATCAGAGCT CAAGTTTGGG GAACCGAGGC TGAGACTACT AAGCAAAGCG TTATGGCGGC TGAGAAGTGG AGAAGTTTAA CTGATGACGA AAAGAGAGTA AGTCGAGTCT GCATGATCAG TCGGCTTCCT ATTCTAACAG GTGCTACAGC CTTACCTCGA ACAAGCTGAA CATGATAAGC AAACCTACGA GACTGCTCGT AAGCAGTACG AGGAAGATTC TGCTGCTCGC GCTCGAGGTG AAGACGTTCC CATTAGGGCT GTGGAAGCAC CCGCTTCGCC GCCTAAGCCT CCTGCATCTA TCTTGAGATC TATCCACGGA GGACAAGCAC CGGTTACCAA GTCATCACCT TCTGTGACAG ATTCTGCACC TCATCCTGCG CCTCATTCAG ACCTCGAACC AGGCATCACC TCTCTGGGCA AGTCACCCTC ACCCAACCCT CCCTCTGAAC CTAGTCTTGC CCAGTTTCAC ACTTCACCTC GCTCTGTCCC TCATTATGAC CCCTCACTCG AGGTGGATGA TTTCCAAGGA TTCTCTGATC CTCTTAACAT GGATCTGTCG GGTTTGGACG GCATAGATGT TGGGTCTATG GAAGTAGGTG CTGGTGGCGA TAGTGAACAA CCGTGGGATG AGCTGCAAAA GCTCATAGGC ACAGAAGATG TGTATAGTTC AGCAAAGCTT CAGCCTGCAG CGCATATCGT ACCTGACGCT TCAACCGCAG CAAGAGTCGC CCCTGCTTCT GGATTTGAAA GCACTGACAT TTCAGTTGCG CCATCCTATG CTGAAGCTAA GAACGAGACA CAGCAAGAGA CTTCCTTGAA TGGGACCCCT GAGGGGAACG GGAATGAGTC AAAGGTACGC CATATTGCAG AGGCAGCGGA GGGCGAGTTG GCTGTGCTTC CAGCTCTAGG CGAAAATACC GCCCAGAAGA ACCAAGGGGG TTTAGAGGTC ATGGTTGGAG CAGGGGAAGC CCGAAGACAT AATAAAAACT TGTCAACTGA ATCAGGACAG GCGGCTGCCG ACGCTCCAGT GGTTGATGGG GTGTAAAATG GGGCTTTGTG GTTTATAATA TAATATGTAG TGAAGGTTTT GCGTGGACCA GTACGTAGAA CAAACACATT AGCTTATTAA TCTTACTTTG AACATAGTCA TATATTCATA TCAGATTCAA CATCCTTGCT ATACGCGAAT AGGAGCATAC ATTGATATAT TGTAGTTTTA TCTTATTAAT CTTGCTTTCC AACACGTAGT ACATCCATCT GTATTATTGT AA
|
Protein sequence | MQSTNAPVNG GVSQQWQSAM QGMYSGAQQR PTSNSKPYNA PDANGMPAKK DDANQSMYSV NTENGYPAYQ QQGYGTLGGL GQYGYNSAGL GGLSAAYGGA TSSTAGATNG RRTSGQRMMT PPPHPSSVAT NTSPSARYYT GQEGEAGRHA SPYQPASAPP QILSHQFNAM TNSPQTAQGY PQYQYNQQSH QQNPTQWAGY PTYARQQPTS MYDYTTHALQ QWPASDLSKV PSAHQNSWQN AQSLSRLTQA QSPALGQTQP SSASQPTSST SSTQNASPAV PQSGWQGYPS IPSIPPHTLG GGHLMSGQPY GWGHTQQWGG YYGAQQPSQA PQGTPAHRPP LNNFPTGSTS SAPAESLPTG RKRMSKDKGK TAKEKEENRS HFEDYHGLGK RSAEAVVEEI PGEDKKESKK KKGKEEKEKP VPKAKSHLHP PKQAPSSWQL FFADELAKAK AAEPESRSPG GTSHPQKLNV AQIAKEAGVA YANLSEERKK YYAERVKEHR EIYAKELAAW QATLTPEDIR AENAFRAQQR KEGKSRKGNI KDPNAPKKPL SAYFLFLKAI RENSDIRAQV WGTEAETTKQ SVMAAEKWRS LTDDEKRPYL EQAEHDKQTY ETARKQYEED SAARARGEDV PIRAVEAPAS PPKPPASILR SIHGGQAPVT KSSPSVTDSA PHPAPHSDLE PGITSLGKSP SPNPPSEPSL AQFHTSPRSV PHYDPSLEVD DFQGFSDPLN MDLSGLDGID VGSMEVGAGG DSEQPWDELQ KLIGTEDVYS SAKLQPAAHI VPDASTAARV APASGFESTD ISVAPSYAEA KNETQQETSL NGTPEGNGNE SKVRHIAEAA EGELAVLPAL GENTAQKNQG GLEVMVGAGE ARRHNKNLST ESGQAAADAP VVDGV
|
| |