Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC02200 |
Symbol | |
ID | 3256229 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 625520 |
End bp | 628252 |
Gene Length | 2733 bp |
Protein Length | 829 aa |
Translation table | |
GC content | 52% |
IMG OID | 638255441 |
Product | hypothetical protein |
Protein accession | XP_569495 |
Protein GI | 58264678 |
COG category | [B] Chromatin structure and dynamics [K] Transcription |
COG ID | [COG5076] Transcription factor involved in chromatin remodeling, contains bromodomain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAGCCGTCTC CGAACCTCCA CCCACCACGT CCACCGGCCC TCATGGTAAA CTCTCCCCCA CAAGGCCTCC CGCCAATAAA GCTCAAGCTC TCTCTCACAA ACATACACCA ACCCTCCACC CCTACTCCCA CACACAATAA GAAGGGCAAG CGCGCCGCTC CTCCCATAGA CGCCCACGAT GCCAACTCGT CTTCCCTTTT GCCGCGCCCC AAACTCAAGC CCAATGCCGC CGGTCCATCC GCCTTCCGTA TAAGTCTTCC GGCGGTAGGG TCTTCCACTG CGGACCTTTC AACTGTGGCG ACTCCTGTCC CGTCGACTGC ATCTTCCCCT GCCAACCCCA TAACTCCTTC GGGGTCTGCA TCAAAAGCCC AAACAAGTCC CGGTAAGAAG TCATCTATCA AGAAATCTGC AAAACCCTTA GCGTCCAAGA AAAAGACTGG CCCTAGACGG CCGCCGTCTG CCATCCCTCA AAGATTGTTG TCCTCAACTT CCACTACGCC GTCAAAACCC AGTGCTGCTA TTCCCGAGAC ACCCACACAG GTCAAGGCTG AGCCCGAAGA TATATCGCCT GCTTCTTCGT CCATTCCGCC AGCTGAAGAA GTTACTGGTA TATCAACACC ATCGCAGAAC GGCGGTGAAG AGACCCCTGG AACAGGCAAA CGACCGACCA AGTGGATGCG TGTGAAAAAG CCATTGAAAG AGCTTTTGCA GAAGATTATG GTCGAAATTA GAAAAAAGGA TGACTATGCG CTGTTTGAAG AACCTGGTCA GTCCTCTTAA ACCTTGATTT AAAAGTAGGA TTTACGCTGA TGAGCATATA GTCGACCTGG AGGCTTTCCC TGACTATCTT GATGTTATCG GCGGAGAAGA TAATATGATG GATATGGGCA CCATGCAAGC CAAGGTTGAT CGCAACGAGT ACCGAAATAT TGAACAAATT GAGGTGAGTA CCCTACTATG TACTGCTATC AGCTCCTTAC GGGTGACTAC TAACAAGTAA ATTGAATAGG CTGATCTTCG AACACTTGCA AGTGCTGCGC AAAAGTTCAA TCCGCCCGGG TCAGTCCCTC ACAAATCGGC AGGGATCATC CTCGCCCATG GTCTCAAGCA CATTGAACGT TCTCGCCCTC TTGTTTTGAC CCCTCCTTCC AGTCCTCGAG ACTCTGCCAC ACCCGCCAGA GCGACATCTG TCCTTTCTAC CCGCGAGTTG ACAGCCGCTC TGGAAGAGAG GAAGGCAAGG GACGATGTGC CTCCCCATCT TTACATTCCG GAAGAGATGC TTTCTTTCCC ACCCAACAGT GTTATGGCCC GAGCAGTCGG GTGGAATCTC AACGGAGGTA AACGTATGTA CAACAAGCGT ATCTCTCGTG CGCGTGAAAA GTTTGGCGGG AAATGGCGCA ACTGGACGAC AGACGGTAGT CGCGATATCG CAGAAGCGGA TGATATCCAC TTGCTCTTTG ATCCGTGGCG GGTGAGGACA GGTGAAGAAT GGAGAAAGGT CCCTGACTGG CAGGCGCTGA GGAATGAGAC AAACTGGTGG GAACTGGAAA TGCCTCCACC ACCACCCCAG TCAGCGGCTC AGCAAGCTCC CTTACCGTTT AATCCTGGGG AGCCTCGATT GGACAAGGTG CCCCTCCATG ATTTTGTCCC TTACGATTTC GGCCAGTATC CTTCCATATC ATCCGAAATA TCCTTCTTAC GTCAGCGTAT ACCGAATCTA TCTGAAGAAG AAGATATTCT TTCCGAACAC ATCCGGCCAG TTTATCCTCG ATTAAAAAAG GGTGAATCAG CCCCGCCACC TAATCTTGTA AATATATATG ATGGGCCGCT CAGAAGAACA GCAGGCGATT GGGTGAGGGA GATGGCCACT GGGGGCGTGA TAGGGGAGGC ATACATTGAT AGTCTTAATA GGTTCGTCAA GGGAGCGATG AAGGAGGGTG CCGAGAAGGA TTTGTCTGGC GAGCAGGATA CCAAGCCAAA TTTGGGAAGC GAGCAGCTTC CGCTGGACGA ATACGTCATG ACCAGCTACC ACACGCCCTT CCTCCAAACT TCTACCCGAC AAACGATTCA CGACACCCTT GGCCATCTTT CGCCTTTATC TGCACGTCCC GATTACATCC TTCCTTTGGC CAAGGCCGCA TATGCCCGTG TCGCACTTCG TCTGCTCACG GGCCCATCAA ATCCCATGGA TATCAAGCCG CTCTTGCGGG AGGAAGGTGA TTTCATGTAC CAAGGGGTCG GTGGCAAGAG TGGAGTCAAT GTTGGATTGG AGTGGACCGG TGAGGAGCTG AGAAGGCTGG TAGAGAAAAT GAGCGTTCAA AAACAATACC TGGCCGGGAA GCGGAAGAGG GAAGAAGGAA ATGGGGTTGA AGGGGAGACT AAGAGAATCA AGATGGAGGC CGATTCAGCA GCTGCTATCA CCGATAAACC GTCTCTTATT TCCGAGTCGG TTACCTCTCA TCCCCCTCAA CCACGAGACG CAGCTGTATC AGAAGAACCC GAGAAGATTA ACGATGAAGA ACTGAAGCGT CTCAGGTTGG AACTCGTAGC GCTCAGTAAA TTCTACCCTC TTCCAGCGCT GAAGAAAATG AGCAAGGAAG AGGCTGCAAA GTTGCTGCCT GTGAATGTGA GAGGGTTGAT GTGTAGACCT TAAGGCGGTA GATATAGTCG GGGAGCACGT AGTGTTTCTT GGATCGCCCG CCGAGAACTG CAGTTGTAAA ACAGTCATAT GCA
|
Protein sequence | MVNSPPQGLP PIKLKLSLTN IHQPSTPTPT HNKKGKRAAP PIDAHDANSS SLLPRPKLKP NAAGPSAFRI SLPAVGSSTA DLSTVATPVP STASSPANPI TPSGSASKAQ TSPGKKSSIK KSAKPLASKK KTGPRRPPSA IPQRLLSSTS TTPSKPSAAI PETPTQVKAE PEDISPASSS IPPAEEVTGI STPSQNGGEE TPGTGKRPTK WMRVKKPLKE LLQKIMVEIR KKDDYALFEE PVDLEAFPDY LDVIGGEDNM MDMGTMQAKV DRNEYRNIEQ IEADLRTLAS AAQKFNPPGS VPHKSAGIIL AHGLKHIERS RPLVLTPPSS PRDSATPARA TSVLSTRELT AALEERKARD DVPPHLYIPE EMLSFPPNSV MARAVGWNLN GGKRMYNKRI SRAREKFGGK WRNWTTDGSR DIAEADDIHL LFDPWRVRTG EEWRKVPDWQ ALRNETNWWE LEMPPPPPQS AAQQAPLPFN PGEPRLDKVP LHDFVPYDFG QYPSISSEIS FLRQRIPNLS EEEDILSEHI RPVYPRLKKG ESAPPPNLVN IYDGPLRRTA GDWVREMATG GVIGEAYIDS LNRFVKGAMK EGAEKDLSGE QDTKPNLGSE QLPLDEYVMT SYHTPFLQTS TRQTIHDTLG HLSPLSARPD YILPLAKAAY ARVALRLLTG PSNPMDIKPL LREEGDFMYQ GVGGKSGVNV GLEWTGEELR RLVEKMSVQK QYLAGKRKRE EGNGVEGETK RIKMEADSAA AITDKPSLIS ESVTSHPPQP RDAAVSEEPE KINDEELKRL RLELVALSKF YPLPALKKMS KEEAAKLLPV NVRGLMCRP
|
| |