Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNA06070 |
Symbol | |
ID | 3253738 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1626577 |
End bp | 1628617 |
Gene Length | 2041 bp |
Protein Length | 623 aa |
Translation table | |
GC content | 50% |
IMG OID | 638252928 |
Product | specific transcriptional repressor, putative |
Protein accession | XP_566961 |
Protein GI | 58259097 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTCAGC TCCAAGCCCA AGAGAAATCT ATTTCGACGT CCCCAGAGAA GACTGACCAA ATCCTCTTAC ATAATTACGC GGAGACATTA TCGGCTGAAG ACTCAATTGA CCCCTTACTG AAGCACCTCA CTGACCCATT CAACGTCGAA CAGGCTGATT CTCCTCAGAG TAGAACTCAA GATGCTGTTC AAGGAAGTGC AGAAACCGCA AATGACTTTT CGTGGCTACC AGACTTCACA AATACTGCCA GCCAAGCCAA TACCGTTGCA GCTCACCTCT GGCCTCTAAA CACTACACTC CCTTCCCTTC CTTCTACTCA GCCGCCTTCC TCGGAAGCCG AGTCCCCAGC TTCTTCCTTT ACCGGTGGAC CGCATGCGGG AGTTGAATGG CCAAATAATT TCGATAACAG GAACATCGAA TCGATACACA GCACTGGCGC TAGCGATCAT GATGGCATTG CCGGGGTTGT TGGAATAAGC TCAACAAGTT CAGAGAGTGA ACTAGAAGCT GGAGGAGGAA AAGATGACGT GGTGCCGCCT CCTCCGAAGA AGAAAAGTCA TGCAAGGAAG GTATGTCGAC TGTACTTATT CAACGCAACC AGCCTGATAG TATTATTAGC AACCGGAAGG TCACATTAAA AGGGCCAGAA ACGCCTTCAT TTTGTTTAGA AAGCACATCA CCGATTCGAA CTTGATCCCT CCCAGTGTTG AGGTTAAACA CCAAAACATC TCAGTGGTTG TAAGTGATTG AAACTTTATT TGGGTTTTCA TCCTTATATA TGTGCGGGTC CTGTAGGCAG CCAAGATGTG GAAGGAGGCG CCTCTGGAAG TGCGTCAAAA GTTCCAGGAA CAAGCTCGCA TTGAGAAAGA GGAACACCAG CGCAAATATC CTGGGTACCG TTATCAGCCG GTCTTTCGCC GAACAGACAT TATCCGTCGT CGAGTGCGCA AAGATCCAGC CGAAGACGAG AAGGTAGATG CCGTGGCGGA AGCTCTTATC AAAGGCAAAG CAGGAAATGA GCTAGAGAAG GAAATTAAAG AGCAGCTGGT TACGCGAAGT GAGGCAAGCG AGAGTGACGG CGAAAGCTCC AGGGGAAGCA GAAGGTAGTT TAGTTATACA CAAACTACAA GAGTAAGAAC TAATAATCCT TCTGCTTTGC CTAATAGACG CCGCCGGGAA ACAGGACAGC TTTCTAAGGG TGCAATTAGA GCCCAACGGG CCCAAGCTAG AGCCAAACAG ATGAGGCAAA ATCTACTGGG ATCCAATCTC TTGAGCATGT CTCTCTACAA TGCTGCCAAC GCCCGTCTAG CTTCTTCCGC TGCTGCGACT GCTGCAGCTG AGAATGACTA CCGGTATCAT TCTGGTATCG ATGCCATGGG AACGACGGTT GGGCACGGCC ATACTCATCC AGGGATGCAA TACGCCTTGG ATTCTTACAT TCAGCTAGGA TATGGTCTCG ATAACAGGCC TGTACAAGTT GCAGGGTACT CCGGGGAGAT GTATGGGGCC CCTTCTACGG CATCAGGCCA CGGAAGTGAT ATATACCGAC TTCCTCCCAT TAACGGTATG ATGGCTGTCG AAAACGGGTA CGAATGGCAT GCTCCAAGTA TGGAGTATTG GGATCAGACA AACGTAGATC CGGAGGCGGC CCAACGGCAA GCGGGATATC CTTTTGAAGG GGAGTACTAC GCCACTCACT ATGATTTGGA GGGAGGAGTT CCAGAGCAAA CGGAGTATCA CCTTTCCTCT CTTATGGAGA CTTCTCACGT AGGTAACGGG TCGCTTCCCG AGCGAGCGCC GGGAGACATT ATCGCTGGCG CCTATGTTAC TCACGAGAAG CAGGAAGACA AGGAGCTGCC TAGTATGCGA CAGTGGGCTG GCGAAGGACA GCAGACTCCT TCTGGACATG TGATGTTCAA CGAGAGATTA TTTGATGGTG CATTAGGGAG TGCAGGTCTT CCTGACAAGG CTGAAGAACC CGACGCTTTA GCCATGTTCG ATCAAGCTAT GGAGCAAGCA GGCGAGATTG CACCTTGGTA A
|
Protein sequence | MIQLQAQEKS ISTSPEKTDQ ILLHNYAETL SAEDSIDPLL KHLTDPFNVE QADSPQSRTQ DAVQGSAETA NDFSWLPDFT NTASQANTVA AHLWPLNTTL PSLPSTQPPS SEAESPASSF TGGPHAGVEW PNNFDNRNIE SIHSTGASDH DGIAGVVGIS STSSESELEA GGGKDDVVPP PPKKKSHARK QPEGHIKRAR NAFILFRKHI TDSNLIPPSV EVKHQNISVV AAKMWKEAPL EVRQKFQEQA RIEKEEHQRK YPGYRYQPVF RRTDIIRRRV RKDPAEDEKV DAVAEALIKG KAGNELEKEI KEQLVTRSEA SESDGESSRG SRRRRRETGQ LSKGAIRAQR AQARAKQMRQ NLLGSNLLSM SLYNAANARL ASSAAATAAA ENDYRYHSGI DAMGTTVGHG HTHPGMQYAL DSYIQLGYGL DNRPVQVAGY SGEMYGAPST ASGHGSDIYR LPPINGMMAV ENGYEWHAPS MEYWDQTNVD PEAAQRQAGY PFEGEYYATH YDLEGGVPEQ TEYHLSSLME TSHVGNGSLP ERAPGDIIAG AYVTHEKQED KELPSMRQWA GEGQQTPSGH VMFNERLFDG ALGSAGLPDK AEEPDALAMF DQAMEQAGEI APW
|
| |