Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNN00060 |
Symbol | |
ID | 3255392 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | + |
Start bp | 17778 |
End bp | 21590 |
Gene Length | 3813 bp |
Protein Length | 919 aa |
Translation table | |
GC content | 50% |
IMG OID | 638254422 |
Product | RNA polymerase II transcription factor, putative |
Protein accession | XP_568515 |
Protein GI | 58262210 |
COG category | [K] Transcription |
COG ID | [COG5068] Regulator of arginine metabolism and related MADS box-containing transcription factors |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0968509 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTCCTGTTCG CGAGTCACAA CACAGAGGAA ATCATAGGCT CACCACATCC ACCCTATTCC AATTGTCATT GGCTGGCCCT ACGCGGGAGC TGAAGTAGTA ACTCAGATCC GAATCAGCTG AAGGTGCAAC CAAACATCCA ACATCTCCTA TCTCCACTCC TTGCCTTGTT GTATTGCTTT AAGCATCTCA TGCTGACCAC GTGATAGTGC CACAAATATA TCAGTAACGA GACAATCACT GTATCCAACA TCCTCCAGTC TGGCGCAGGG GAACTGTAAC ATCTAGTTCG CTTCTGGTTT CACCTCCCCC GGAGTATCCC ATCCAACCGA CTCGACCAGA AAATGGAGGA TTCCTATTCA CCCCGCATTC GTACGCCTCA TATGAATCAT CTTCACCCTT CTCATCCTGC CCACCAGCAT TCACATTCGC CGAGAACTGT GTCGATAACG ATGGAGGACT ACATTGCGGA CGATAATGAT AGAGGAGATG GATTGGGTAT GGGTGAAATG GGTATGGGAA GTGGTTTTGG AGAAGGAGAT TTCGGAGAAG GAATGGGAAT TGATATGGAT TTTGCCAGTA AGTCAGGTTG CGGTTTACTT GCGGAGCGAA GCGACATGGG GAGGAGACCG TTCTTGGAAC TGGTTGAGAG GCGGATGTCG CATACTCCTG TATGGCGACA AGCGAGAAAC AGGTGTCGTG ATGGAAGAAA TTAGCAAAAA GTACCAGCTA ACTCATCTTT CACAGCGTCC GCTCTTACTC CCACTTTATC ACAACACTCG CTCTCTCGCC CTCCGACTAG TCACAGTCTT GCCGATGCTC CTTCAAGCGG TGGTCTCAAT TCATCCGCTT TACCTTCCAG TGGGGGGATG AGTGGCACTA TGCCATCGAT CGGATACGGC GCTCAGCAAC ATCAGCATCG GCATCCTCAG GCGCAGCAAC AACATCAACA TCACGCTCAT CAGTCTCAGC AGCAACAGCA TCCTTCCCAA CCTCATCCCA CCTCCCAATC AAACCAACAA AGCCACCCAC ACGCGCATCA TCAAGCGTAT TCTCAGCCAC ACGCCCACAC CCATACCGAC CATATGCAGT CCATGCACCC TAATAGTGGA ATGGACATGT TCCAGTCAGA GACGGGTTTA GAAGATGAAG ACGATGAAGA AGGTATGTAT CTATCTCGAA TTTTATTGTG TTGGCTGAAC CTTGACTTTC GGATTACAGA TACTCGAGCC AAACGTCCCC GCCCAAACAC TTCCCAACCC TTCCATGCCT CCCGCGCACC TTCTCATCCT CACTCTAATT CCAATTCCCA TTCTCACGAC CCCGATGCGG AAAACGATAA TGATAACGAC AACGATGTTT CCGACAAGGA CGAACCCCAA CGACGTAAAA TCAAGATTGA ATACATACCT GACAAGTCTC GTCGACATAT CACGTTTTCG AAGCGGAAGG CGGGTATCAT GAAGAAGGCG TATGAGCTGT CGATTTTGAC AGGAACGCAA GTGTTGTTAT TGGTGGTATC TGAGACTGGG CTAGTCTACA CATTCACGAC GAACAAGTTG CAGCCTTTGG TGCAGAAGTC GGAGGGCAAG AATCTTATCC AAGTACGTTC TTTCATCTTT TTCCCTCTTT GTCGCAATAC CTTTTATTGA CATGGTCCTT TTCTTCTCCA ATAGGCATGT CTCAACGCTC CTGATGGTTT CGGTCCCGAC GGCGAACCTG TCGGACCCAT GTCCGCTACG AAGTCGAAGA ATGGTGGCTT GGCGATACGA CCTCACAAGT TACCTGCTGG AGCAAGCGCG GCAATGGCGA AGAGTCAGGC CTTGTCAGCC GAGCAGAATG CCGCCCAGCT TCAAGCTCAT GCCCAAACAC AAGCTCAGGC GGCACAAGCC CATGCCCAAG CTCAAGCGCA GGTGCAAATG CAGGTGTCAG CCGCAGCACA GGCTCAAGCA ACTGCTCAAG TTCAAGCTCA CGCGGCGGCT AAAGCAGCCC AAGCCCAAAG ACAACAAGCT CAGATACAAT CTCATGTCCA GAATCAGAAC TCTTCTCCGA ATCCGAACCA GCCTGAGCTC AGTCGTCGAG AGCAACAACT GAAATCACTC TCAGCGCTCG GTATATCCGC GACACCCCCT ACGACTTTGT CTACTCTTCC TCCATCCTCG GCTTCGATAC CTTCGTTACC TTCTTTACCT GGATCGGGCT CAATGGGTTC TGTAACGCAA GTTCCGCCTA TGACTTCGTC TGGGCAAGGA GGAGGTATGG GGAGGACCCC AACGCCAAGC CAACTGCATA CGCCTACAAT GGGACATGGG CCACTCGGTC AAGGTATTTC AACGCCTGTT ACACCTCGCG CCAACGCCAA TATGGCCGCA AACTCCAACA AAAACAACTT GGCCAATGCC AACCACTCGG TATCCGTATC ATCCCCCGCT AATCGACCAA AGAAGCGGCT CCCTTCCCGC CGACGTCAAG CTTCTACCTC TTCAGCAGTC GGCGAGATGG GCATAAAGTT GGAGCAAGGA ATGGATATAC CGCCCGTGCC GAATTTGCCT GCTGAATATG TAGGATCCGG CCAGGCGAAT GGGTCGTCAC CTGGTGCTAG CGGATCATCT ACTGGTAAAG GCCTCAGCGG AAAAGGAAAG GCCCAAGGTT TGGGTATAGG TATTGGTGTA AGCTCCCCTA GGATCGCGAG CTCTTCCGGT GCAGCGAGCC CCAAACTGGG ATCAGGGGTA GGGGTGATGA CCGGCACAAT TGCGCAGAGG AGAGCGGAAG CGTCCGCGGC AAGGCTGGAG AGGATGGGTG GGGAGATCGA TTCAGAGATG GGCGCTCAAC ATAGTCAGCA TGGTCATCAA CATCAACAAC ACCAGTATCA TCACCAGCAA AATTCGCCTT TATCCCCTAT GTCAGTCTCG ACATTCCATT ATCCTCCTGA ATACCGCCAT CCCTCTTCTC ATGCGCATAG CTACGCCCGC ACTCTTCACC ATTATCCTCA TGCTGAGCAC GGTCATCTTC AACCCCAGGC AGTCTCTGCT ACCCAAGCAC AAGCTCAAGC AGACGTGGAG AATGCCTACT ACTACGGCAC TCGTTCACCA AATTCGCAAT CGCTCTCTGC AGCTGGTCTG GGCGACATGC GCCACATAAG TGACATGAAC GATATGGGAG AGATGGGTGA TTTGCGAGAT ATGAGGGAGA TACGAGATAT CCGCAATATG GGAGATATGG GCGATATGGG TGACATGGGT AATCTGGGTA GTATAAGCGA CATGGGTGAT ATGGGTGATA TGGGCGATAT GGGCCACTTG CGTGAGATGG ATGGCATGGA TATGGAGATA GGAATGTTTG GCTCAGGTGA AGAGAGGGCA GGGAGGGAAG GAGGGAGGTT GATGGGTATT GGGATGTAAA TCGTGGTGTC GAAAGAAGAG AACCGGGCGA ACGCCACGAA GAATCGGTTG TTTCCCCTTT CGTGAGTACT TTAGGGTCCG CTCAATCTTA CTCTGTTCTG CAACATCATC TAGCTAACTG GTCTCCTATG ATAGCATCTT TTTGGCAAGG CTATTCTATC CGTTGGACAA TAACAACTTT CTTCCTTTGC GTCTTTTTTC TTTTTTATCT TTGTGCTCTT TCTTTATATT CTCGCTATCA CTTTAATGTG TTTGACTATC TCAACTTTTC GAGTTCATCT AACTGCAGTT ATCAAGACAG TACCATGTAG AGGTTGTTCG AAAGAGTCCT TGGTGCGGGG AAAGTGGGAA GACAAATCAG TGTATTACTG TTCCTTATGA GTCTCATTAT TTTTATATGA TATGTTTGCT TAA
|
Protein sequence | MEDSYSPRIR TPHMNHLHPS HPAHQHSHSP RTVSITMEDY IADDNDRGDG LGMGEMGMGS GFGEGDFGEG MGIDMDFATS ALTPTLSQHS LSRPPTSHSL ADAPSSGGLN SSALPSSGGM SGTMPSIGYG AQQHQHRHPQ AQQQHQHHAH QSQQQQHPSQ PHPTSQSNQQ SHPHAHHQAY SQPHAHTHTD HMQSMHPNSG MDMFQSETGL EDEDDEEDTR AKRPRPNTSQ PFHASRAPSH PHSNSNSHSH DPDAENDNDN DNDVSDKDEP QRRKIKIEYI PDKSRRHITF SKRKAGIMKK AYELSILTGT QVLLLVVSET GLVYTFTTNK LQPLVQKSEG KNLIQACLNA PDGFGPDGEP VGPMSATKSK NGGLAIRPHK LPAGASAAMA KSQALSAEQN AAQLQAHAQT QAQAAQAHAQ AQAQVQMQVS AAAQAQATAQ VQAHAAAKAA QAQRQQAQIQ SHVQNQNSSP NPNQPELSRR EQQLKSLSAL GISATPPTTL STLPPSSASI PSLPSLPGSG SMGSVTQVPP MTSSGQGGGM GRTPTPSQLH TPTMGHGPLG QGISTPVTPR ANANMAANSN KNNLANANHS VSVSSPANRP KKRLPSRRRQ ASTSSAVGEM GIKLEQGMDI PPVPNLPAEY VGSGQANGSS PGASGSSTGK GLSGKGKAQG LGIGIGVSSP RIASSSGAAS PKLGSGVGVM TGTIAQRRAE ASAARLERMG GEIDSEMGAQ HSQHGHQHQQ HQYHHQQNSP LSPMSVSTFH YPPEYRHPSS HAHSYARTLH HYPHAEHGHL QPQAVSATQA QAQADVENAY YYGTRSPNSQ SLSAAGLGDM RHISDMNDMG EMGDLRDMRE IRDIRNMGDM GDMGDMGNLG SISDMGDMGD MGDMGHLREM DGMDMEIGMF GSGEERAGRE GGRLMGIGM
|
| |