Gene Information |
Locus tag | CNH00600 |
Symbol | |
ID | 3259290 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 1012388 |
End bp | 1015341 |
Gene Length | 2954 bp |
Protein Length | 716 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258424 |
Product | specific RNA polymerase II transcription factor, putative |
Protein accession | XP_572252 |
Protein GI | 58270192 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage Information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage Information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATTACA GTGCAGAGGC AGGGCATCAG AGAAAGCCCA CGTTCAAAAT CTCCCGTCCC CTTTTTCTCC CTCAGACCCT CCTACACGCC TACAGGGTGA GCTTCTGCGT AACTGACCGC CAGTTCCGAA CAGAGTGGCC CCAAATCTTC CCAATATCTC TAGCGGACGC CCAATCCGGC ACGAAGCACC GGCAACGGCG AAGAAACGCT TTTTTTTCTT TGCATCATTG ACCAAACTCC GTCCGAATAC AATCCTAGTT CTATCTCTGT CACTTTATTC TGATTCAAAA AAACTCCTTT CAAAAGCTGG TTTACGACAT GTACCAGCAA CAGACCATGG CCAGCCACCC ATTCTTCAGC CACAACAATC CTTGGGCCCG TTCTGCCATT TGCTGTGATC AGCACCACGG CGAATCGACT TCGGCCGCTA TGGCACGTCT TGCTGGCAAC ATTCCCTTTT CTCATGATGA AAATTTACAC GACCACCATC TCGGATACCA CGACTCGTCT GGATGCACTT CGGATTGCCC TCTCGACACC TACTGCTGCA GTGGAGATTA CTGCTGCGAC AAGCATGGTT CTTGCTCCAG TGGAGATGAG TGTTGCGACG ACCCACGTTG TGAAGAAGCG CACAGACCGG ATAGTCGTGC CAGTCATCAA AGCCATCACA ACCGTTCTAA ACCCGAGCAA AGCCGTAACC ATAACCACCA GCAGCCTATG AGTCTGGAAG AATGGGCTGG AACCCAGGAA GGGTGTAACG CCATACAACA GCTGGTAAGT TGGCCTACAA TTACATGTTT CTACCTTTGT CAGCTTCCCT GTCATTAGCT AATACTTCCA AAGATTGAAT GCTGTAACCA GCCAGACTGC CATATACCGG TCTGCCCTAC GGACAATTCT GAAGTCCATC CACTGCCGGC AGACCCCTTG TCAGCCCTAT TTGCATCACT AGATGCACAG CAGCAGCCTC AACCTATCTC CACTGCCCAG CAGCCTATGG CGCCGGTAAG CTCCGTTGAG GCTTCTCACA CTTGCCACTG GGGTAATTGC CACCTCGTTT TTGGCTCAAT GCCCGACCTT TTGGCACATG TGGCGGCAGA TCACCTTAAC GCAGCGGGTA CGGCGCATCA GTCCGATCAA CTTCTGCAGC AAGCCCAGTC TGCCCAGTCT GCCCAGTCTA CCCCGTTAGC ACTGCTTACT GAGCGCGCGC TGTCTAGTAT TAGTACGAAT ACGACTGGTT TACAGAGTCA TTTGCCGACT AACTCTTCGC TTCAAGCCAC ATCTTTGGCC GTCAATGACG CCCTGCTATC TTGCATGTGG GATGACTGCT TTCCTGTCCC CGAGGTGCCT GCTGCTTCAT CAACGTCTCA CAGCACGTTC CATCATTACA ACTCTGATAA CTGCCAGGCT CCTCATAATC ACCAGCACGA CCATTCTTAC GCTGCTGGAG AGCCCTTTAA CCCTGGGACG ATGCTACGAC ATGTTCTGGA AGAGCATTTG GGTATTCCCC CTGATATCAT TGGCTGGCCG AATGAGGCTG AGCTTCAAGC TCAAGCACAG GCGATCCTTG AGAAGCACCA TCATCATCAC CATATCGACC CTCGCGAGGC CTTAGTGAAC CACTCTGAGA ACTGCAATCA TGTGCATCCT CACCCTCACT CTCATTCTCA TGGGAACAGT GCCGGCACCG GTGCCAATGA CTCACATCCT CATGGCCATG CTCTCGCTCA CTCTCAGTTC CATCCCAATC TTCATCCTCA TCCTCATTCT CTTCCTCATG AGCGCTCGTA TGCTCATTCT 
CATTCCCTCT CTCATTCACG TCCTCTCTCG CACGAACCTC TTCCCACGCC CCCCTCCACG GTCAAGACCG AAGCCTGCAC CTCCCCTGCC GCTTCCAACG ATTCCGTCGC CAGCACAGTC CTCACTGCAT CCCAATCCTC TAAAGATCTG ATCTGTCTCT GGCCCGGGTG CACCATCCAC ACTCCTTTTG CTTCCACTGC TTCCCTCATG GATCATCTGT CCGAAATGCA CATCCCGAAA GGTAAAGATT GTTATACATG CCATTGGGGT GGGTGCGGTG GTGAAGAAGG GAGGGTGTTT AAGAGTAGGC AAAAGGTGTT GAGGCATTTG CAGAGTCATA TAGGACACAA ACCGTTCGTT TGTGGGGTGT GTAATCAGGC TTTCTCGGAA GCGGCGCCTT TGACGGCGCA TATGAGGAGA CATGCGCAAG AAAGTCAGTT AGATCGCGTA TTCCATTTTT TTCTTTTTTT TGGAATGCTG ATAGTGGTTT GTAAGAACCT TTCAAGTGCG AACATCCAGG ATGTGGCAAA TCGTTTGCAA TCTCTTCGTC TTTAACAATC CACATGGTAA GATTTGACCT GCTATAAGCA TCACAGCAAT GGCTAATTTT TAATCAACAG CGCACACATA ACGGTGAAAA GCCATTTGTC TGCCCGTATT GTCAGAAGTA AGTCCGCTCT TTTTAGGAAC AGTCGATATA GGGATTCTAA CGCAACGCTA CAGGGGGTTT GTAGAAGCGT CCAACTTGAC CAAACACGTA AGTCATTCCT CCCCAAAAAA AACCCCGAAA ATGAAACCCA TTGACAAAAC TTCGAACCAG ATCCGAACGC ATACGGGCGA ACGGCCATTT GCGTGCTCTC ATCCTGGATG CGGCAAGAAA TTCTCGCGTC CTGATCAGCT GAAGAGGCAC ATGACTATTC ATAACAAGCC ACCTGGGGAG AAAAGGCGAG GAAGTGGTGT CCCTGCGAAG TAAAAGACTT TTCGTTTCAC GGCAAGTGTA ATGAAACTGA AAAAAGCGCT CGAGGTCGGT GCATAAGCAA AAGCGTGAAA CGGTCGAGAA TCGAGACTTT TTGTGCAATC TGTATTTGTA CCATGATGTT GGTGTACGGC ACTCCTTGAA AACTACTACG TGAAAATAGT CTGCAGAGTA TTAT
|
Protein sequence | MYQQQTMASH PFFSHNNPWA RSAICCDQHH GESTSAAMAR LAGNIPFSHD ENLHDHHLGY HDSSGCTSDC PLDTYCCSGD YCCDKHGSCS SGDECCDDPR CEEAHRPDSR ASHQSHHNRS KPEQSRNHNH QQPMSLEEWA GTQEGCNAIQ QLIECCNQPD CHIPVCPTDN SEVHPLPADP LSALFASLDA QQQPQPISTA QQPMAPVSSV EASHTCHWGN CHLVFGSMPD LLAHVAADHL NAAGTAHQSD QLLQQAQSAQ SAQSTPLALL TERALSSIST NTTGLQSHLP TNSSLQATSL AVNDALLSCM WDDCFPVPEV PAASSTSHST FHHYNSDNCQ APHNHQHDHS YAAGEPFNPG TMLRHVLEEH LGIPPDIIGW PNEAELQAQA QAILEKHHHH HHIDPREALV NHSENCNHVH PHPHSHSHGN SAGTGANDSH PHGHALAHSQ FHPNLHPHPH SLPHERSYAH SHSLSHSRPL SHEPLPTPPS TVKTEACTSP AASNDSVAST VLTASQSSKD LICLWPGCTI HTPFASTASL MDHLSEMHIP KGKDCYTCHW GGCGGEEGRV FKSRQKVLRH LQSHIGHKPF VCGVCNQAFS EAAPLTAHMR RHAQEKPFKC EHPGCGKSFA ISSSLTIHMR THNGEKPFVC PYCQKGFVEA SNLTKHIRTH TGERPFACSH PGCGKKFSRP DQLKRHMTIH NKPPGEKRRG SGVPAK
|
| |