Gene Information |
Locus tag | CNF03420 |
Symbol | |
ID | 3258396 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 1007457 |
End bp | 1010241 |
Gene Length | 2785 bp |
Protein Length | 754 aa |
Translation table | |
GC content | 51% |
IMG OID | 638257460 |
Product | enzyme activator, putative |
Protein accession | XP_571612 |
Protein GI | 58268912 |
COG category | [R] General function prediction only |
COG ID | [COG0790] FOG: TPR repeat, SEL1 subfamily |
TIGRFAM ID | |
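The reported gene length can be cross-checked against the start and end coordinates. The sketch below assumes the coordinates are 1-based and inclusive, which is consistent with the values in this record but is not stated by it. The function name `gene_length` is hypothetical and used only for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be greater than or equal to start_bp")
    # Both endpoints count as part of the feature, hence the +1.
    return end_bp - start_bp + 1


# Values copied from the record above.
START_BP = 1007457
END_BP = 1010241
REPORTED_LENGTH_BP = 2785

computed = gene_length(START_BP, END_BP)
print(computed, computed == REPORTED_LENGTH_BP)
```

Under that assumption, the computed length agrees with the "Gene Length" field.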
| |
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGA TGAGACGAGC GGCCGGAGGA GTCCCCGATG CTGCGTCCGG GGCTTTCACC AAACTCCACA ACACATCCCG CACATATCAC TTACATACCG CCATGTCGGC AAACCCCCTG CCCCAGCGTC GCTCCGCTAC GTACGACAGC TACCTGCCGA CCAACTCGAG CAGCCAGAAT CAGATGTATG CAGGGGACGA AAGTGAGACT GCTGGGAGGA GCAGGAGTTC CCGGAATGAC GGTGAGTGTT TTGTCCGTTC TGCGTGTGTA GGTGTGTGTG CGAAAGCCGA GTCACAAAGC AACACTCTTC TAAACTCTGC CTTGCGGCGC TCCTACCATC TTTCAAATTC ACCGTGTAAC ATGCTGACCC GCTACTCAGC CGTGCCAGAA CCAGAAATCG ATCCAAACGA TCCTCCTTTG CCTTCTTACC AGGCTCTGAT GGCTTCAGAG AACGCTGGAA TCCCCCTCTC TCCAACAACA ATCTCAGAGT CCGACCACTC TAATAGATCA CAGCTGCCTC CCGTACTACA TACCCGCACT CCTTCTCTGG AACACGCTCC TTATGCTCCT GTAGATTACG GTCTCCCCAA CTCTGCCTAT CCTGCGAGCC ACCACGGCCA TTCCAACGAA CCACCTCCCA GTCAAAGCAA TTGCTTCTCT GGAAGCTCTC TCGGGCGCCG ACCGGTATCC GATCCCAACA GCATCGATTT CAATCAACTG CAAATCACCC CTAATATCCC CGCTGTGCCC GGAGCCCCTG CAAATCCAGT TGCGCGGCAC AAGACAGCGC CCTCTCACCC ACAACAATAT CATCCCCGCA ACCAGTATCC CAGACAAAGA TCAGTTACAG GGGGTTATCC GACAGATATG GATGGTACAG CAAGTGTCTA CAGCTTAGAC TCTGGGATGG GAGCGTATGG ACAGGGAGCT AGACAAGGCC AGCAGTACCC GACCGGGCAG GCATATGGTG GTTATGCTGC ACCTGTTCAG ACCGACTATT CAAATCCCTA TTATTCGGCT GCGAACGATG TGCTCGGTTT ACCGCCTGTG CCACCCCCAC CTTCCGAATC ATCATATGCG ACATCATCCT TTACGCCAGT CCCCACCCGA CAACGGTCGG TTCCTTCCCT TGCACGATCA AACACAGCAG CCACCAACAT GTCCACCGCC TCTTCTCGAT CAATGCCTCC AGTCCCTGTC ATGCCTCGTG TAACCGAGCA GGTTGGCGCT TCTTCCACCC GCCGACGAGG CACGCAAGCC GCAGTAGACC TGAATAAACC GCCTTATACC AAGCAATACG TGGACGACTA TCGAAAACGT ATGAAGGATG ACCCTGATCC TGAAGCTCAA TTTGCGTTTG CCAAATACCT CATCGAAGCT GCCAAGAAAC TCGGCGACGA GATCAGTCAT TCTGATCCAA AACTGGGTCG CAAATATCGT GACTCTCTTT TGCAAGAATC TTTGCGAAAC ATCAAGAAAC TTGCAGAAGG TAAAGAGCCC TATCCCGATG CTCAGTTCTT CCTCGCCAAC TTGTACGGCA CTGGTCAGCT GGGTTTGTCG GTCGATCACG AGAGGGCATA CTACCTCTAC ATGATGGCGA GCAAACACAA TCATCCTGCC GCAACGTATC GATCCGCAGT ATGCAATGAG ATCGGAGCGG GGACCAGAAA GGATCCAGGA AGGGCGGTGC TGTTTTACAG AAAGGCGGCA GCTTTGGGAG ATACGGCAGC AATGTACAAA CTCGGTATGA TCCTGCTCGG TGGTCTTCTC GCACAGCCAC GGAATATTCG TGAAGCTATT 
GTATGGCTCA GGCGTGCTGC ATCCCAAGCG GACGAAGACA ACCCTCACGC CTTACATGAA CTCGCTCTCT TACATGAACG CCCCAACGGT GTTGGAGGTG TCTTGCCGCA CGACCCCAAC ATGGCGAGGG AATTGTTCAC TCAAGCTGCT CAGCTGAATT ATCCGCCTGC GCAGTTCAAA CTGGGGCAAT GTCATGAGTT TGGACACCTT GGATGTCCAA TTGATCCGAG AAGAAGCATC GCATGGTATA CTCGCGCGGC CGAGAAGGGC GATAGTGAGG CGGAGCTGGC ACTGAGTGGA TGGTATCTTA CGGGTAGTGG TAAGTGATAG TTGTCGCTCC CTCGTAACAG TGGCTAACAG GCTTGTAGAG GGAGTGTTGA AACAGTCGGA TACCGAAGCC TACCTATGGG GACGAAAAGC TGCCAATAAG GGTCTTGCAA AAGCGGAGTA TGCTGTGGGC TGTAAGTCAA TGGTCTCACC TCGCAAGCCA TGCCTAACTT GTTTTGTAGA CTACACTGAG ATTGGTATTG GAGTCAAACA AGATATGGAC CTTGCCAAAC GATGGTACAT GCGCGCAGCG GGTATGTAAT TTTTTTTTAA TTTTTTTTTC CTAATGGCGT TAAAATCTGG ACTTTCCTGA TACGTCCACT GTTTAGCTCA ACAACACAAA CGGGCAATGC AACGTCTCAC TGAACTCAAT AATCAGAAAA ACCCAAAGGG CAAGGGTTCA AGGCCAACAA GGCACGACGC ACAGAGCGAA TGTGTAGTGA TGTGAGGAGG TTTGGAAAAG CAAACCAGGA AGTCAAGAAG AAGCATAGAA AACTGGAAGG ATCAGTTGTA TTATTTGCAT TGGGATATAT GGTGTGAATG CTATATAGGG AAGGGCAAAA ACATTACACG TGTATACATG GGGAATTTTG AAACTGCTGG TCAATGGCTT TATTTATTAA TGTGACCTTT ATATTACCAC ACATCAGAAG TTACC
|
Protein sequence | MKQMRRAAGG VPDAASGAFT KLHNTSRTYH LHTAMSANPL PQRRSATYDS YLPTNSSSQN QMYAGDESET AGRSRSSRND AVPEPEIDPN DPPLPSYQAL MASENAGIPL SPTTISESDH SNRSQLPPVL HTRTPSLEHA PYAPVDYGLP NSAYPASHHG HSNEPPPSQS NCFSGSSLGR RPVSDPNSID FNQLQITPNI PAVPGAPANP VARHKTAPSH PQQYHPRNQY PRQRSVTGGY PTDMDGTASV YSLDSGMGAY GQGARQGQQY PTGQAYGGYA APVQTDYSNP YYSAANDVLG LPPVPPPPSE SSYATSSFTP VPTRQRSVPS LARSNTAATN MSTASSRSMP PVPVMPRVTE QVGASSTRRR GTQAAVDLNK PPYTKQYVDD YRKRMKDDPD PEAQFAFAKY LIEAAKKLGD EISHSDPKLG RKYRDSLLQE SLRNIKKLAE GKEPYPDAQF FLANLYGTGQ LGLSVDHERA YYLYMMASKH NHPAATYRSA VCNEIGAGTR KDPGRAVLFY RKAAALGDTA AMYKLGMILL GGLLAQPRNI REAIVWLRRA ASQADEDNPH ALHELALLHE RPNGVGGVLP HDPNMARELF TQAAQLNYPP AQFKLGQCHE FGHLGCPIDP RRSIAWYTRA AEKGDSEAEL ALSGWYLTGS EGVLKQSDTE AYLWGRKAAN KGLAKAEYAV GYYTEIGIGV KQDMDLAKRW YMRAAAQQHK RAMQRLTELN NQKNPKGKGS RPTRHDAQSE CVVM
|
| |
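The gene sequence above is displayed in space-separated blocks of 10 bases, so the whitespace has to be stripped before the sequence is used elsewhere. A minimal sketch of that cleanup follows, together with a simple GC-fraction calculation. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch assumes the "GC content" field is the share of G and C bases over the full gene sequence, which this record does not state.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C in an already cleaned sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First two 10-base blocks of the gene sequence, used as a short example.
# Paste the full "Gene sequence" field here to check it against the record.
raw_example = "ATGAAACAGA TGAGACGAGC"

seq = clean_sequence(raw_example)
print(len(seq), round(gc_fraction(seq), 2))
```

Applied to the complete field, `len(seq)` can be compared with the reported gene length and `gc_fraction(seq)` with the reported GC content.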