Gene Information |
Locus tag | CNF02050 |
Symbol | |
ID | 3258057 |
Type | CDS |
Is gene spliced | Yes |
Is pseudogene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 592923 |
End bp | 595955 |
Gene Length | 3033 bp |
Protein Length | 861 aa |
Translation table | |
GC content | 49% |
IMG OID | 638257331 |
Product | sulfur metabolite repression control protein, putative |
Protein accession | XP_571327 |
Protein GI | 58268342 |
COG category | [R] General function prediction only |
COG ID | [COG2319] FOG: WD40 repeat |
TIGRFAM ID | |
|
|
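As a consistency check on the coordinates above, the gene length can be recomputed from the start and end positions and compared with the protein length. This is a minimal sketch: it assumes the Start bp and End bp values are 1-based and inclusive, and that the Protein Length excludes the stop codon. The helper names `gene_length` and `coding_length` are illustrative and not part of the database record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Span of the gene on the replicon.

    Assumption: coordinates are 1-based and inclusive on both ends.
    """
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed to encode the protein.

    Assumption: the reported protein length excludes the stop codon,
    so one extra codon (3 nt) is added for it.
    """
    return protein_aa * 3 + 3


length = gene_length(592923, 595955)
cds = coding_length(861)

print(length)        # 3033, matches the reported Gene Length
print(cds)           # 2586
print(length - cds)  # 447 bp not accounted for by coding sequence
```

Under these assumptions, the 447 bp difference between the gene span and the coding length agrees with the record flagging the gene as spliced, since it presumably corresponds to intronic sequence.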
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATTTGCA TCACGGGACC GGAGGTGACC GATGCGGTCT ACCGAGCAGC CAAGCAAGAA GCTGGGGGGC ACTCCTCACT TGCCACCTGC TTTTACTTAA TTTCTTTCTT CTTTGCTCCT TCCTTCCTTC TCAGCCAGGC TAGGTTTCTT CTCATCTTCT TGACTGTTAC CCTATTCACC CACATATGCT CCTTGGAGAC ACGACAAGCT CACGGCGCCG ATAAAAAGAA CTCAAGACTG ACTCACAAAC GTCCGCCACC AGACCTTATC CTTCCTAACA TGTCCTCTTC TTCTGCGCCA CCCAACATGA CAGCTTTTCC TCGCGAGAAT CACTACGAGC TGGAAGATCA AGCTCTTGAT GTAATACCCA CTCGGGCCGG CAGAAAGCTT TGCGTCAGGC ATAAGCAAAT GGCGAATCAA AACGTCAACG AAAAGCTGCA GCGTGTGAGT GCCCACCTCA ATTGCTGCTG TACCTTTACT TACCGTGCAT TAGTCACTTG ACAATTTGAA TCCTTCAGAA CGTGCGGCCA TCACTCAGAT GTGGTCTACA TTTTCCACTG CGCCTCATGG AAAGAGAAAA ATTATTTTGG AAGGCATTTT GACCATGTGC TGCTTGTAAG CATTGAGAAC TGAATGTTTC TCCTTGATCC CGATCCTGAT ACCACTTTTA GCTCCCAGTT ATCCCATCTC TCTGATTCTC TCAACCAAAT CATCCGAATT GACCCCTTCT CTCTCCTTCC GCGCGAGACG TCACTTCGCA TACTTGGATA TTTGGACGCC TTCTCTTTAG GTCGTGCTGC GCAAGTGTCC AGGTCTTGGA AGGCGCTTGC CGACGATGAT CTCCTTTGGC GCAGAATGTG CGGCCAACAT ATTGATCGAA AATGCGATAA ATGTGGTTGG GGTCTTCCTC TACTTGAACG AAAAAGGCTT AGAGTCGAGT TGAAGGATAG AAGTCCTGCC GGCCTTGTTG AGCACGATCA CAAGCACGAA AATGAAAATG GGGAAAGTCG ACTGGTTACA AGGGATCAGG TGCTTTCAGG CAACGCGAAC ACAGTGAGCT CGATTGGTGG TTTGAAGTCT TGTGACACCT CGGCAATGTA CCTCTTCCCT CCGAACGTAA ACGCTACCGC CCCTAAAGGT ATCAAAAGGA CTGCTCCCGA GTCATCTGTA GGGGCAGCAA AGAAGGTCAA GATGAATGAC AGTGATTCTG ACGTGGAGAT CATCAAGCCC GGTGGCAGTA GCTTGACTAG AGAGGTCAGG TTGACTAGGC CATGGAAAAC TGTTTATTGC GAGAGATTGA TGGTGGAGAG GAACTGGAGA AAGGGCAGAT GTAGCACCAA AATTCTGAAG GTAAGTCGGC ATGGCCAATT CGACGCTCCA ATGTCCACAG TATGTATGCT CATAATAAAA TGCAGGGTCA TACCGATGGT GTCATGTGTC TTCAGTATCA CACTGCTCTT ACAAACCCGT CCTATTCTGT TCTCATCACA GGTTCTTACG ACAGGACCGT TCGCGTGTGG AATCTTGATA CGGGTGAAGA AGTTCGCGTC CTTCGAGGTC ATACCCGTGC TGTCCGAGCG CTCCAATTTG ATCAGATGCT TCTCTTCACG GGTGCTATGG ATGGTACGGT CCGTATGTGG AATTGGAGGG CCGGTGAGTG TTTGAGAGTT ATGGATGGGC ATACGGATGG TGTCATCTCT CTCAACTACA ACGGGTATCT TCTTGCGACT GGATCCGCCG ATTCAACGAT AAACGTCTGG AATTTCCGTA CCGGCAATCG CTTCACTCTG CGTGGTCATG 
AAGAATGGGT CAACAATGTC GTACTCTGGG ACGGGAAGAC TTCGCCGTCC GACACTGATC CTGCTGCCAT CCCGAGCTTT ACTCAGGCTG TCAGTAACAG GTGTCAGAAA TCAAAATCCC CAGCTGCTGC TAGCAATGAG CCAACCCTAC CCAATATTGA CCCGGGTGCG ATGCTCTTCT CTTCTTCGGA CGATATGACC ATCAAGCTTT GGGATCTTGA GACTGCCGCT TGTATTCGTA CCTTTGAAGG ACACAAGGCT CAAGTCCAAT CTCTGAGGGT GTTGATGGTG GACATGACGG AGGAAGAAGT CGCAGCCCGA GACCGACGTC AGCGTCGGCA GGCGACTCCT CCCACCACAG GCTTTACCGC TGCCTCGCTA GTCTCCCCCC CAGGCTCTCA GGCGGCGTTT GGCGCCGGTG GTGCCTCCAT CCACGATGCT CCTGCTGGTT TTGACCCGCT CGAGCACCGG GGCCGTTCTC GTTCTGACAC GGTTCAACCG CGAGTTTACG TACATTCCCC TGACGGTACC CACAAGAAGT CTGAACGGGA GCAGTCTCGC GGGCATGAGA AGAAGGCCAT TGTTGCATCT GGCAGTCTTG ACGGCACTGT TAAGATTTGG GATGTTGAGA CTGGTCGAGA GCAGTCAACG TTGTTTGGCC ATATTGAAGG TGTCTGGGCT GTCGACATTG ATGCTCTAAG ATTAGTCTCG GCTTCTCATG ATAGGACAAT CAAGGTTTGG GAAAAAGAAA GCGCACAGTG TGTGCAAACT CTGGTCGGCC ACAGGGGTGC TGTCACCTCG TTACAATTGA GTGATGACAT GATTGTTTCG GGCTCTGGTA AGTATTTCGC GTATATATAT ATTGATAGGA ATGCTGATTG GCTTGTCAGA CGACGGAGAC GTCATGATTT GGAACTTTGC CTCTTCGGCC AACAATGTCT CGAATACGGC AAGTGTTAGC GGACCTTGTG TTGATATCAC TCCATCCCCG ACTCCTGCCA TTGTATAAAC TTGGCCGGCA CGACAAGAAA AAAAGGATTT ACATGACATA ATTAATGACA AAAGTTAAAA AGTTAAAAAG TTGTAATACG AGATTTTTGG GGTAGTTTTT GTACGTACTG CATTAGCATG ATATTTTGGT TGTTCTACAT TAGTTGATTA GCGCGTTTTT TGTATATCTT TATCAATGAT ACCATCCATT TTGCGTGATT TTTTGGGGTT TTCTGGCCGA GTG
|
Protein sequence | MICITGPEVT DAVYRAAKQE AGGHSSLATC FYLISFFFAP SFLLSQARFL LIFLTVTLFT HICSLETRQA HGADKKNSRL THKRPPPDLI LPNMSSSSAP PNMTAFPREN HYELEDQALD VIPTRAGRKL CVRHKQMANQ NVNEKLQRSL DNLNPSERAA ITQMWSTFST APHGKRKIIL EGILTMCCFS QLSHLSDSLN QIIRIDPFSL LPRETSLRIL GYLDAFSLGR AAQVSRSWKA LADDDLLWRR MCGQHIDRKC DKCGWGLPLL ERKRLRVELK DRSPAGLVEH DHKHENENGE SRLVTRDQVL SGNANTVSSI GGLKSCDTSA MYLFPPNVNA TAPKGIKRTA PESSVGAAKK VKMNDSDSDV EIIKPGGSSL TREVRLTRPW KTVYCERLMV ERNWRKGRCS TKILKGHTDG VMCLQYHTAL TNPSYSVLIT GSYDRTVRVW NLDTGEEVRV LRGHTRAVRA LQFDQMLLFT GAMDGTVRMW NWRAGECLRV MDGHTDGVIS LNYNGYLLAT GSADSTINVW NFRTGNRFTL RGHEEWVNNV VLWDGKTSPS DTDPAAIPSF TQAVSNRCQK SKSPAAASNE PTLPNIDPGA MLFSSSDDMT IKLWDLETAA CIRTFEGHKA QVQSLRVLMV DMTEEEVAAR DRRQRRQATP PTTGFTAASL VSPPGSQAAF GAGGASIHDA PAGFDPLEHR GRSRSDTVQP RVYVHSPDGT HKKSEREQSR GHEKKAIVAS GSLDGTVKIW DVETGREQST LFGHIEGVWA VDIDALRLVS ASHDRTIKVW EKESAQCVQT LVGHRGAVTS LQLSDDMIVS GSDDGDVMIW NFASSANNVS NTASVSGPCV DITPSPTPAI V
|
| |
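The sequences above are printed in space-separated blocks of ten residues, so they need cleaning before any downstream use. The sketch below strips the whitespace and computes a GC percentage of the kind reported in the GC content field. The function names `clean_sequence` and `gc_percent` are hypothetical, and the sample is only the first two blocks of the gene sequence, so its result is not expected to equal the 49% reported for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace (block separators, line breaks) and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("sequence is empty")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two 10-base blocks of the gene sequence, as printed in the record.
sample = "ATGATTTGCA TCACGGGACC"
cleaned = clean_sequence(sample)

print(len(cleaned))         # 20
print(gc_percent(cleaned))  # 50.0
```

Applying the same two functions to the full gene sequence would give a value to compare against the reported GC content, bearing in mind that the database may round or compute the figure differently.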