Gene CNF02050 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF02050 
Symbol 
ID3258057 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp592923 
End bp595955 
Gene Length3033 bp 
Protein Length861 aa 
Translation table 
GC content49% 
IMG OID638257331 
Productsulfur metabolite repression control protein, putative 
Protein accessionXP_571327 
Protein GI58268342 
COG category[R] General function prediction only 
COG ID[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATTTGCA TCACGGGACC GGAGGTGACC GATGCGGTCT ACCGAGCAGC CAAGCAAGAA 
GCTGGGGGGC ACTCCTCACT TGCCACCTGC TTTTACTTAA TTTCTTTCTT CTTTGCTCCT
TCCTTCCTTC TCAGCCAGGC TAGGTTTCTT CTCATCTTCT TGACTGTTAC CCTATTCACC
CACATATGCT CCTTGGAGAC ACGACAAGCT CACGGCGCCG ATAAAAAGAA CTCAAGACTG
ACTCACAAAC GTCCGCCACC AGACCTTATC CTTCCTAACA TGTCCTCTTC TTCTGCGCCA
CCCAACATGA CAGCTTTTCC TCGCGAGAAT CACTACGAGC TGGAAGATCA AGCTCTTGAT
GTAATACCCA CTCGGGCCGG CAGAAAGCTT TGCGTCAGGC ATAAGCAAAT GGCGAATCAA
AACGTCAACG AAAAGCTGCA GCGTGTGAGT GCCCACCTCA ATTGCTGCTG TACCTTTACT
TACCGTGCAT TAGTCACTTG ACAATTTGAA TCCTTCAGAA CGTGCGGCCA TCACTCAGAT
GTGGTCTACA TTTTCCACTG CGCCTCATGG AAAGAGAAAA ATTATTTTGG AAGGCATTTT
GACCATGTGC TGCTTGTAAG CATTGAGAAC TGAATGTTTC TCCTTGATCC CGATCCTGAT
ACCACTTTTA GCTCCCAGTT ATCCCATCTC TCTGATTCTC TCAACCAAAT CATCCGAATT
GACCCCTTCT CTCTCCTTCC GCGCGAGACG TCACTTCGCA TACTTGGATA TTTGGACGCC
TTCTCTTTAG GTCGTGCTGC GCAAGTGTCC AGGTCTTGGA AGGCGCTTGC CGACGATGAT
CTCCTTTGGC GCAGAATGTG CGGCCAACAT ATTGATCGAA AATGCGATAA ATGTGGTTGG
GGTCTTCCTC TACTTGAACG AAAAAGGCTT AGAGTCGAGT TGAAGGATAG AAGTCCTGCC
GGCCTTGTTG AGCACGATCA CAAGCACGAA AATGAAAATG GGGAAAGTCG ACTGGTTACA
AGGGATCAGG TGCTTTCAGG CAACGCGAAC ACAGTGAGCT CGATTGGTGG TTTGAAGTCT
TGTGACACCT CGGCAATGTA CCTCTTCCCT CCGAACGTAA ACGCTACCGC CCCTAAAGGT
ATCAAAAGGA CTGCTCCCGA GTCATCTGTA GGGGCAGCAA AGAAGGTCAA GATGAATGAC
AGTGATTCTG ACGTGGAGAT CATCAAGCCC GGTGGCAGTA GCTTGACTAG AGAGGTCAGG
TTGACTAGGC CATGGAAAAC TGTTTATTGC GAGAGATTGA TGGTGGAGAG GAACTGGAGA
AAGGGCAGAT GTAGCACCAA AATTCTGAAG GTAAGTCGGC ATGGCCAATT CGACGCTCCA
ATGTCCACAG TATGTATGCT CATAATAAAA TGCAGGGTCA TACCGATGGT GTCATGTGTC
TTCAGTATCA CACTGCTCTT ACAAACCCGT CCTATTCTGT TCTCATCACA GGTTCTTACG
ACAGGACCGT TCGCGTGTGG AATCTTGATA CGGGTGAAGA AGTTCGCGTC CTTCGAGGTC
ATACCCGTGC TGTCCGAGCG CTCCAATTTG ATCAGATGCT TCTCTTCACG GGTGCTATGG
ATGGTACGGT CCGTATGTGG AATTGGAGGG CCGGTGAGTG TTTGAGAGTT ATGGATGGGC
ATACGGATGG TGTCATCTCT CTCAACTACA ACGGGTATCT TCTTGCGACT GGATCCGCCG
ATTCAACGAT AAACGTCTGG AATTTCCGTA CCGGCAATCG CTTCACTCTG CGTGGTCATG
AAGAATGGGT CAACAATGTC GTACTCTGGG ACGGGAAGAC TTCGCCGTCC GACACTGATC
CTGCTGCCAT CCCGAGCTTT ACTCAGGCTG TCAGTAACAG GTGTCAGAAA TCAAAATCCC
CAGCTGCTGC TAGCAATGAG CCAACCCTAC CCAATATTGA CCCGGGTGCG ATGCTCTTCT
CTTCTTCGGA CGATATGACC ATCAAGCTTT GGGATCTTGA GACTGCCGCT TGTATTCGTA
CCTTTGAAGG ACACAAGGCT CAAGTCCAAT CTCTGAGGGT GTTGATGGTG GACATGACGG
AGGAAGAAGT CGCAGCCCGA GACCGACGTC AGCGTCGGCA GGCGACTCCT CCCACCACAG
GCTTTACCGC TGCCTCGCTA GTCTCCCCCC CAGGCTCTCA GGCGGCGTTT GGCGCCGGTG
GTGCCTCCAT CCACGATGCT CCTGCTGGTT TTGACCCGCT CGAGCACCGG GGCCGTTCTC
GTTCTGACAC GGTTCAACCG CGAGTTTACG TACATTCCCC TGACGGTACC CACAAGAAGT
CTGAACGGGA GCAGTCTCGC GGGCATGAGA AGAAGGCCAT TGTTGCATCT GGCAGTCTTG
ACGGCACTGT TAAGATTTGG GATGTTGAGA CTGGTCGAGA GCAGTCAACG TTGTTTGGCC
ATATTGAAGG TGTCTGGGCT GTCGACATTG ATGCTCTAAG ATTAGTCTCG GCTTCTCATG
ATAGGACAAT CAAGGTTTGG GAAAAAGAAA GCGCACAGTG TGTGCAAACT CTGGTCGGCC
ACAGGGGTGC TGTCACCTCG TTACAATTGA GTGATGACAT GATTGTTTCG GGCTCTGGTA
AGTATTTCGC GTATATATAT ATTGATAGGA ATGCTGATTG GCTTGTCAGA CGACGGAGAC
GTCATGATTT GGAACTTTGC CTCTTCGGCC AACAATGTCT CGAATACGGC AAGTGTTAGC
GGACCTTGTG TTGATATCAC TCCATCCCCG ACTCCTGCCA TTGTATAAAC TTGGCCGGCA
CGACAAGAAA AAAAGGATTT ACATGACATA ATTAATGACA AAAGTTAAAA AGTTAAAAAG
TTGTAATACG AGATTTTTGG GGTAGTTTTT GTACGTACTG CATTAGCATG ATATTTTGGT
TGTTCTACAT TAGTTGATTA GCGCGTTTTT TGTATATCTT TATCAATGAT ACCATCCATT
TTGCGTGATT TTTTGGGGTT TTCTGGCCGA GTG
 
Protein sequence
MICITGPEVT DAVYRAAKQE AGGHSSLATC FYLISFFFAP SFLLSQARFL LIFLTVTLFT 
HICSLETRQA HGADKKNSRL THKRPPPDLI LPNMSSSSAP PNMTAFPREN HYELEDQALD
VIPTRAGRKL CVRHKQMANQ NVNEKLQRSL DNLNPSERAA ITQMWSTFST APHGKRKIIL
EGILTMCCFS QLSHLSDSLN QIIRIDPFSL LPRETSLRIL GYLDAFSLGR AAQVSRSWKA
LADDDLLWRR MCGQHIDRKC DKCGWGLPLL ERKRLRVELK DRSPAGLVEH DHKHENENGE
SRLVTRDQVL SGNANTVSSI GGLKSCDTSA MYLFPPNVNA TAPKGIKRTA PESSVGAAKK
VKMNDSDSDV EIIKPGGSSL TREVRLTRPW KTVYCERLMV ERNWRKGRCS TKILKGHTDG
VMCLQYHTAL TNPSYSVLIT GSYDRTVRVW NLDTGEEVRV LRGHTRAVRA LQFDQMLLFT
GAMDGTVRMW NWRAGECLRV MDGHTDGVIS LNYNGYLLAT GSADSTINVW NFRTGNRFTL
RGHEEWVNNV VLWDGKTSPS DTDPAAIPSF TQAVSNRCQK SKSPAAASNE PTLPNIDPGA
MLFSSSDDMT IKLWDLETAA CIRTFEGHKA QVQSLRVLMV DMTEEEVAAR DRRQRRQATP
PTTGFTAASL VSPPGSQAAF GAGGASIHDA PAGFDPLEHR GRSRSDTVQP RVYVHSPDGT
HKKSEREQSR GHEKKAIVAS GSLDGTVKIW DVETGREQST LFGHIEGVWA VDIDALRLVS
ASHDRTIKVW EKESAQCVQT LVGHRGAVTS LQLSDDMIVS GSDDGDVMIW NFASSANNVS
NTASVSGPCV DITPSPTPAI V