Gene CNL06300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNL06300 
Symbol 
ID3254783 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006681 
Strand
Start bp743907 
End bp744955 
Gene Length1049 bp 
Protein Length191 aa 
Translation table 
GC content57% 
IMG OID638254105 
Productgal4 DNA-binding enhancer protein 2, putative 
Protein accessionXP_568154 
Protein GI58261488 
COG category[K] Transcription 
COG ID[COG1308] Transcription factor homologous to NACalpha-BTF3 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value0.918528 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CCCCACCGCA CACACCGCCA TGTCCATCGA GAACCTCCAC ATCGCTGACG AGACCGAAAT 
CCCCGCCGGC GCCACCGTCG AGCTCCACTC CCGCCCCGAG CGCAAGGCCA GGAAGGCTCT
CGAAGGGCTC GGCCTCAAGC GCGTCCAGGG CATCCAGCGA GTCACCCTCC GACGAGCCCG
CAACGTCCTC CTTGTCGTTT CTAGCCCCGA AGTCTACAAG TCCCCCGGAA GCGACTGCTA
CATCGTCTTT GGAGAGGCCA AGGTGGAGGA CCCCAACAGT GCGGCGCAGT TGCAGGCGCA
GGCTCAGTTG GCTGCCAGTA GCCAGGCCGC CCAGCAGGCT CATGCCCACG GAGGGTTCAA
GGAGGGTGTG CCCAAGTCTT TGGAGGAGTT GATGCAGGAT GCGTAAGTCC CGAACGAGCG
AGCCGAACGT CCCGGACGGA GCGGAGGTGA TGGGTGATGG ATTGGCGGAT GGATGGATGG
ATGGATGGAT GGGGCCGAGC GCAGCGAGAT GTGGTCATCA CTAACGCTTC CTCTTAGGCC
CTCCACCGAC TCTTCCGCCC CTGCCCCCTC TGGCGAGGCT ACCGACGCTT CCGCTTCTGG
CGACTTCAAG GTCTCTGACG AAGAGATTCA ACTCATCGTC GCCCAGACTG GTGTGGACGA
AGCCAAGGCT CGAGAGGCGT ACATCTCTGA AAAGGGTGAC TTGATCAATG CTAGTATGTT
AAAGTTCTTC CGTCCTTCCT TCTTCCTTGC TCCCTCCTTC ATCTTTCATT CTCAAGTTCC
CTACTTCCTA CCTGCTGTAG AAAGAGTTAG CGCTGACCTT TTCTCTTCCC TTGATATCCA
GTCATGAAGC TCCAATAAGC CACATCTCAG CAGCAGTAGA GTCAGTGTCA CGGAGCGGAC
GAGAAGTCAG AGTTGGAGAA AAAGGGGGAA TGGGGGGTTG TAAAAACCGA GGGAGGAGGC
TCTGTGTAAT TTTTGAATAC TCAGCAGGGA GCGGATGGCG AGTAGGAGAG AGAAACAAAA
CAGGCATGAA TCCAAAAAAA TGCTTTGCT
 
Protein sequence
MSIENLHIAD ETEIPAGATV ELHSRPERKA RKALEGLGLK RVQGIQRVTL RRARNVLLVV 
SSPEVYKSPG SDCYIVFGEA KVEDPNSAAQ LQAQAQLAAS SQAAQQAHAH GGFKEGVPKS
LEELMQDAPS TDSSAPAPSG EATDASASGD FKVSDEEIQL IVAQTGVDEA KAREAYISEK
GDLINAIMKL Q