Gene CNI00010 details


Gene Information

Locus tag: CNI00010
Symbol:
ID: 3259436
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006694
Strand:
Start bp: 754
End bp: 2253
Gene Length: 1500 bp
Protein Length: 391 aa
Translation table:
GC content: 53%
IMG OID: 638258485
Product: fungal specific transcription factor, putative
Protein accession: XP_572787
Protein GI: 58271262
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCGTCT ATCCAGTGTA ACTTCGTAGA CTGCCCAACT GATGATCATC AGCTGAATGT 
TCCGCAGGTA TTGTTTCTTT CACTGGCCAA CCTTCATTAA TCACGTCAAG GCGCGCCGTC
AGGATAATGA TAGATCCTTC CAAGCATCCG TTTTGGCCGT CTGTGCCGTG GCAGCAGCGC
GTCTGCGCGA TGGAGCATGT AGCGAGCTAC TCAAGGTCTG GTCCCCACCT TCCAGCCAGT
CAGACTCAAG CGGAATCACC GATATAGCGA TAATATCGAA TGTCCACGGT CTGCGAAGCG
CCGCTTTGTC CAGCATATCC TCGCGCGCCT CAGGTAGCGG ACCGGAATTC GACGATCTGA
GAGCAACAGC TTTACTGGCG ATTTTGGGTA TACAGAATGG GGATTTCGAC CAATTGCAGA
TCATGCTCGG GCAGTATCAT GGGATGTCGG CGCAATTGGC GTTTCACGAC GAGTCACAGT
GGCCGCAGGG GCTGGAAAAA TGGGAAGTAC AGGAGAGGAG GTGTTTGGTG CGTATCTATA
TGTCTTGCTT CCGTGGGGAA AATGGTGACG GACGGAGTAG TACTGGTCAA TATATACGCT
CGATGTCTTC GCGTCATCCG TTTGGGGCGG TGTGATCCGT CATCGAAGAT GCCAAAGCTT
CGTCTCATAC CCAGATCACC GACTGGACGA CGCCTTATCT CTTTCACCTA CGTTGGTAGG
CGAGCTGGAT ACCGTGAACT GGATACAGGG GTGGAATACC GTGACCGATT TGTACCGCGC
CATGGAGGAG ATAATGGATG CTGCAGCGTA TCAAAATCAA CGGGCCGCCC CACATAAGCC
TCCAGATCTA CCAAGTCCAT TCGGCCGTCC CCCGCTTTCC GTCAGCGAGT CTTGGCCAGT
CATCAGATAT AGATACGACG CGTTGCCATC GGTGTTCAAG GAGACGAAGC CATTGACGGG
GGACTTTGTC GACAATGTCT GTAGCTTTCA AGCGGCGAAT TTGAGCGTCA CACTACAGGT
CAGTTGATGC CGAAAGTATA CCTTTGCAAT CCGGGCTCAA GTTGCTGACA GAGGAAAGGC
TGTTCGGATG GCAATCGCTT GTTGCCAGCC GATGAGTATG GCAGATCGAT GTCAGATCGC
AGGCGAGCTG CTGGATGTTC TGGCGAATAT ACCCACAGCG TACATGCAGT TGATCAGCGC
GCCATTCGTA CGTCGGTTTG CCTTCATCAG AGCGAGCGCT TCACCTGTTG CTGACCACCT
TCCTTTATAG CTTCATCAGC TATCGGCAGT CGGCAGTCTT TTGGGGAGCG TGGTGCAGGG
TCCTTTGACC ATCAGCACGT ACCTTCATAT CCGCCAGATC CTGTAAGCTG CCCTCCTGTC
TGTCCAACTG CGCTTATTGT CCTAACAAAG CTGACCGAAC AGTTTGTCAT TCGCAAACTT
ACTTGCCTGC CTCGAGACTC CTTTTATCCC CGGAGGAGGG ATCAGCGAGC GGCTTCTTAA
 
Protein sequence
MTVYPVYCFF HWPTFINHVK ARRQDNDRSF QASVLAVCAV AAARLRDGAC SELLKVWSPP 
SSQSDSSGIT DIAIISNVHG LRSAALSSIS SRASGSGPEF DDLRATALLA ILGIQNGDFD
QLQIMLGQYH GMSAQLAFHD ESQWPQGLEK WEVQERRCLY WSIYTLDVFA SSVWGGVIRH
RRCQSFVSYP DHRLDDALSL SPTLVGELDT VNWIQGWNTV TDLYRAMEEI MDAAAYQNQR
AAPHKPPDLP SPFGRPPLSV SESWPVIRYR YDALPSVFKE TKPLTGDFVD NVCSFQAANL
SVTLQYGRSM SDRRRAAGCS GEYTHSVHAV DQRAIPSSAI GSRQSFGERG AGSFDHQHVP
SYPPDPFVIR KLTCLPRDSF YPRRRDQRAA S
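
The numeric fields in this record can be cross-checked against each other with a short script. The sketch below is illustrative only and is not part of the database record: the helper names (`span_length`, `clean`, `gc_percent`) are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates, which is consistent with the stated gene length of 1500 bp.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to display 10-residue blocks."""
    return "".join(seq.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    if not dna:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# Values copied from the Gene Information fields of this record.
gene_length = span_length(754, 2253)

# Coding nucleotides implied by a 391 aa protein, assuming one stop codon.
coding_nt = 391 * 3 + 3

# First displayed line of the gene sequence, used here only to show
# that each full line holds 60 bases once the spacing is removed.
first_line = "ATGACCGTCT ATCCAGTGTA ACTTCGTAGA CTGCCCAACT GATGATCATC AGCTGAATGT"

print(gene_length, coding_nt, len(clean(first_line)))
```

The implied coding length (1176 nt) is shorter than the 1500 bp gene span, which agrees with the record marking the gene as spliced. Passing the full gene sequence to `gc_percent` would allow a comparison with the reported GC content of 53%.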