Gene CNL04010 details


Gene Information

Locus tag: CNL04010
Symbol:
ID: 3254887
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006681
Strand:
Start bp: 97624
End bp: 98894
Gene Length: 1271 bp
Protein Length: 382 aa
Translation table:
GC content: 51%
IMG OID: 638253873
Product: hypothetical protein
Protein accession: XP_567956
Protein GI: 58261092
COG category:
COG ID:
TIGRFAM ID:
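
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene span runs from the start codon through a single stop codon (the listed gene sequence begins with ATG and ends with TAA). The function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 97624
END_BP = 98894
PROTEIN_LENGTH_AA = 382


def span_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based coordinate interval."""
    return end - start + 1


def coding_length(protein_length_aa: int) -> int:
    """Coding bases for a protein, counting one stop codon (assumption)."""
    return (protein_length_aa + 1) * 3


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)

print(gene_len)            # 1271, matches the Gene Length field
print(cds_len)             # 1149 coding bases, including the stop codon
print(gene_len - cds_len)  # 122 bases not accounted for by coding sequence
```

Under these assumptions, the 122 bp remainder is consistent with the record flagging the gene as spliced: those bases would be intronic rather than coding.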


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.558936
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGACAGTG ACGCCGAGGT TATTCTTTCT ACGGATGAGG TTCTGAAATT ATTTGGGACT 
ATTCAGACAG CTGTTGATGC AACTACTTCT TCTTCTGTTC CATTATTAAA CAAGTGAGCT
TGTCTCGCTT TTAAGTCTAT GCTGCAAGAC AGCTGATAAG CTTCTTTAGA GTCCAAAACA
AAGAGGAGGA TCTTGATTTT GCCAATGGTC TTTCCCTTCT CAATCTCCGC CCTCACCTCC
TCTTGTCATC CCTTCACCAA CTCGTTATCC TGCTTGCTTT ACGGTTGACA TCCTCAACAG
AGGCTGCTCC CGATCCATCC ACTTCCACAG CCCTTTCCAT CCCTTTTCCG AACCCCCGTT
CACGACCGGA ACTCACAGAT GATAATGTGT TGAATGAGAT CGCGGGAGAG TTGGTAATGA
ACCAAGAAGT TATGGACAAA GTGAGAGGGC TGGAGAACAA GTTGGAGTAC CAGATTAAGA
AGTTGATCGG GCTTGCCGAG GCTGAAGATA AGAGGGGCAA GGATGTTGTC GAAGACGTTG
AAGAAGGTGA GTTTGTCTAA TTGTCTGTGC AAGAGCAAAA AAATTAAAAA AAATTAACTT
GCCCATCTAC AGATCCATTA TCATTCCGAC CCAATCCATC TGCTATTACC TCCCGGACAT
CTCCCAAGGC CGCCCGCGGC GGATCCCCTA CTGGTTCAGA CGACGAGAAA TCCGGAGTCT
ACCGTCCCCC GCGTGTCGCT GCTGTCCCCT ATTCCGAAGC TGCTCCCCAG GGCCGCGAAA
GAGAACGCCG TGCTCCCGCA CTCCTGTCCG AGTTCGCCGC CACCATGGAC TCTGCGCCCT
TACTTGAATC TACCTCTGGT CTCTCCGTTC GTCCGGTCAC GTCTGCTGCT GCCAAGTATT
CCAACTCTGT CAGCGCTAAG CGTGCTGCAG AGTTGAAGAG GATCGACGAA TTCGAAGAAG
AGAACATGAC TCGATTGGTA ACCAGCAAGA GGGAAGCCAA GAGAAGAAGA GATGATGAGG
CTGCTTTGGC AATGGGCTTC GGTGTTGGGC CAAGCAGAGG TAGAAGAGGA AGAAATGGTT
TGGAGGCAGA ATTGGAAGGG GTGCTCGGAG ACAGAGGAGA CAAGGGAGTC TGGGATGGTG
TTTCGGGCAA GTTTGGCCAG AGGGGAGATG CTTTGGAGCG AGGGAAGAAG AGGGTCAGCG
GCACTGGCTC TACGAGCGGC AAGGCCAAAA AGGCTAGGTT TGAGAAGGAG CTTGCCAGGA
AGCGCAAGTA A
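
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before it can be measured or compared with the GC content field. The following is a minimal sketch of one way to do that. The helper names are hypothetical, and the example input is only the first three blocks of the listing above.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three ten-base blocks of the gene sequence above.
example = "ATGGACAGTG ACGCCGAGGT\nTATTCTTTCT"
seq = clean_sequence(example)

print(len(seq))                   # 30
print(round(gc_percent(seq), 1))  # 46.7
```

Applied to the full listing, the cleaned sequence should be 1271 bases long, in line with the Gene Length field, and the computed percentage can be compared with the reported GC content.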
 
Protein sequence
MDSDAEVILS TDEVLKLFGT IQTAVDATTS SSVPLLNKVQ NKEEDLDFAN GLSLLNLRPH 
LLLSSLHQLV ILLALRLTSS TEAAPDPSTS TALSIPFPNP RSRPELTDDN VLNEIAGELV
MNQEVMDKVR GLENKLEYQI KKLIGLAEAE DKRGKDVVED VEEDPLSFRP NPSAITSRTS
PKAARGGSPT GSDDEKSGVY RPPRVAAVPY SEAAPQGRER ERRAPALLSE FAATMDSAPL
LESTSGLSVR PVTSAAAKYS NSVSAKRAAE LKRIDEFEEE NMTRLVTSKR EAKRRRDDEA
ALAMGFGVGP SRGRRGRNGL EAELEGVLGD RGDKGVWDGV SGKFGQRGDA LERGKKRVSG
TGSTSGKAKK ARFEKELARK RK