Gene CNC04740 details


Gene Information

Locus tag: CNC04740
Symbol:
ID: 3256383
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006685
Strand:
Start bp: 1433294
End bp: 1436498
Gene Length: 3205 bp
Protein Length: 881 aa
Translation table:
GC content: 53%
IMG OID: 638255693
Product: response to drug-related protein, putative
Protein accession: XP_570022
Protein GI: 58265732
COG category:
COG ID:
TIGRFAM ID:
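
The listed gene length matches the start and end coordinates if they are read as 1-based and inclusive. The short Python sketch below checks that arithmetic. The function name `gene_length` is hypothetical, and the comparison with the protein length assumes a complete coding sequence with a single stop codon, which this record does not state explicitly.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record above.
length = gene_length(1433294, 1436498)
print(length)  # 3205

# Assumption: 881 codons plus one stop codon make up the coding part.
coding_bp = 3 * (881 + 1)
print(length - coding_bp)  # 559
```

Under those assumptions, 559 bp of the gene span would not code for protein, which fits a gene flagged as spliced (introns and untranslated flanks).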


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.209179
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTTCCATCCC CCGCATCGCC CGCTCTCCCA CCTCTCGTAT TCCCACACCC GCCTGCCACC 
CACTCGCCGC CAGGTTCAGG TGCGTAAAAT TTTACCCATC GCCGTTTCGC AGCTAACCTG
CACTCCAACA ATACTCAGAG AGCGCGAATC CAAGTCAGAT TTTTTTGCTG CTTCCTGCCT
GCCGTCTATC TACGCATTAC TTTCCGGCCC TAACAAATCG TTCGCATACC TTTCTCCTTG
GACCCAAACT CCCATCGGCC GATAAACACG GTCCGAGAAC TGCCGACCTT TCACCTGCCA
TCTTGGCTTG ACATCGGACA TATATAATAA CATGCCCTCG CGCTGGATCC CAAGCGGCTT
TACTACCCCT TCAGCCCCAT CTTCCCGTCC ATCCTCCCGT GCTTCTTCGC CCACCCGCAA
CGGGCCCGCT CCTCGGCCAG CCAACATGCT TTCTTCCCGA AGGCGAGACG GCACCGTAGA
TTCCGGAGAG CTTGGCGACT ATTTCTCTGT CCCGCCAACG CCAGCGTACG ATGCAAGCGC
GCCTACATCT CCTTCCATGT GGGGCGGCCC AGTGACCCCG AGTGTTATGA ACACGCCGGG
AGGAAGCAGT GGCCACACGC TAGCGATTGT GCTCGATTCT GACCAGCTAG TTTTGCGGGG
CCAAGGAGGC GATATGAACC CAGCTTATCT TTCGGGTAGA GTAGAATTGA ATCTTTTGGA
GGCTACGAAT ATCAAGGAGA TTATGATGAG TTTAACGGGT AAAGCAAAGG TGCAATTCTC
GGACGGTTCT GGGTACGTGA TGAAGCGGGA TGCTTCCCCG TTCAAAGCAG CTGACGATAG
GAACGCAGAA CGTCTTCAAA ACATCGGCAT TACAGCCACC CTATTACTTC ACACGACTGG
TCTTTCCTTC AAGGAGACCG TGGCCATTCT CACACACTCA AAGCAGGTCG TCACACGTTT
CACTTCACTC ATACTCTCGA CGGCAACCTC CCTTCGACCC TCCGAACGTA CTCCGGTGAT
GCCCTTATTG TTTACAAACT ACGAGCAGTC GTAGTGCGCT CTGGGTTCGC CAGCAATTTT
TCGGCCACGA GGGAGTTTAA TCTTTCGAGA ATGTTCACTC CGGAAGCTCT CGAATTCAAT
CAGACGTTGG AAATTGAAAA TACTTGGCCA GGCAAGGTCA TGTACTCTTT GACGTTGCCA
TATAAGGCAT ACGCGGCCGG CGATGAAATA CCGGTCAATA TCAAGTTCAT GCCTCTTGCC
AAGGGCGTCA AAGTGACAGC CGTTGTGTCG GTCTTGAAAG AATATACGTT GGTACACACC
AAACACTCTT CGCATCCTGA TACTCGGGTT GACTCGTGTG TCAAACACGA AATCAGGAGA
GGTCGAGCGG AGGAGATAGC CCGAGAGCCC ATTCGTCCGC CTCAGCATTG GATTGACACT
CATGGCCATT CAAGCAGGAG TAGTGCCAAT CAGTCCAGAC ATCCCAGCCC TTCGCAGACT
CCTGTCGCCA ATACCCGTGC TCGCCTGCCA TTGACGACTT GGGGCGCTCG TCCGGCGGAC
AGTTACTTCC CTGGTCAGAC GTCTGATGCT CCGGAACAAG CACAAGCAGG TCCATCAAAC
AGCAGGGATT CCGCATCCAT TACCGAGTCT ACAGAGACGG ATATTGAGGT TGGCGATGAC
GAGGTCAACA CTTACTTCAG CATCCCTATC CCGGCGTGGG TCACTCCTTC ACACGCTATC
CACCCAGTTT TCATCACCCA CAAGATCAAA TGGTCATGCT CAATCGCTAA TCCAGATGGG
CACGTTTCCG AGCTGCGATG TGCTCTCCCA ATCCTCATTC TCGATCATAG CCTTCTCAGC
GAGGCAAGGG CAGCAGGTGC AAGTACCCGA GGTTTGTTGT TCGGCAATGC ACAACCTGAA
GAGCCTCAGG TCGACCTTCC CAGTTACAGC AATCACGTTT ACGATAGAGT CGCCATCGCA
GACTCCAGTA CGACCTCTGG CTGGATTCCA CGATCCATCA ATCCTACTCC GTTGCATTCT
CCTCACGACG ATACTCCTCC AAGAAGTCGC GCGCCTTCTC GCCCTGGTTC TCCGACCCGT
GGGCGTAGTG GCGAATCTAC CCCTGAGGTA CCCCCTCGTA GACAGTTGTC TCAGTTTGTT
GATTCGGAAC TCCTTTTGTC TCTTGGCGCA CTCCGTACAC ACTCTAACGA CACATCACCT
CACTCAACCC CTCCTGATTC TCGTGCGCCC AGTCGACCTC TAAGTCGGCG TAACAGTCGA
TCAAACCTGA GCAGTCGAGT TTCGAGTCGT CCAGGTAGTC GGGCGGGCAG TCGTGCCTCA
AGTCCAGAGA GGGGTAGCCA ACAGTCATCC ACATCTGGGA GTTACGTGGA AGATGCGCAT
ATGCGTCCGG GATACGAGCG GCATCGATCC TCAGGTCTTC ATGGCCTTTT CCATCTACCT
CTCAAACCCA TTCGGCCCTT TTCCAGCCTC GGTGCCAGCC ATGTCAGTCG TCCAATCTTG
CGGAATGGGA ATCAGTCAAA CACCAATCTA CCCGCTGCCG TTGATGACAT CCCGCGCAAC
TCGTCTGTTC CAGGCAGTCT AAACTCCGCT GGCGCCTCCA ACTCCGGTAT GCAATCTTCG
GCTAGCAGCC AGAACCACGT GTCATTTGCG TCTCATGCGG TTACCTTTGA GCCCGAACGT
TCTTCTATAC GATTTGAGAT TGGAGCACCC GACACTCCAT CCGAAACTGA GGATGACATT
GATCCTCTCA TGCGGGTACC TTCGTACGAC ATCGCATCCC GAGGTTTCTT GGGCGGTGGA
GTGGTCCCGA TAGACACGAG ATTGCCGACA TACGACGCTT CTGAGATGTC GATGCGCAGA
ACAAGGAGTG GGACAGATCT TGGGTCTTCA GGAGGTTTGG TTAGGCCTCG AAGTGATACT
GCGCTGGTAC AGTTGGGTGC GCAAGCGGCA GCAGAGGCGG AAGAAAGAGC GAATGACGAT
GTGGATGATA CCGGGGCGCC GACAGGAGCA TAACGGGCAA GGTGGATGGT GCAGCGGTGT
ATGTAGTAGA ACGGAAAAAC AACAGATCAG CCTCTTATAG TATAATATAA TAATTACTTT
TCCTTTACAT TCAATTATAG TGAACGTCCA TACACATATT TTATCACTTC TCTTTGTCAG
TTCTCATTAC GCGATAGATG CAAGC
 
Protein sequence
MPSRWIPSGF TTPSAPSSRP SSRASSPTRN GPAPRPANML SSRRRDGTVD SGELGDYFSV 
PPTPAYDASA PTSPSMWGGP VTPSVMNTPG GSSGHTLAIV LDSDQLVLRG QGGDMNPAYL
SGRVELNLLE ATNIKEIMMS LTGKAKVQFS DGSGTSSKHR HYSHPITSHD WSFLQGDRGH
SHTLKAGRHT FHFTHTLDGN LPSTLRTYSG DALIVYKLRA VVVRSGFASN FSATREFNLS
RMFTPEALEF NQTLEIENTW PGKVMYSLTL PYKAYAAGDE IPVNIKFMPL AKGVKVTAVV
SVLKEYTLVH TKHSSHPDTR VDSCVKHEIR RGRAEEIARE PIRPPQHWID THGHSSRSSA
NQSRHPSPSQ TPVANTRARL PLTTWGARPA DSYFPGQTSD APEQAQAGPS NSRDSASITE
STETDIEVGD DEVNTYFSIP IPAWVTPSHA IHPVFITHKI KWSCSIANPD GHVSELRCAL
PILILDHSLL SEARAAGAST RGLLFGNAQP EEPQVDLPSY SNHVYDRVAI ADSSTTSGWI
PRSINPTPLH SPHDDTPPRS RAPSRPGSPT RGRSGESTPE VPPRRQLSQF VDSELLLSLG
ALRTHSNDTS PHSTPPDSRA PSRPLSRRNS RSNLSSRVSS RPGSRAGSRA SSPERGSQQS
STSGSYVEDA HMRPGYERHR SSGLHGLFHL PLKPIRPFSS LGASHVSRPI LRNGNQSNTN
LPAAVDDIPR NSSVPGSLNS AGASNSGMQS SASSQNHVSF ASHAVTFEPE RSSIRFEIGA
PDTPSETEDD IDPLMRVPSY DIASRGFLGG GVVPIDTRLP TYDASEMSMR RTRSGTDLGS
SGGLVRPRSD TALVQLGAQA AAEAEERAND DVDDTGAPTG A
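
Both sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before lengths or base composition can be computed. The following Python sketch shows one way to do that. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the final line of each sequence is used as sample input. Applying `gc_fraction` to the full gene sequence would be the way to compare against the GC content listed in the record.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and uppercase."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Final lines of the gene and protein sequences, copied from above.
last_gene_line = "TTCTCATTAC GCGATAGATG CAAGC"
last_protein_line = "SGGLVRPRSD TALVQLGAQA AAEAEERAND DVDDTGAPTG A"

print(len(clean_sequence(last_gene_line)))     # 25
print(len(clean_sequence(last_protein_line)))  # 41
print(gc_fraction(last_gene_line))             # 0.44
```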