Gene CNE03230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNE03230 
Symbol 
ID3257898 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006687 
Strand
Start bp917898 
End bp921237 
Gene Length3340 bp 
Protein Length986 aa 
Translation table 
GC content50% 
IMG OID638256906 
Producthypothetical protein 
Protein accessionXP_570880 
Protein GI58267448 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.922942 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TCGCCCTCCG TCCATCTTTC TCCCCCATCC ATGGGCACAA AAGCCCCCGG CCCTGACAAT 
GTTCTCTCCT GCAACCCCCA TAGCAGGCCC TTCAAATCCG TCCCATCTCC GCCCTTCCTT
TTCTTCTCCT GCATTCCGCC GTGGGGGTTC AATCTTTCCT TACGAGTATG GAGCTAGTAA
TCAACCATTT TCTCCTCATT ATTCTGCTCC ATTCACTACC ACAGGGGTAA GGATCAGGTC
AGGTTCCGTT TTCAGTCGAG AGCAGTGGAA AAAAGAGGAG CTAGCTCGCC GACGACAAGA
ATCAAGAGAC AAGCTCAAAT CAAGCTGGGA TTTATTGTTT GAGAAATATC GGGATGTCGA
GGATGATGAC GAGATTGACC TGGCAACAGG AACAATAGTG AAGGATAGGG GCAAGCTGAG
AGCTCTGCAA CAACCCATGT GGTTTGGACA AAAGGAAGGT GATGATGGAG AATCGACGGG
AGGAGGAGGT CATGACTTCG AGTCGGACGA GGATGAGTTG GGAGACTGGG ATGAGAAGGC
AGGTCTGGAC CCTCAACTTC CGGAATGGGA AGAAGTGGAA GGATTCCATC AAGCTTGGAC
GGAGGAGGAT GATGCAGACT TCAGAGAATT CATGCGTGCG GAACAACGGC GAAAATCGAC
CTTTGGATCG GATAACGAGG ACGAAGATTC TCTGTCTGAA CACGACTCGA AGAAACCTGC
AGGCTTTGAA GAATATTTAG ATGTATCTCC AAGATCGCGG GGCACTCAGA TTCTTCCTCT
TCCCACCTTA GATGACCTTT TCGCGTCGGA CAACAAGGCC TCTTCGGAGG ATGAGTTAGA
AGCCATCAGC GATAGTGATG CGGAGGAGAA AGGTGTTCGA GATAATTTAT CAGCTTTATC
AAGCTTGCAT GTGCGTGTGT TCGTAAAGAC ACCATGTTTC CGCAAACTGA CTTCCTATAG
GGCACCCCTA TCGTATCACG GCGACCGAAA CGACGCACCA TCATCGAAGT GGTGATCCCA
CCTCGTCCTC GATCCAAGTC AACCGGTGGC ATGGAGGAAA AACCTTCTGA GCACCATCTC
TTGGATTGCG TTCCCAAGTC TATTTCGACT CCTACTCTTG CAGATCTTTT CACCCCACCT
CCGGCTCGGA TACGTCGTTC ACTAAGTGGT TCTTCAGCGA CAAGTGGCTC TTCAGCATTA
TCCAAGAGCA AAGGAAAAAA GCGCATGCCC GGCGAACGAC CAATGGAGGA TACATCATCT
GGAAATCATT CGATTATTCA ACTCAGGAAA CCGTCAAACT CAACAGAGAC TATCAAAAGG
TATGATGAAA AGCTTTTCAG ATGTGACAGC TGTCGTGCCG CTGGTGGCAC TAGAAAGGAT
CAAGCTCCCT TCTGTCCAGG AAGGACTGAC AGCTGCATTT TTGAGGATTC ATCAGGGTCA
TTCGAGCAGT TTGGTGCGTC TATCTTATGG AGTTCAAAAA ATGACACAAC TGATTATTCT
GTGGGTTAGC ACGAAAATCG GCGACGCATA GTCGACCGTC AGGCAAGTCT TTTGCCGCTG
ATACTCGGTT GGTCGAGGGG GCCGGGCCAA CCAAACAGAG GACATGTAGA CTTTGCCGAG
AAGCAGGTGG GGAGAGAGCG AAGACGGCAG GAGTATGTCT GGGAAGACAT TCATACAGGC
GGTGCAACTG GAGAAAGCGG GCAATCCATT CTTCGACTGC GAACACACAT ACAGTGACAC
CTGTCGATGT CGCGAAGGGG GAGAGCAACC CCTCACAGGT TAATGTGTCA ACTGACACAC
CCCAGCTGAG AACAAAACAT TCGCCTCTTC TCAACATGAA GGTTCCGTCC TTGAGATCAT
CATCTGCGAT TAGAAAACAT AGGCGCCGTG TCATCGAATC CGTCAGTGAT GATGACGACC
CCCGCTTTAC GACAGATACC GCACTACCGC CCCGCGAGCC TTCTAGCAAC TTCATTCGCA
AGCCCCAAGA AACGGCACCG CTCCCGTCAC CCCCTCCGAC GTCGTCTGTC GCGCCTTCAT
CCCCACCCAT TGCCTCCCTT CATAAGCCGT CAAGATCCCC GCTGTTGTCA CTTCCACCTT
CTTCACCTCC TCGTCCCGAT ATCTTTTCCC CTGTTCCCAC TCAAGCTGCA CGTCCTACTC
CTTCGCCATC CGTTTCATTA ACACATCCGA TATCAAGCAA TTATGTCACA GACGCCTACA
AGCAGACCGG GGTTATGTAT CACCCAACTC CGCCCCCGTC TACTGACGGT ATGCGAAGTG
CATCTTTGTC CAGCGACAAT GTTGCTACCT CGCTTCCTCA TAAGAGTGCT CTACGACGTC
CATCAGACAC TCTTGGGCTT CAGTCATCCT CGTCTAGCAT TAAACGTACC CGTTTTTCCC
TCATTCGTTC TCCCATGCGT CACCCATCTT CTGATGAAGA GGGTAGTGAA GACGAACTGG
ATTTACTGTC AAATATTGAT TCATCATCTA TCATAGCCTG TTCATCGTCT CCGAAACACA
GTTCTAGCCC TATTAGGACT GAGTGGAGCG TGAGGGCCGC AGATGTCGGA ATCAAACTAG
GGCCGGAGCA TACAGGCCGA CTGCCGTCAG ATATGGTAAA AACGCTTGTG CCATCAATGG
GCTTGTTTAG GCCGACTTTA GGCTCATCAT CTCAAGCAAG CTCCAAATAT ACGCTTCCCA
CTCCTCCATC GAGCTATAGG CCTTCTCAGC CTCGTCCAGT ATCCGACCCA CAAAATCCAT
CGTCTGGCAG TGGTGGTAAC CCACAGGCAC GGCTCATGCT CCCTCCCCCG CTTCCTGCCA
AACGTTCGAC CCATCCAAAT TCACTGTCTA CTCCTAAAGA ACTCAAGAAT TCAACTAGCA
CATTTTCAAC ATCCTCACCT GCTCCTTCAC ATATTAGATT AACTTCTTTG CCGGCTGTAG
TCGTTCGCGC TCGAGCAAGG TCGCGAAGTC TTTCAATGGC GCCGCCCGGT GCCTTGCGAA
CACCAAAAGC ACAGCGGTCT ATAATGTCAC CTGGCTCTAA GATTCCAAAG ACGGCACCTA
CCAGGAAGGG TAAAGTTTTG ATGGATTTAC AAAGGGTAGC AAAGGAAATT GGTGACGAGG
CTGGGCTCGA GTGGGGTCTT GATGAAGAAA CTGACGATGG CGGGAGAATG TGGAGGGAGG
GCAGTGTTGC TGCGTATAAA TGACCGGCCA GTGACTTTGG ACGCATAGGC AGAAGATGGA
GCGATGAGTG GAAAAAAAGC TCTTCTGAAG AGCTAGCTGG ATTTGAGTAT AGTGTAATAA
TAATAATAAT GTTGTCATAC ATGGTTGTAT AAAAGTACTA
 
Protein sequence
MFSPATPIAG PSNPSHLRPS FSSPAFRRGG SIFPYEYGAS NQPFSPHYSA PFTTTGVRIR 
SGSVFSREQW KKEELARRRQ ESRDKLKSSW DLLFEKYRDV EDDDEIDLAT GTIVKDRGKL
RALQQPMWFG QKEGDDGEST GGGGHDFESD EDELGDWDEK AGLDPQLPEW EEVEGFHQAW
TEEDDADFRE FMRAEQRRKS TFGSDNEDED SLSEHDSKKP AGFEEYLDVS PRSRGTQILP
LPTLDDLFAS DNKASSEDEL EAISDSDAEE KGVRDNLSAL SSLHGTPIVS RRPKRRTIIE
VVIPPRPRSK STGGMEEKPS EHHLLDCVPK SISTPTLADL FTPPPARIRR SLSGSSATSG
SSALSKSKGK KRMPGERPME DTSSGNHSII QLRKPSNSTE TIKRYDEKLF RCDSCRAAGG
TRKDQAPFCP GRTDSCIFED SSGSFEQFAR KSATHSRPSG KSFAADTRLV EGAGPTKQRT
CRLCREAGGE RAKTAGVCLG RHSYRRCNWR KRAIHSSTAN THTVTPVDVA KGESNPSQVN
VSTDTPQLRT KHSPLLNMKV PSLRSSSAIR KHRRRVIESV SDDDDPRFTT DTALPPREPS
SNFIRKPQET APLPSPPPTS SVAPSSPPIA SLHKPSRSPL LSLPPSSPPR PDIFSPVPTQ
AARPTPSPSV SLTHPISSNY VTDAYKQTGV MYHPTPPPST DDTLGLQSSS SSIKRTRFSL
IRSPMRHPSS DEEGSEDELD LLSNIDSSSI IACSSSPKHS SSPIRTEWSV RAADVGIKLG
PEHTGRLPSD MVKTLVPSMG LFRPTLGSSS QASSKYTLPT PPSSYRPSQP RPVSDPQNPS
SGSGGNPQAR LMLPPPLPAK RSTHPNSLST PKELKNSTST FSTSSPAPSH IRLTSLPAVV
VRARARSRSL SMAPPGALRT PKAQRSIMSP GSKIPKTAPT RKGKVLMDLQ RVAKEIGDEA
GLEWGLDEET DDGGRMWREG SVAAYK