Gene CNB03170 details


Gene Information

Locus tag: CNB03170
Symbol:
ID: 3256031
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006684
Strand:
Start bp: 957525
End bp: 960875
Gene Length: 3351 bp
Protein Length: 934 aa
Translation table:
GC content: 50%
IMG OID: 638254961
Product: expressed protein
Protein accession: XP_569018
Protein GI: 58263216
COG category:
COG ID:
TIGRFAM ID:
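
The reported gene length is consistent with the coordinates when both ends are counted: 960875 - 957525 + 1 = 3351 bp. A 934 aa protein needs only 3 x 934 + 3 = 2805 coding bases (including a stop codon), which fits a spliced gene whose genomic span also covers introns and any untranslated flanking sequence. The Python sketch below reproduces this arithmetic. The function names `span_length` and `min_coding_length` are hypothetical, and the coordinates are assumed to be 1-based and inclusive, which is what the reported length implies.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def min_coding_length(protein_aa: int) -> int:
    """Bases needed to encode a protein: three per residue plus one stop codon."""
    return 3 * protein_aa + 3


gene_len = span_length(957525, 960875)   # values from the record above
coding_len = min_coding_length(934)

print(gene_len)               # 3351
print(coding_len)             # 2805
print(gene_len - coding_len)  # 546 bases not required for coding
```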


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.870613
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ACGCCATGCC CTCCGCCTGT ACAACGCCGC AGAGATGCCC TCGTACTTCC CTTTTACTCC 
ACAGCAGCAG CCCGCAGCCC GCGTCCACAG CCCTCTTCTC GACGATCCCC ATTTAGAACT
TACAGTCACC CCTGCCGCAT CAGCATTCTA CGCAGGAGAG ACCTTCTCTG TTACAATCAC
TTTCCGCAAT ACAAGGACCC CGCATACTGA CGCACCCAAA GCGCCTCTTT CCACGAACTC
CACATCTAGC CTTGCCTCCG CCACCTCTCA GATACGGCAT TTCCCGTCTA CCGACCCGCG
ACACTCACCT TCTGTTCCTC GGCTGCCTCG ACGACTCAAT CAAATAGGAC CTGACCTTCC
CGAGACACCC CTGGTGAGCG CTAGACGACA TGAGAGCGCC GAACTGGGTC CTTCACGTAC
ATCATCGTCC GTCGTCTCCA TTCCTCAGTT CGCCACCGAG GACCTGGGCT ATCCGTATAG
CCCGGGAGCG AATCCAGTTA ATCGAGCTCA TGGGTGGCAG AAATCTCCTT CCCCTCAAAG
GGAGGGACCG ATGAACTATC GAAGTCCAGA TGGATGGGGA AACCAAGAAT CTGAATGTAC
AATAAAGAGC AATGGTCATA CTAGGCGGGC GAGAAGTTTG GCCCTTGGAA AGGGCACGAT
GAGCCCGCAA GAGCTGGTAT GGGCTTTGGG CGGAGGGAAA GACAGTAAGT AACCCCCACA
CTATAAGCAA TTACCAATCC CTACTGACTG CAAATTACAC AGCGCCTCCA GCTCTGCCGA
CGCGCCGCCC CCAAGGCGGG ATTCAAATTC CAGCAAAACA CCCTCATTCT CGTAAAATCT
CAATCGCCAA TACCTCCGCT CTTGAACGTT CCGGGGAACC TTCTAGAGAT TCTCCCGAGG
TTTCATCCCC GCCATTGATT TCTGTACCAG AGCGACCCTC ATCAGTAGCT AAAAACGGCA
GCGCTTCTCT CTCCCAATCT CGACATCCGG CTGATGGCGC AGTAACTATC GAGCCATACG
AGACATGTTC GTCGCCAACG GTAAAACCCT CACATTCACG TGCTCCTTCT TATCAGAATG
CCTATGGTGC CTCTTACATG GGTATTAGTA ATGATGACCT TCCTACCCCT CCCTCACATC
CCCATGTTCG TGAACATGCT CTAACTGAAC CGAAAGGGAC AACGACAGTG CTCTGGGCGT
ATACTCGTCT TGTAGGACAT TTTCACCCTT CGATTGCTTA TATACCTCCA GATCCTCTAC
TGCCGCTTCG AACTGCATTG TTACACCAGC CCGTCGGGTC AGGGTCGCTC TCCACCCCAT
CACAGGACTC GTCAGGAAAT GTCGCTGGCA GACCCTCTGG CTCCTCACGT TGGCAGCTTA
GTTTTGGAAC TGGTACAATC GGCAATTCAA CTCAACCAAG TCTGACAGGT AGTTTATTCG
GATTAGCAAA GGATCTAGTA ACAGGTGGAA GTGGGGGAAG TCTAGAGGAA GAACGGAAAA
GAGTCTGGAA CTTGAAGGAT CTACCGGTGC TAGAGACAAC TAGAAGTCTG CTTGCCGTCG
ATATGAAGCT GAAAGAAGGG GAAACTAAAG AGTGTGAGTG TCCCGGATTT TGTGTCTACT
GTCGCCAACG TTTACATAAT CAGTCAACTA CACGCTACAA CTTCCAACAA ATCTTCCCCC
GGCTCATCGA GGTCGAGCCT TTCGATTTTC TTATGATCTT GTGGTATCCC TCAGCGTGGC
CTTGCCCGGA GGTGATCATC GGCAAAAGTC CAAGGACATC GTCGTGCCAA TCCGGGTATG
GGCCAATGTA TCCCGTACGT GAATTTCTCA ATGAGGGTGC GTGGGCTGAC AAATAAAATC
GCAGTGGGGA ATCCGTTGCG CACATATGAT GTCCTGAAAC CGATCATACA AAATAAGGAC
GAGGGGCACG TCGAAGACGT TGGAAACTCT GTCGACTTTC AATCGCCTTA TGTACGAGAG
GGAAGGGCCC AAATTCCTGT GCAACGGAGA CAGACAAATA TACCAGATCA GCTTCGTATT
AAGTCTGTAG ACACTAGCGA ATCTCTTCAA GCGTACGCAT TGCATCTACT CGAAATTTTG
AATGATGGCG AGATAAACAC CCTCCCGCTT TCACCCAGTA CGCCGAAGTC GCTCCATCGA
CTTCGAACTT CGTCTCCGTC ATCCCCTGCA TTTTGCATCC CGATCCTGCC TCCTGTTCTC
GCTAGTCAAC CCACGGAGCT ACTAGACATA CCCTCTTCTG CAAGCCGTCT GCAAGACCGG
AATATCAGAC TGAAAGAGGA TAACTTTATT GAAGGCGACA ATGGGTTATC GGACGAAATG
GGTGATGCTG AAAGTTGCGG AGAAGCGGTT GAAATCCTCA GTCGACATTC TGCTAAAGGT
AAGTGCTTGA GAGCTATTTG GAAAAGAGCA ATATCTTACA ATAAAGTGAC AGCTTCCTAC
GATATTGAAA AGAATGGAGA GTCTGTGGCG ATCCTTACGC TTGTGAAAAC TACATATAGA
TTGGGGGAAT CCGTACTCGG TATAGTAACA TTCAATAAGC CTCAAACGCC TTTTCCCGTT
CTCAAGTTCT CGGCCTATCT CTTTTCCCAT GAACTTATCC CTGAGCCTCT CCTTCCACCT
TCTCTTTCCT CAAGAGGCTC AGCCCAGCCC CCCCTTTACC GATTACACGC AGAGCATCAT
ACCCTCTACG CCCTGAGTGC CCAGAGGCTA GCCTTTTCGC TTGACATACC GTCAGATGCA
ACGCCAGCGT TCAATTTGGC GGCCGGCGAG GGGGAAAAGG GTGGCTTACA ATGGAAACTG
AAGTTGAATT TCACTGTAGG GGTACCTCCT CGTGATTGGA AAAGAAAGAC AGGCGAGAGT
AATCTAAAGT CTGAGAGTGA CGTCGACACA AAAGGGACAA TGAAGAGCTC AGGAGTCCAT
CTGTTGCCCA TTCAGCGTTC TCGAAACATC GACGAGGGAG ACAACATATT TTATTCTGCT
TCGACGGGTA TCACTCCTAT GATTCCTCGA CACCGCCAAA ACGCGTCGTC AACCAGCGGT
AGCAAAAGTG ATGTGAAAAA CGAAGAAGCT GTTAACTGGT ACGAATCAAG GACCGAGTCT
GTAGAATGTG AAGTGCCCGT CAGAGTTTTG GCAGGCAGTA CGGCTTTCCT TGTCAGGCCC
TTAGTCTACG CAGTCTGAGG CTAAAACGAG ATTATGATAA ACAATAATGA TAGGGGGAAT
TCGGAAGAAG GCTAGATTCG TATTAGCTAT TACCTACATG TGACGACGAG TAGTATTCTA
CTGCCATAAT GAAGCATCAA AATCAGATTC TTATTTATGA ACACTCACTA T
 
Protein sequence
MPSYFPFTPQ QQPAARVHSP LLDDPHLELT VTPAASAFYA GETFSVTITF RNTRTPHTDA 
PKAPLSTNST SSLASATSQI RHFPSTDPRH SPSVPRLPRR LNQIGPDLPE TPLVSARRHE
SAELGPSRTS SSVVSIPQFA TEDLGYPYSP GANPVNRAHG WQKSPSPQRE GPMNYRSPDG
WGNQESECTI KSNGHTRRAR SLALGKGTMS PQELVWALGG GKDTPPALPT RRPQGGIQIP
AKHPHSRKIS IANTSALERS GEPSRDSPEV SSPPLISVPE RPSSVAKNGS ASLSQSRHPA
DGANAYGASY MGISNDDLPT PPSHPHVREH ALTEPKGTTT VLWAYTRLVG HFHPSIAYIP
PDPLLPLRTA LLHQPVGSGS LSTPSQDSSG NVAGRPSGSS RWQLSFGTGT IGNSTQPSLT
GSLFGLAKDL VTGGSGGSLE EERKRVWNLK DLPVLETTRS LLAVDMKLKE GETKEFNYTL
QLPTNLPPAH RGRAFRFSYD LVVSLSVALP GGDHRQKSKD IVVPIRVWAN VSLGNPLRTY
DVLKPIIQNK DEGHVEDVGN SVDFQSPYVR EGRAQIPVQR RQTNIPDQLR IKSVDTSESL
QAYALHLLEI LNDGEINTLP LSPSTPKSLH RLRTSSPSSP AFCIPILPPV LAKDNFIEGD
NGLSDEMGDA ESCGEAVEIL SRHSAKASYD IEKNGESVAI LTLVKTTYRL GESVLGIVTF
NKPQTPFPVL KFSAYLFSHE LIPEPLLPPS LSSRGSAQPP LYRLHAEHHT LYALSAQRLA
FSLDIPSDAT PAFNLAAGEG EKGGLQWKLK LNFTVGVPPR DWKRKTGESN LKSESDVDTK
GTMKSSGVHL LPIQRSRNID EGDNIFYSAS TGITPMIPRH RQNASSTSGS KSDVKNEEAV
NWYESRTESV ECEVPVRVLA GSTAFLVRPL VYAV
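
Both sequences above are printed in space-separated blocks of ten characters, so the whitespace has to be removed before any length or composition check. The Python sketch below shows one way to do that and to compute GC content. The function names `clean_sequence` and `gc_percent` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (spaces and line breaks) and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an already cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ACGCCATGCC CTCCGCCTGT ACAACGCCGC AGAGATGCCC TCGTACTTCC CTTTTACTCC"
seq = clean_sequence(first_line)

print(len(seq))         # 60
print(gc_percent(seq))  # GC percentage of this 60-base fragment only
```

Applied to the complete blocks, the cleaned lengths should match the Gene Length (3351 bp) and Protein Length (934 aa) fields, and the gene-wide GC percentage can be compared with the reported 50%.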