Gene CNF01930 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNF01930 
Symbol 
ID3258260 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006691 
Strand
Start bp568883 
End bp571977 
Gene Length3095 bp 
Protein Length907 aa 
Translation table 
GC content50% 
IMG OID638257318 
Productconserved hypothetical protein 
Protein accessionXP_571685 
Protein GI58269058 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.386056 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCATTAT CAACGTCAAC CTTACTCGAG CGTCGCCGAT CATCACTTTC TCCCCCTTCC 
TCTCCCCCAG CCGAACGAAG CAAAACACAG CCCTCAGTCA CTCCCGTCCC TTCTGCAACT
CTTTCAGTTT ATCCCTCCCC CTCATCCATG AGCCTTCCCA CTGTCCCTCT CAAGGACGTT
CCTGCTGATC AGTCTCAAGA AGGAGGATTA CAACCTTCAT CTCAACGCTA TCACCCCCCT
CCTCTTCTCC CAAGGAGAGA GTCATTCCAT TGCTCTAATA ACAGACCCCA TCCACCGCCT
TTTCGTACTG CGCTGAGTTC ATTAGGGCCG ATGACGACCT CCAATCTCAT ACTTGGTACT
CCCACAGAAG AGAAGGCAAG CCCAAAATTC AGTATTAAAA GGGGTAAAGA AGAAAAAGTG
GACAAAGAAG AGTTGACCAC CAAAGGAAGA AAAAGAAAGA GATTAGCCAA GGCGTGTAGT
GCCTGTCATG TAAGTTGTTT TTCTATAGTG GTCAACCGTA TTCAGCTAAA TTAGAATCTA
TCATTAGAAA AACAAGAGAA GGTGTGATGG TTTTGCCCCC TGTTCCAACT GCGAGTTTTC
CAATAGACCA TGTCAATATC TCAATGCGCA AGGGGAGCCC ATTCCGCCAC CACGGACCCG
CGACCCTTCC AACAACACCT CGAGCAAGGG CAAAGACGAC TGCAAAGCTA GCAGTGCCGA
TGGGAGTGAC ATCGCCAGTC AGGAGGATAG GCGAGAAAGT GGGGAGAGCA ACCAGTCGCT
GCGAGATGGA CAAATGGACC CTGAATCCGA AGTGGACAGA AAGCCATCCA TTGGTCCACT
CCAGGTTGTA GACATGGATG TCTCACTGGG CGCTGAGCTT GTGGACAGTG AGTGAGGCTG
AACAAGTCAA GTGTTTATAT ACTGATTCTG AATTATTAGT CTTCTTCAAG CGTTGCTTGC
CTTTACCATT CATGCTACAC GCGCCAACTT TCAATTACCG CCTTTATCTC AACCAGGTCT
CGCCCATCTT GCTCGATTCC ATGTACGCGT TTGCCGCCCG ACTATGCGAA AACCCGTTTT
TTCTACAAAC ATTTCCCCCA AATCACCCCC CCCATTTGCG GGGTGAACTC TTCGCACTTC
GTGCTCATCG TAGCGCAGAA AACTTGATCC AGCAGCGCAA CATATGGAGT GAAGAGACTC
GCCGGGCCGA CCGAGGGTCA TGGCAAGAGA CGGAGTTAGC TCAAGCAGCC TATCTGTTGA
GCGTCTACTT CACTTGTCTC CGTGAACCTA AGCTTGGTCT TTTCTATCTT GATGCTGGGG
TCGATATTCT TCGTCCATCA CCCGCGACTT ATATACAACC GCCAGCCGCG CCGACAGGAG
CGAGTCCTAT AGAGTATACA ACTCACATGG AGTGCCGAAC CCGCACATTC TGGGCCTTTG
TATTGCATGA CCTGTGCGCG GCCTCCAATG GTAGGCCAAG GAAACTTGGA GAGGTAGATT
TGGGAGCAAT TCCTTTACCA GGAACAGAAG CCCATTGGGC CAGATGGGGT GGTGGAGGCA
TCGGGGGGAG AGAGCCCGGC AGAAGGGATG GCTTGATCGC GGGCACGGGG AATTGGCTCG
GCGAAGAGGG AGCTGTAGGG GAGATAGGTA ACGTCATTCG GATTGTAAGT ACATCGTGAT
GGGAAACGCA TAGATACTCA CTTCATCTCT TAGCTTTCCA TATTGGCAGA CATTATGTCT
CTTGCGACTG ATCCTAATGC TGGTGACTCC AAACAGACTC TTGCTGCCAG ACTTGAGGCC
GCTCTCAAGG CCTGGGCGAT GGCCTTACCC TCGCACATGC ACTTCAACGA GCCAAACCTG
ACCATGGCCG TCTCCAAACT GTCATCTCCA GTGGCCGAAA TTAAGACGTC AGGATGGATG
TATGCCTACA TGCATGCTGT CGCCGAGTGT GGGATGTTTT ACCTTCAGGC AGCTGTGGCA
CCGGTCAGCG ACGGCGTGTT CACGGCTAGG AGGCAAAGTC AAGCAATTGA AAATCTAATC
GTCATCATGG ACGCCATCAA TCAAACAGGC CGCGAAGGGT TTAGCTGTAA GTCTCCGTTT
CATATGCAGA GGTTGCGATG ACTTAATTCC GGATTACAGT CTTATTCCCC CTACTCGTCA
TTTCTAACTG GCAGGAGCAC CTTGAGAAAT CGGACCTTCT TGTCAGAGAC GTAAAACATC
ATCTGACTGA GGAGCGTCTC AACCATTGGT GGTCGGAAAT GGCTCGGGAA TGGGGTGTCG
AACGACATGA CGTTCTCAGG CGTGGATTTT ATATTCTGCC CATATCCCCT GTTGTACCAC
AACGAAAATA TCGTTATTCG CAGTCTTCGC ATCCCGACCG CCCTTTCCTG GCAACTTCAA
GTTTGGGACT GTACCAAGCG TCGCCTCCGA ACAGGATTTC TTTGGAATCT GCTGCTACTA
CTTCACCTAC ATCAACAGCG GCAATTCCTG TTACCACTCC CAATTTCAAT CGTACGTCTC
GCTTCAACCT TCCCACCTTG CCACCTCTTC GGCCCCGCGC AACCTCTGGT GCGAGTGTCT
TGTCCAACTA TTATGGCCGC TCGCCTTCTC CTCCGCTCCA TCTTCCCTCT GTTGCATCAG
CATTATCAGA TCGAGGAAGT GACAAGGAGC ACGAGCGATT TTCCCTCCCG TCGATCTCCT
CAGAGCTACG TTATCGTGAG CCTTCAAGCC CCCGTCACCC TCTATCCCAG AGTATTAGTT
TAAGGTATAT CCCGAAGCAG CACCCTTATG TGCGCGAAAA ACCTCGATCA CCCAGAAGGA
GCATAAGTGA GAGGGATATG CGAGACGGGG ACCATATTAC CGGGATTGCG GCATTAGTAA
CAGCTGCCGA GCGAGAAAGA GAAAGAGAGT CAGGAGGGCA GAATATTCGC TCGTAAAGTT
GAAAAGGTTC TTTGTCTTGG GCTCTTCGAA TTAACCATGT GGGAGATATT GTAATTTAGC
CTGTGATTTG TTGAAATTAA AACGCAAAGA TATGTAAAAC CAGCTATAAC ATTTGGAGAG
GAGCCAAAAA TATGATGATG GATGTGTCAA CGACT
 
Protein sequence
MSLSTSTLLE RRRSSLSPPS SPPAERSKTQ PSVTPVPSAT LSVYPSPSSM SLPTVPLKDV 
PADQSQEGGL QPSSQRYHPP PLLPRRESFH CSNNRPHPPP FRTALSSLGP MTTSNLILGT
PTEEKASPKF SIKRGKEEKV DKEELTTKGR KRKRLAKACS ACHKNKRRCD GFAPCSNCEF
SNRPCQYLNA QGEPIPPPRT RDPSNNTSSK GKDDCKASSA DGSDIASQED RRESGESNQS
LRDGQMDPES EVDRKPSIGP LQVVDMDVSL GAELVDIFFK RCLPLPFMLH APTFNYRLYL
NQVSPILLDS MYAFAARLCE NPFFLQTFPP NHPPHLRGEL FALRAHRSAE NLIQQRNIWS
EETRRADRGS WQETELAQAA YLLSVYFTCL REPKLGLFYL DAGVDILRPS PATYIQPPAA
PTGASPIEYT THMECRTRTF WAFVLHDLCA ASNGRPRKLG EVDLGAIPLP GTEAHWARWG
GGGIGGREPG RRDGLIAGTG NWLGEEGAVG EIGNVIRILS ILADIMSLAT DPNAGDSKQT
LAARLEAALK AWAMALPSHM HFNEPNLTMA VSKLSSPVAE IKTSGWMYAY MHAVAECGMF
YLQAAVAPVS DGVFTARRQS QAIENLIVIM DAINQTGREG FSFLFPLLVI SNWQEHLEKS
DLLVRDVKHH LTEERLNHWW SEMAREWGVE RHDVLRRGFY ILPISPVVPQ RKYRYSQSSH
PDRPFLATSS LGLYQASPPN RISLESAATT SPTSTAAIPV TTPNFNRTSR FNLPTLPPLR
PRATSGASVL SNYYGRSPSP PLHLPSVASA LSDRGSDKEH ERFSLPSISS ELRYREPSSP
RHPLSQSISL RYIPKQHPYV REKPRSPRRS ISERDMRDGD HITGIAALVT AAERERERES
GGQNIRS