Gene CNF01450 details


Gene Information

Locus tag: CNF01450
Symbol:
ID: 3258113
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 424167
End bp: 427407
Gene Length: 3241 bp
Protein Length: 769 aa
Translation table:
GC content: 46%
IMG OID: 638257270
Product: conserved hypothetical protein
Protein accession: XP_571677
Protein GI: 58269042
COG category:
COG ID:
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0507307
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a

Sequence

Gene sequence
ATGAACTGTC GGGCAGGAGG ATGGTCAGAA AGGAGCTTGC CACTTGCAAC AGTTTTGTTA 
ACACCTTGGG CGCTTTTCCA AGGAGCCTCG CAAGAGGTGA TAGACAGGGC GAAGGAGGAA
GGCGATGATG AATTATTGAG GCAGGGTGGA ACTGCGTTAG AGGTGAATTC TAATGGTCTT
AAACGCGCAT GACTGGGCTG ATACATATAG GAAGATGGCC ATTATTAGCA GTTCCAAGAG
ATTTATTAGA AGTCCCTCAT GCCAAAAAGT CATCGGCAAG TTTCTTCTGC TTGGTATGGA
CGATTTGATG ACAATCATTG CAGAGGGCAT ATGGGGCGGA CGTATCATCT ATACTGCCCT
GAATGCGCAT GCTCTCATAG CTGACGTACG CCCGTTTAAC CGTCAAAGAA AAAGAACAAA
CAGAATTGCT GATTCCGGTT GATAGAACTA TAAAAAGAAG CCTATTCAGA TGTATAATCC
TCACAAAGCA CCATTTTTGG ATCATTACAG GTAAGGTTTA GGATGTTCAG CTGTACGCAG
TTTTGGCTGA TGTGATTGTA AAATGTAGGT TAAAGGTACC ACGGTATCGA TCCATGCTTG
AATATGTCAA TTTCCTCGTC TTATTTATAC TGTATGTCAT TGCCATTGAA GGCCTTGTCG
AGAGCCGCAT CAATGGCCGA GAGTGGGCGT TCATTATCTA TGCTATGGGT AAGTACGATA
ATAATCAGGT AGACGCGGAA GTTCAATGTT TCCTTAGCAT TCTCCCTAGA TAAACTTGCG
GCTATCCGAG AGCATGGGCT GAAAGGTGAT GCAGTCCATC TGTTTGTTAC GTGAGGCTAA
TGTGCTCTAT AGTATTCAGC AGCAGCCTTG TCAACGGGTT CGATCTAGTT TTCATGATCA
TCTACGCCGT GTATCTCGGA GCGAGAACTT ACGGAGCCCG ATATCATAAC GAATACGCGC
TGGGGCTAGG AGCAGATTGG TTAGCAATAG GTCAGTCCAG ATATTAACGG AAGCAGTCAA
TGATCGGAGG GTATTTATCA TAAAGACTTA ACAGGTGCTG TACTAATTTT TCCTCGTCTG
GCGTTCGTCA CGCTTGCCAA TAACCTGATG ATTTTGAGCA TACGGTCGAT GTTGACAGAA
TTCTTCTGTA AGTTTGTTAT CTGAAGTCAA CATTAGTCTA TGCCAAAAGT TTACTCGTTT
CAGTTTTGAT GGGAGTTGGT ATATTCTGTT TTCTAGGTAC GTCCGGGCTT CTGAACTGTA
TGATAGGGAT TTGATCGTTT TATAGGCTTC GTATACGCGC TATTCACTCT TGGTCAAGGA
AAATTCGAGT TATCACAAAT AGCGTGGTGG TTGTTGGAGG TGTACTTTGG ACTAGATGCT
TCAGGGTTTG AACATGCTTG TGAGTGAGAC GAAAGGAAAT TATCGCATTG AAACTCTAAT
CGATCCTCGA AGATCTTTTT CACCCATTTC TAGGGCCTCT GCTCATGGTC TTCTACGCCT
TACTTTCAAA CACATTGCTC TTGACTGTAC TTGTCGCCAT CCTTGGCAAT ACCTTTGCCA
CTATCAACGC CGATGCCGCT GCAGAGGTAA GCGCATGGAA ATACATCAGA ATTTTCGCTT
AAACGACGCC ACAATCAGTC AATGTTTCGA AAAGCTGTAT CTACTCTTGA AGGCGTAAAG
GCAGGTGCGC ATCACAAACC CTCTTCCTAT TCGAACAGAC TTAAGCTAAT ATTGAATCTA
GATGCTGTAT TTAGTTACCA GTTACCTTTC AATTTGGTTG CTGTGATCAT AATGTGGCCG
ATGAGATACG TTCTCAACGC GAGGTGGTAA GTAACTGACT TCTTTTTCGC TACTAGCTAA
ACATTTTCCC ATTAGGTTTC ATAAGGTCAA TGGTGAGTGG TAGAAATTTC GATAATGCTT
CAGCTGATCC ATCGCCAGTT TTCATGATCA GGGTGACGAG CGTACATATA TTGCTCTTGA
TAGCCCTGTA CGAAAGGCAG TCATATCAGG ATCAAGGTCT GATGGAGCAG CTTGGGGACT
TCGCGGAAAG ATATGTTGGG AGTCTTCCTA GACGTCTCAA GGCGGCAGGT GATTCTTATC
ATTATCCATC TTGTGTACTA GCAATACTTA TATCATTTGT TCTAATCTAG CTGGTTTCGA
CAATTTTGCT TCGAGAAGCG ATATTGCGGC AGTGTTTGAG ATCGAAAGGG AGGTAGGGGC
CTTCTATGCT GGATGGGACG ATGAAGTGGA TGAGTCAGAG ATCATTTTAC CCCCTGCCTT
CGATGGCGAT CCCCCGTCAA TGAACAATGA CGACGAAAGA CCCGGTCAAG ACATAACTGC
CTTTGATTCC GCCACTGCTC CCTCGAAAAA ACATTCTTCA CCTCCATCAC CCGCATCTCC
TTTGACGGCG TCCCCGTCGC GCCTTGAACA TCAGTTGGAA AACCCTCACG CTCGACGTAA
CTCTATGCCA TCATCACATC GGTCATACAT CCAAAATCCC TATCAAGTCC CTATACGTCG
ACGAAATAGT TCTATCCACG GCCCTAGTCC TTTGGCCCAG CTTTTTGTTC GAGGCTCGGA
ATCTGATGCT TTGAGGGGGA GGAGGGCGTC GATGGCTGGA GCCATAGGCG CAGGTCCCGC
GCTTGCTGCG CCCCCTGTGT TCGGTCCATC TTCCAAACCT AGACGGAGTC ATTTTAGGTC
GGAATCATTC CCAGAGTTTT CAAGCAAAGA GCAAGATCCG CATCAACATC CAGCAACAAA
TTCAGTATCC CGCTCCAATA AATCATATCT CATAAATCCT TCCATCGCGC CTATCACCGA
AGGCAAGAGT GTTTCATTCT CAAGCGATCT GAAAGATCCG GAAGACGATC CAGTATGCGA
TCCCAGCAGC TCCATTGCTG GTCGCAAAGA AGGGAACCTT GCTGTCGGTC ATGGTGGTCT
GCCTATCGGT AATAGGGAAG AAGTTAAAAC TTCACCACTG TCGGTCAAGG CTACGCGCTT
TCAGACAGCT TTTCCAGGCG CATACTCCCC TTTAGGTACC GTTAATGATT CGCGACCACA
CACTCCAAAC TCACAGGCCA GCAATGTGGC TAAGGCTTCT CAAGTGGAAG TTCTGGCTCT
GGCCAGGCCA GAAGAAGAAG CATTAAAACA AAATATGAAG GAGAAGCTTG AAGAAATGGA
CCGGCGACAG AAACACATAG AGCAATTATT GGAACGATTG CTTGGGCACT TTGAACGATG
A
 
Protein sequence
MNCRAGGWSE RSLPLATVLL TPWALFQGAS QEVIDRAKEE GDDELLRQGG TALEMAIISS 
SKRFIRSPSC QKVIGKFLLL EGIWGGRIIY TALNAHALIA DNYKKKPIQM YNPHKAPFLD
HYRLKVPRYR SMLEYVNFLV LFILYVIAIE GLVESRINGR EWAFIIYAMA FSLDKLAAIR
EHGLKVFSSS LVNGFDLVFM IIYAVYLGAR TYGARYHNEY ALGLGADWLA IGFVYALFTL
GQGKFELSQI AWWLLEVYFG LDASGFEHAW PLLMVFYALL SNTLLLTVLV AILGNTFATI
NADAAAESMF RKAVSTLEGV KADAVFSYQL PFNLVAVIIM WPMRYVLNAR WFHKVNVFMI
RVTSVHILLL IALYERQSYQ DQGLMEQLGD FAERYVGSLP RRLKAAAGFD NFASRSDIAA
VFEIEREVGA FYAGWDDEVD ESEIILPPAF DGDPPSMNND DERPGQDITA FDSATAPSKK
HSSPPSPASP LTASPSRLEH QLENPHARRN SMPSSHRSYI QNPYQVPIRR RNSSIHGPSP
LAQLFVRGSE SDALRGRRAS MAGAIGAGPA LAAPPVFGPS SKPRRSHFRS ESFPEFSSKE
QDPHQHPATN SVSRSNKSYL INPSIAPITE GKSVSFSSDL KDPEDDPVCD PSSSIAGRKE
GNLAVGHGGL PIGNREEVKT SPLSVKATRF QTAFPGAYSP LGTVNDSRPH TPNSQASNVA
KASQVEVLAL ARPEEEALKQ NMKEKLEEMD RRQKHIEQLL ERLLGHFER