Gene CNB03390 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB03390 
Symbol 
ID3255864 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp1019609 
End bp1023157 
Gene Length3549 bp 
Protein Length938 aa 
Translation table 
GC content48% 
IMG OID638254984 
Productnucleus protein, putative 
Protein accessionXP_568913 
Protein GI58263006 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.175447 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGAAAAAAAA CTCATTCCCC CTCTTGCTTG CTTTCAACAG ACCTCACAGT CGTCCCAGCC 
CTTTCATTGA GACAGGTTTC TCCCACGCAG CGATAAAATA CAAGAAAGGG CCGCAACGCA
CAGCTTTTTG CCTCTCTTTA GACCTCGACC GAGGCTGACT GAGCGCGTCG GCATCGCGCA
TCCGGCCAAA ACATCAACTA GGGGAACGTC TCCTCACAAT GTTCGTCGTC ACTTTTGTCC
CATACCTTGC TATACTCTTT TAGCTAACTC AATCTCTTTG ACTACCATAA ATTGTCTGTA
CAGTAAGACG CCATATACTG CTTGCGCTAA AAAGGACGTC TACTGCAGTC TCAAAGGAGG
ATAACCCACG GCAAACAGTA ATAAAAAGCG TGGAATATTG TGCCACACCA AGGTAAAATG
TCCGATGTCG ACGGAAAAAG GCGTAAGATA CAGGTACATG GTCGTCTTCT ATCGTGAGAA
GGTCGGATAA AGTAATGACA AGTCTTCTAG CGGGCCTGTG ATGTATGCAG GCGAAAAAAA
ATTAAGTGCG AAGGGCCCAT GAATAGCCTG AGTGATGCCA GTCAGTGTAA TATCGTGCTA
TATCATTGAA AATGCCTGAC TGAAGACGGT TTCCTGTAGA ATGTGCTCAC TGTGAAGAAT
ACGGCATGGA TTGCACGTAC GTCGAGGCGG CAAAGAGAAG GGGGCCTCCA AAAGGGTATG
TCGTGTATTA ACTCTACTGA GCATACCATC TCATGTGTAT TATTAGTTAC GTTGAGACAC
TGGAGCAAAG AGCTGGACGA TTAGAGAGGA TGCTGCAACA GGTAATTGCC GTTCACCGTT
TGGTGTTTTA ATATCTGCTG ATATTACAAG ATTTATCCTG GTGTCGATTT AAACGAATAT
GTGGGGCCAA AGCCGGACAG GGAAGACTTT GATATCAGTG CGTATCATGG CACTCTTCGT
TCCCTCAATA TCCCACCATA TCCAGCCTTA AAGCCTCTAC ATTCCGAACA TCTTGTCACT
CCACATACAT CTCGTTCTAC TTCGGTTGGC ACATCTCCGG CGGCCCCAGC ACCTTCACCC
TCAATGCAAG CGCTGGGTCC CTCTCCATGG CGAATGTATG AAAGAGATCC TGCGAAACCT
GCTGAAAATG AGTCCGACGT GGAAGAAGAG GCGGCTGCGC AGCTGTCTAT AGCTACATCA
ATGAGTCAGT TAGACATTCG CGACAGCCAT TGGCGTTGGC ATGGTCGGGC GTCAGGGGCT
TTCCTTATGC GACAGTTTGA AGATCTCAAG TCAGCGACAG GTAATACCTC AAGTATCATA
CAAGACATCA ATAACCACAA ACGACAACAA TTCTGGCATG TCCCAGAATG GGAACTTGTC
ATTGCGAACG AAGGCTTACG CCCTCTTGAC TACTCTATCT GGCCAGAAAA AGGCCTAGAT
CAACGACTTA TCGACGCATA TTTTGATAAC GTCAATCTTC ACCTGCCCTT ACTTAACCGT
AAATTCTTTC AACGACAATA CGATTCTGGT ATGTGGCGGA ACAATCCCGG TTTCTCGAGA
GTCTGTTTGC TGGTTTTCGC CAATGGATCG CGGTTCGTGG ATGATCCACG GGTCTACTGG
CCTGCAAATT TGTCGATGAC AGAGGAAGGG AGTGAACGCC TTGCAACGGA CAAAGACGGT
ACGCTCCGTT ACTCGGCTGG CTGGAAATAC CTCCGCAGCC TTCTTCGCAT GGGAAGAAGT
ATCATGCAGG GACCAAATCT GTATGAATTT CAAACCCAGG TCCTCATTTG TCAATTCTTG
CAGGGGAGTG CTGTCCCACA TCTTATGTGG ATTTTGTCAG GCTTCGGTCT CCGCTCGGCC
CAAGAACTAG GCATTCATGT TCGGGCCACT TTACTCCATG CCGATCCTAC CGAGCGAGCT
CTTTACAATC GCGCGTTTTG GTGCTTGTAC CACATTGACC GGTATAACTG CGCTGCGATT
GGCCGATCAG TCGCTATACA GGATTCTGAC TTCAATGCGG ATTATCCAAT TGAGGTCGAT
GATGAGTACT GGGACACTGG AGATACTGAG CGCGACTTCA AGCAGCCAGA AGGGAAAATC
TCATTAATAA CGTCCTTTGT CCAAACACTC AAACTCGATC ACATCATGGG CGCAATATTG
CAGAACGTGT ACGCAATCAA CAAGCTTCCA GAGCAGCGAG CGGACATTGC TGCTCAGCGT
GCCATCGTTG TTGAGTTAGA CTCTGCCCTC AATTCTTGGG CTGACAACGT TCCACACGAG
CTTCGCTGGG ACCCTAGTTG CTCCGACTAT CAATTGTTCC GCCAGTCAGC TGTGCTATAT
ATCTATTATT ACTACTGCCA AATCCTAATC CACCGTCCCT TCATTCCTGG CCCGCGAAAT
CAACATGCCG CCGATCTACC GTCCCTTGCA GTTTGTGTCA ATGCCGCTCG GTCAATCTGC
AACATCACCT ATGCGGCACT CAAAAGAGGT AGACAGGAGG GGTGCTTACC CGGACGAGCC
CTAAACGTCT CGTTCATGCT GCCAACATGG ATCGCTGCCA TTATCCTTGT GATCAACATC
TACTCTGGGA GACAAACAGC GGCCGAACGA GAAAAGGCTT TGATCGACAT TGGGCGATGT
GTATCGGCAA GTAAGGAGCT GGAATTGATA TGGAGGCAAA GCGGTAAATA CACCGACTTT
TTGTTGCAAT TGGCAAGAGA GGGCGGAATG CCCAACGCCG ACAAGGTGCC TATGGTCGAG
AAAAGATTGC GTGAAAACAA TCCGCAACTG TCAGAGCGCT CACGGCGTCC AGAGTCAGTG
CAAGGGTCTA CCGCCGGAAC ACCTGATCAT AACTCACCTT CAACCAGTTA CCCATACAGT
CATGGTCGAT CAAATGGGCA GAATTCCAGG TCGTCCGGCG AGCATTCCCG CCAGCCATCT
GCGGCACCTG TAACAGGGTT TGATCTGCGC AATTTCACTT GTCCAGAAAC ATATGGTGAT
ACTTCAGCCA CACCTCAATT TCCCTATTCT CGTGATGATC TCCCGTTTAC GCATATGCCC
TCTCCCTCGT CATCCCAAAC TGGCTTTCAA AACATGTTTC AACCATCCTC GCAATCTTCA
CAATATCCTT CCAATACTAG GAATGTGCCT CCACACTTTA CACCTTCCTC CCAAGCGCAT
ACGCAGTACA ATCTCCAAAC TCCTAATAAT GAACATCCTT TACCGTCGCA GAATGATGGC
GAGCTAGCGT CTTCCAATAT CTATGATTCG TTGATTGACA TGACAAGCTT TGAGTCACAA
CTGCTTGACA TGAGTACCAC AGCTTTCGGG GGACCCGAAA ACACGTCAAA TGGCGATTGG
TGGTCTCGAT TATTTAACGA CTACATGTGG GTAATAGCAT ATTATGGCAT TCGAAAGACA
CTTGTAAATC CCAAGCTGAC TGCTTTATTA ATCCCAGGGG TCCTGATCTT CACACCAACA
TGCCTCCCGC ATGTGGTCGT TCGTCGAATT CCGGAGTTTG ACAGTTATCG TAAAAGGTTG
GCCGAAATA
 
Protein sequence
MSDVDGKRRK IQRACDVCRR KKIKCEGPMN SLSDAKCAHC EEYGMDCTYV EAAKRRGPPK 
GYVETLEQRA GRLERMLQQI YPGVDLNEYV GPKPDREDFD ISAYHGTLRS LNIPPYPALK
PLHSEHLVTP HTSRSTSVGT SPAAPAPSPS MQALGPSPWR MYERDPAKPA ENESDVEEEA
AAQLSIATSM SQLDIRDSHW RWHGRASGAF LMRQFEDLKS ATGNTSSIIQ DINNHKRQQF
WHVPEWELVI ANEGLRPLDY SIWPEKGLDQ RLIDAYFDNV NLHLPLLNRK FFQRQYDSGM
WRNNPGFSRV CLLVFANGSR FVDDPRVYWP ANLSMTEEGS ERLATDKDGT LRYSAGWKYL
RSLLRMGRSI MQGPNLYEFQ TQVLICQFLQ GSAVPHLMWI LSGFGLRSAQ ELGIHVRATL
LHADPTERAL YNRAFWCLYH IDRYNCAAIG RSVAIQDSDF NADYPIEVDD EYWDTGDTER
DFKQPEGKIS LITSFVQTLK LDHIMGAILQ NVYAINKLPE QRADIAAQRA IVVELDSALN
SWADNVPHEL RWDPSCSDYQ LFRQSAVLYI YYYYCQILIH RPFIPGPRNQ HAADLPSLAV
CVNAARSICN ITYAALKRGR QEGCLPGRAL NVSFMLPTWI AAIILVINIY SGRQTAAERE
KALIDIGRCV SASKELELIW RQSGKYTDFL LQLAREGGMP NADKVPMVEK RLRENNPQLS
ERSRRPESVQ GSTAGTPDHN SPSTSYPYSH GRSNGQNSRS SGEHSRQPSA APVTGFDLRN
FTCPETYGDT SATPQFPYSR DDLPFTHMPS PSSSQTGFQN MFQPSSQSSQ YPSNTRNVPP
HFTPSSQAHT QYNLQTPNNE HPLPSQNDGE LASSNIYDSL IDMTSFESQL LDMSTTAFGG
PENTSNGDWW SRLFNDYMGP DLHTNMPPAC GRSSNSGV