Gene CNB03110 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB03110 
Symbol 
ID3255836 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp937394 
End bp940539 
Gene Length3146 bp 
Protein Length933 aa 
Translation table 
GC content50% 
IMG OID638254955 
Producthypothetical protein 
Protein accessionXP_568888 
Protein GI58262956 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00148463 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCACCCT CACAGCTCGC ACAGCTCAAG GCGGCTCTCA ATACCGCAGG CCTGTCGAGG 
AAGACACATT CTAAAAAGGA CAAGAAAGCA TACAAGAAAG GCGGCGCCCG CGAGACTGAC
AGAGCAAAGA AGGTCGAAAA ACTGGAGGAG ATTAGAAAAA ATCTCAACAA ATTCGATGAG
AGGGAAACAC GAGTAAAGCA TGATGTTGGA GGGAGGAATC TCAAGGGTGT TGTAGGTAGG
CCTAGTGCAA GCAAGCAGGC TGGGTTAGAA CAGGTGTGTA GTTCCTGTCA ACTCGCCTCC
GATCACATCT GATAGCGTTG GTTTGCAGCG AAAGAAGACG CTTTTGCCTG AACATCAGCT
TCGCGACCAC CGCGGTACCT TTAGAGACAG GCGATTCGGT GAAAATGACC CTTCCATGTC
CATCGAAGAC CGTATGCTTG AGCGATACAC CCGGGAGCGT CAACGTGGTC AAGGCAAAAA
AGGAATGTTC AACCTGGAAG ATGAAGATGA AGATGAGCCT TTCGGTGAAT TGGATGACGG
ATTTGCCTTG GGTGGTTTGA CACACGGTGG AAGGAGTGTG ATGGATCTTC CTGGTGACGA
CTTTGTCGCC CAAGGTCTGG CCGACGAAGA CGAGGATGAA GAAGAGAACA GAGCCGGGAG
AATCGACAAA CGTGTGGTCA GCAAGGTTCA TTTTGGTGGA TTTGAAGAGG CAATGGATGA
AGACCTGGTA AGCGCTTGCT ATTATATTTC GCACATGGAG GCTAACTGAG ACGATCTCAG
CCCGAAAAGA AGAAATCCAA GCAGGAAGTT ATGGCCGAGA TTATTGCTAA ATCCAAGGAG
CACAAATACG AGCGCCAGCA GCAGCGAGAG ATGGACGCCG AGCTTCGTGA AGAACTCGAC
GGCGACATTG CAGACTTGCA GATGCTTCTC GCTGAAAATG CTCCAACTAC CCCTGCTGCT
ACCCTCTTCC CTACCACATC GAAACTTCAC CCTGACCCCA TGCCCGTACC AGGAGGGGAA
GTGATTGGAG ATGGAGAGTA TGATCAAATC GTTCGTTCTC TTGCTTTCGA CGCACGAGCC
AAGCCCAAGA ACAGAACCAA GACGGAAGAG GAGATTGCTC TTGAAGAGAA GGAAAATCTT
GAAAAGGCAG AGGCGAAGCG ATTGCGTCGA ATGAGAGGTG AGAATGTCTC CGATGGTGAG
GAAGAAGAAG GGTCAAGAAA GAAGCGAAAG GCGGATGACA AAAAACCTGA TGCGGATGAT
TTGGAGGACG ATTACGTGGA AGATGAGGCG TTGTTGGGTC CAGGTTTGAC AAGGGAAGAG
ATCGAAACTA TGGTCCTTTC TGGAAGCGAA GCTGGGTCTG ACGACGAGCA AGATGACGAG
GATGGTAGGG AAGAGGAAGA AAGCTTTGAC GAAGACGCGG AAGAAGAAAA TGAGAGTGAT
GCCGAGTCTG CTATGGAAGA CTTGACTGAA AATGAAGAGC TGTCGGTGTC TGAAAGTGAA
GAAGTCGAAC CTGTTGTCAA GAGAACCAAA GGGAAGAAGA CTGCGAGGAC AGAAAAGGTC
AAGGAAATAC CCTACACATT CCCCTGTCCT GCGAACATTG AAGAATTCGA AGAAATAGTT
GATCCTTTGG AGGACAGTGC GTTACCTACT GTCGTTCAGC GTATCAGAGC ACTGCATCAC
CCCAGTCTGG CGCAAGGCAA CAAGGAAAAA CTTCAGGTCA GTTTCGACAT GTGCTGCTGT
CTCATACTCT CTCACTGACT ACTGTTTAAA CAGGAATTCC TTGGCGTACT GATCGATTAC
GTCCTCATTC TTTCCTCTCG TCCTTCTCCT CCATTTTCTC TCATCCAAAC ACTTGTTCCG
CATCTCACAG CTCTCGTCAA GTTAAACCCC ATCACTGCTG CTGCACATTT TGTCGAAAAA
ATCAAGCTCA TGCAAAAAAA TCTTTCTCGT GGACTGGCTC GAGGTGCCGC CAGACCCGAT
TCCAAAACGT TTCCTGGTGC GCCCGAGCTG GCTCTACTGA GGCTGGTCGG TTTGTTCTGG
TCTACCAGTG ACTATTCGCA TCCAGTGGTC GTACCCGCGG TCTTGCTGAT GGGGCAGTAT
TTATCTCAAA GTCGCGTCCG ATCGCTGCGT GATCTGGCAT CTGGTTTGTT CCTGTGTTCT
CTTTTAGCCC AAGTAAGTTG TGCGTCCATC TCGCCCTATA AACAACATAC TGACAGTGTA
CAAGTACGAA TCACTTTCGA AGCGGATTTT GCCAGAGGCC GTCAACTTTG TAGCTTCGTC
AATCCTCATC CTCCTTCCTC GCCGCAAGGG TGCAGAGGTC AACAGGACAT ACCCTGATTT
AAAAGCACCA TCTACATCCC TTTACTTGAA TCTTCCCTCG TCCGTCGTTC CTTCACAACC
ACTGGACCTT GGGTCTGCTA TTACTGCCGT CGGAGATCAG GACGAGGAAG AGGCGGAGCA
ACTCAAGGGC GGACTGCTAG TTGTTGCTTG CAAGCTCGTT GACAAATTTG CTGGCTTGTA
TCTCGACTCA GAGGCTTTCG TCGAGCTGAT GAGTCCTGTA AAACAGGTTC TGGAACAATC
AAGGGCTAAA AAGTTATCCA GCGAGCTCAA CATGGTATTA ACATCTACTT TGACCGCCCT
GTCCAAGCGC CTCAGCAACA CTCTCAGTAC TCGTCGCCCT CTTACTCTCC AATCTCACAA
ACCCATTCCT ATCGCTTCCT ATGCCCCTCG ATTCGAGGAG AACTTTGCGC CTGGCAAGCA
CTACGATCCT GATACCGAAC GCAATGCTTC GGCCAAGCTC AAAGCTCTTT ACAAGAAGGA
GAGAAAGGGT GCCATGCGCG AGCTCAGAAA AGACAACCGA TTCTTGGCCG GCGAAAAGGC
GAGGGAACAA GGCGAAAAGG ACAAGGAATA CAACGCGAGG ATGCGCAAGG CGGAGGGCAG
TATTACAGTG GAGCGTGCAG AAGAAAAGGC CATGGAAAGG TAAGTCTCCT CTCAAGCCCA
GACTTTACAT GTTACTTTTA GTTCGTGCTG ACCATTCAAT TTGTCTTTAG AGAAAAAGCA
AAGGAGAAGC GAAGGGCTGG CAGGGATTAA TACCAGGAAT AGAATCTCGT CTCCTTGGCT
TATACAGTGG TTTACAGTGG TTTATA
 
Protein sequence
MAPSQLAQLK AALNTAGLSR KTHSKKDKKA YKKGGARETD RAKKVEKLEE IRKNLNKFDE 
RETRVKHDVG GRNLKGVVGR PSASKQAGLE QRKKTLLPEH QLRDHRGTFR DRRFGENDPS
MSIEDRMLER YTRERQRGQG KKGMFNLEDE DEDEPFGELD DGFALGGLTH GGRSVMDLPG
DDFVAQGLAD EDEDEEENRA GRIDKRVVSK VHFGGFEEAM DEDLPEKKKS KQEVMAEIIA
KSKEHKYERQ QQREMDAELR EELDGDIADL QMLLAENAPT TPAATLFPTT SKLHPDPMPV
PGGEVIGDGE YDQIVRSLAF DARAKPKNRT KTEEEIALEE KENLEKAEAK RLRRMRGENV
SDGEEEEGSR KKRKADDKKP DADDLEDDYV EDEALLGPGL TREEIETMVL SGSEAGSDDE
QDDEDGREEE ESFDEDAEEE NESDAESAME DLTENEELSV SESEEVEPVV KRTKGKKTAR
TEKVKEIPYT FPCPANIEEF EEIVDPLEDS ALPTVVQRIR ALHHPSLAQG NKEKLQEFLG
VLIDYVLILS SRPSPPFSLI QTLVPHLTAL VKLNPITAAA HFVEKIKLMQ KNLSRGLARG
AARPDSKTFP GAPELALLRL VGLFWSTSDY SHPVVVPAVL LMGQYLSQSR VRSLRDLASG
LFLCSLLAQY ESLSKRILPE AVNFVASSIL ILLPRRKGAE VNRTYPDLKA PSTSLYLNLP
SSVVPSQPLD LGSAITAVGD QDEEEAEQLK GGLLVVACKL VDKFAGLYLD SEAFVELMSP
VKQVLEQSRA KKLSSELNMV LTSTLTALSK RLSNTLSTRR PLTLQSHKPI PIASYAPRFE
ENFAPGKHYD PDTERNASAK LKALYKKERK GAMRELRKDN RFLAGEKARE QGEKDKEYNA
RMRKAEGSIT VERAEEKAME REKAKEKRRA GRD