Gene Information |
Locus tag | CNB03110 |
Symbol | |
ID | 3255836 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 937394 |
End bp | 940539 |
Gene Length | 3146 bp |
Protein Length | 933 aa |
Translation table | |
GC content | 50% |
IMG OID | 638254955 |
Product | hypothetical protein |
Protein accession | XP_568888 |
Protein GI | 58262956 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
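The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes 1-based, inclusive coordinates (an assumption, though it reproduces the listed Gene Length) and assumes the coding sequence includes one stop codon. The function and variable names are illustrative and do not come from the source database.

```python
# Values copied from the Gene Information table above.
start_bp = 937394
end_bp = 940539
protein_length_aa = 933


def gene_length(start: int, end: int) -> int:
    """Span of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Bases needed to encode the protein, optionally counting one stop codon."""
    return 3 * (protein_aa + (1 if include_stop else 0))


span = gene_length(start_bp, end_bp)    # 3146, matches "Gene Length"
cds = coding_length(protein_length_aa)  # 2802
non_coding = span - cds                 # 344

print(f"gene span: {span} bp")
print(f"coding bases (with stop): {cds} bp")
print(f"non-coding bases within the span: {non_coding} bp")
```

Under these assumptions, 344 bp of the span do not encode protein. That is consistent with "Is gene spliced | Yes", although this arithmetic alone cannot separate introns from any untranslated flanking bases.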
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00148463 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCACCCT CACAGCTCGC ACAGCTCAAG GCGGCTCTCA ATACCGCAGG CCTGTCGAGG AAGACACATT CTAAAAAGGA CAAGAAAGCA TACAAGAAAG GCGGCGCCCG CGAGACTGAC AGAGCAAAGA AGGTCGAAAA ACTGGAGGAG ATTAGAAAAA ATCTCAACAA ATTCGATGAG AGGGAAACAC GAGTAAAGCA TGATGTTGGA GGGAGGAATC TCAAGGGTGT TGTAGGTAGG CCTAGTGCAA GCAAGCAGGC TGGGTTAGAA CAGGTGTGTA GTTCCTGTCA ACTCGCCTCC GATCACATCT GATAGCGTTG GTTTGCAGCG AAAGAAGACG CTTTTGCCTG AACATCAGCT TCGCGACCAC CGCGGTACCT TTAGAGACAG GCGATTCGGT GAAAATGACC CTTCCATGTC CATCGAAGAC CGTATGCTTG AGCGATACAC CCGGGAGCGT CAACGTGGTC AAGGCAAAAA AGGAATGTTC AACCTGGAAG ATGAAGATGA AGATGAGCCT TTCGGTGAAT TGGATGACGG ATTTGCCTTG GGTGGTTTGA CACACGGTGG AAGGAGTGTG ATGGATCTTC CTGGTGACGA CTTTGTCGCC CAAGGTCTGG CCGACGAAGA CGAGGATGAA GAAGAGAACA GAGCCGGGAG AATCGACAAA CGTGTGGTCA GCAAGGTTCA TTTTGGTGGA TTTGAAGAGG CAATGGATGA AGACCTGGTA AGCGCTTGCT ATTATATTTC GCACATGGAG GCTAACTGAG ACGATCTCAG CCCGAAAAGA AGAAATCCAA GCAGGAAGTT ATGGCCGAGA TTATTGCTAA ATCCAAGGAG CACAAATACG AGCGCCAGCA GCAGCGAGAG ATGGACGCCG AGCTTCGTGA AGAACTCGAC GGCGACATTG CAGACTTGCA GATGCTTCTC GCTGAAAATG CTCCAACTAC CCCTGCTGCT ACCCTCTTCC CTACCACATC GAAACTTCAC CCTGACCCCA TGCCCGTACC AGGAGGGGAA GTGATTGGAG ATGGAGAGTA TGATCAAATC GTTCGTTCTC TTGCTTTCGA CGCACGAGCC AAGCCCAAGA ACAGAACCAA GACGGAAGAG GAGATTGCTC TTGAAGAGAA GGAAAATCTT GAAAAGGCAG AGGCGAAGCG ATTGCGTCGA ATGAGAGGTG AGAATGTCTC CGATGGTGAG GAAGAAGAAG GGTCAAGAAA GAAGCGAAAG GCGGATGACA AAAAACCTGA TGCGGATGAT TTGGAGGACG ATTACGTGGA AGATGAGGCG TTGTTGGGTC CAGGTTTGAC AAGGGAAGAG ATCGAAACTA TGGTCCTTTC TGGAAGCGAA GCTGGGTCTG ACGACGAGCA AGATGACGAG GATGGTAGGG AAGAGGAAGA AAGCTTTGAC GAAGACGCGG AAGAAGAAAA TGAGAGTGAT GCCGAGTCTG CTATGGAAGA CTTGACTGAA AATGAAGAGC TGTCGGTGTC TGAAAGTGAA GAAGTCGAAC CTGTTGTCAA GAGAACCAAA GGGAAGAAGA CTGCGAGGAC AGAAAAGGTC AAGGAAATAC CCTACACATT CCCCTGTCCT GCGAACATTG AAGAATTCGA AGAAATAGTT GATCCTTTGG AGGACAGTGC GTTACCTACT GTCGTTCAGC GTATCAGAGC ACTGCATCAC CCCAGTCTGG CGCAAGGCAA CAAGGAAAAA CTTCAGGTCA GTTTCGACAT GTGCTGCTGT CTCATACTCT CTCACTGACT ACTGTTTAAA CAGGAATTCC TTGGCGTACT GATCGATTAC 
GTCCTCATTC TTTCCTCTCG TCCTTCTCCT CCATTTTCTC TCATCCAAAC ACTTGTTCCG CATCTCACAG CTCTCGTCAA GTTAAACCCC ATCACTGCTG CTGCACATTT TGTCGAAAAA ATCAAGCTCA TGCAAAAAAA TCTTTCTCGT GGACTGGCTC GAGGTGCCGC CAGACCCGAT TCCAAAACGT TTCCTGGTGC GCCCGAGCTG GCTCTACTGA GGCTGGTCGG TTTGTTCTGG TCTACCAGTG ACTATTCGCA TCCAGTGGTC GTACCCGCGG TCTTGCTGAT GGGGCAGTAT TTATCTCAAA GTCGCGTCCG ATCGCTGCGT GATCTGGCAT CTGGTTTGTT CCTGTGTTCT CTTTTAGCCC AAGTAAGTTG TGCGTCCATC TCGCCCTATA AACAACATAC TGACAGTGTA CAAGTACGAA TCACTTTCGA AGCGGATTTT GCCAGAGGCC GTCAACTTTG TAGCTTCGTC AATCCTCATC CTCCTTCCTC GCCGCAAGGG TGCAGAGGTC AACAGGACAT ACCCTGATTT AAAAGCACCA TCTACATCCC TTTACTTGAA TCTTCCCTCG TCCGTCGTTC CTTCACAACC ACTGGACCTT GGGTCTGCTA TTACTGCCGT CGGAGATCAG GACGAGGAAG AGGCGGAGCA ACTCAAGGGC GGACTGCTAG TTGTTGCTTG CAAGCTCGTT GACAAATTTG CTGGCTTGTA TCTCGACTCA GAGGCTTTCG TCGAGCTGAT GAGTCCTGTA AAACAGGTTC TGGAACAATC AAGGGCTAAA AAGTTATCCA GCGAGCTCAA CATGGTATTA ACATCTACTT TGACCGCCCT GTCCAAGCGC CTCAGCAACA CTCTCAGTAC TCGTCGCCCT CTTACTCTCC AATCTCACAA ACCCATTCCT ATCGCTTCCT ATGCCCCTCG ATTCGAGGAG AACTTTGCGC CTGGCAAGCA CTACGATCCT GATACCGAAC GCAATGCTTC GGCCAAGCTC AAAGCTCTTT ACAAGAAGGA GAGAAAGGGT GCCATGCGCG AGCTCAGAAA AGACAACCGA TTCTTGGCCG GCGAAAAGGC GAGGGAACAA GGCGAAAAGG ACAAGGAATA CAACGCGAGG ATGCGCAAGG CGGAGGGCAG TATTACAGTG GAGCGTGCAG AAGAAAAGGC CATGGAAAGG TAAGTCTCCT CTCAAGCCCA GACTTTACAT GTTACTTTTA GTTCGTGCTG ACCATTCAAT TTGTCTTTAG AGAAAAAGCA AAGGAGAAGC GAAGGGCTGG CAGGGATTAA TACCAGGAAT AGAATCTCGT CTCCTTGGCT TATACAGTGG TTTACAGTGG TTTATA
|
Protein sequence | MAPSQLAQLK AALNTAGLSR KTHSKKDKKA YKKGGARETD RAKKVEKLEE IRKNLNKFDE RETRVKHDVG GRNLKGVVGR PSASKQAGLE QRKKTLLPEH QLRDHRGTFR DRRFGENDPS MSIEDRMLER YTRERQRGQG KKGMFNLEDE DEDEPFGELD DGFALGGLTH GGRSVMDLPG DDFVAQGLAD EDEDEEENRA GRIDKRVVSK VHFGGFEEAM DEDLPEKKKS KQEVMAEIIA KSKEHKYERQ QQREMDAELR EELDGDIADL QMLLAENAPT TPAATLFPTT SKLHPDPMPV PGGEVIGDGE YDQIVRSLAF DARAKPKNRT KTEEEIALEE KENLEKAEAK RLRRMRGENV SDGEEEEGSR KKRKADDKKP DADDLEDDYV EDEALLGPGL TREEIETMVL SGSEAGSDDE QDDEDGREEE ESFDEDAEEE NESDAESAME DLTENEELSV SESEEVEPVV KRTKGKKTAR TEKVKEIPYT FPCPANIEEF EEIVDPLEDS ALPTVVQRIR ALHHPSLAQG NKEKLQEFLG VLIDYVLILS SRPSPPFSLI QTLVPHLTAL VKLNPITAAA HFVEKIKLMQ KNLSRGLARG AARPDSKTFP GAPELALLRL VGLFWSTSDY SHPVVVPAVL LMGQYLSQSR VRSLRDLASG LFLCSLLAQY ESLSKRILPE AVNFVASSIL ILLPRRKGAE VNRTYPDLKA PSTSLYLNLP SSVVPSQPLD LGSAITAVGD QDEEEAEQLK GGLLVVACKL VDKFAGLYLD SEAFVELMSP VKQVLEQSRA KKLSSELNMV LTSTLTALSK RLSNTLSTRR PLTLQSHKPI PIASYAPRFE ENFAPGKHYD PDTERNASAK LKALYKKERK GAMRELRKDN RFLAGEKARE QGEKDKEYNA RMRKAEGSIT VERAEEKAME REKAKEKRRA GRD
|
| |
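The sequences above are printed in space-separated groups of ten, so they need light cleaning before use. The following is a minimal sketch that strips the grouping, computes a GC fraction, and translates the first 30 bases of the gene sequence. Because the Translation table field is blank in this record, the standard genetic code (NCBI table 1) is an assumption here. Only the leading 30 bases are translated, since the gene is spliced and the genomic sequence cannot be translated end to end. The helper names are illustrative.

```python
# First three groups of the "Gene sequence" field, copied verbatim.
grouped = "ATGGCACCCT CACAGCTCGC ACAGCTCAAG"

# Standard genetic code (NCBI table 1), ASSUMED: the record leaves
# "Translation table" blank. Codons are ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(sequence: str) -> str:
    """Remove whitespace used for display grouping and upper-case the bases."""
    return "".join(sequence.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in sequence) / len(sequence)


def translate(sequence: str) -> str:
    """Translate complete codons in frame 0; unknown codons become 'X'."""
    usable = len(sequence) - len(sequence) % 3
    return "".join(
        CODON_TABLE.get(sequence[i:i + 3], "X") for i in range(0, usable, 3)
    )


seq = clean(grouped)
print(len(seq))          # 30
print(gc_fraction(seq))  # 0.6 for this 30-base prefix only
print(translate(seq))    # MAPSQLAQLK
```

The translated prefix agrees with the first group of the Protein sequence field (MAPSQLAQLK). The 0.6 GC fraction describes only these 30 bases and is not comparable to the gene-wide "GC content | 50%" value.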