Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF00370 |
Symbol | |
ID | 3258496 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 132949 |
End bp | 136217 |
Gene Length | 3269 bp |
Protein Length | 685 aa |
Translation table | |
GC content | 50% |
IMG OID | 638257160 |
Product | conserved hypothetical protein |
Protein accession | XP_571479 |
Protein GI | 58268646 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0970801 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTAGAAAG GAAAAGAATA TACACCGCCC AGCCACCATC ACCCCCCCAT CTCCTCATCG CCCGCACAGA CGCCGCCCTT ATCTAAGCCA CCACTTCACA CGTCACCCTA TTCTTTTCCG CAGTCTCAAC CACCGACTTT CTCTCACCTG TACAAGTCAT CCAAAGATTC CTCCACGAGA CAGCCTTGAA TCTCTGGCCG CCCTGTCTAT ATTGCTCCGT CGTTGAGTAC GGCTTTGTCT ACGGCCGCTG TCTCTCGCGT GTGGTGTTTT GCCGTGGTGA GTCGCATGCG TCTTTCGAGA CGATGTACAC ATCTTCAACC CTGGATGAAC GCTGACCATA CTCTGCCCTG CCGCCTCCTC CTTTACGGTC CATCCTCACC AATACAATCT CCCATCTGAT TTGGTCCCAT ACCACCATGT GATTGCCATT AAACAACGTG CGTCGAATCA CCAACTCATG TGCCGACGGT TCGCGCGAAA CGCATTTGGA AATATCGTCA GCCGCCACCG TGTTCCTCCC TGTTTTATAT TCATCGATCC AACCAAACTG AGATCGTCTG CCGTTATCAA CCAGACCATC TGTATCAATA CATATTCGTT GTAAATCGGT TTTTTATCTA TCCTTCTTAT CGGCGGTGAG TAATCCTTCA ATTATCTTCA CGTAAGGGGG AATTCTTACA CGGTCTGTAT CATGTCCTAA TTCATCCATT CGTAGACATG ATGGCCGGCT CAGAAACGTC TTCTCCAACC GATTTTGCCT TTGGGCCACC AGCACTCGCA TCAGGAGACG CTGCATTCAC TTGTACGCCT TACAAACCTG AGCCCCTCAA ACCCACCTTT GCATCTCCCA CCGGTGTATC CCCATCTACC GGTTCATTCA CACCCGCAGC CAACGCCGCG CCTGGGCATT CATACTTCCC AGATTCATAC AACTACGGGG TATCATCTGC TTCATCCTTT TCATCAGCAT CTGGTAGCGC CGCTGGCGGA TCGTTTTCAT CGGTCGACAA CCTCCCGGGT ATCTCGAAAA ACTTTGTCAG ACCGAATTTG GCCGATACAA GACGGCCTGC AACAGCCGGA GGAGCTTTAC AAAGTCGTAG TCCATATTCA AACTTTATGA TGAGCAAGAA GAATAGCGTG AGTGGATCAG TGTCACAGGA TGCACAGAAG CTGGGCGAAC GGGGAGATAT GAGAAAACCG GAGGGGACAA TAGAGGAAGG AAATGAGATG CTGGGTCTTG GTGGATCAAC AGAAGAAGAT GGGGATGGGC AAGGTTCTCA GAATGTGTCA CCAGAGAACG AAAATACTGT CAATCCCCAA CTTCTTCCTA CCAACCGACG GTCATCTGCA CCTCAGTATA ACATTCCTGG TTCAAATTGG GGCCAATTCC CACCATCTTC GTCACATGAT GCTGCTAATT CATCTCTCGG TCTCCCTTCT GCGCCTGCAC ACATGCCAAT GGGTTATCTT CAACAGCAAC AAGCAGCGCA CCATAGTCTC AATAGTCAAA ATCGACCACC AGCTTTCGCG GGTCGACCGC AAACGAGTGA TGGTTTGCCC AGTTACACCA ATTACCACGG TGCGGTGACT CTTCCTAGTG CTCAATCTAT CGCTCGTCAA ATCCCAGGCA TGACTGATAC TTCATCTTTT TTCCATCACT CTTCAGGATC TGCCCATTTG CCGTTTAAAG ATGACCGACA TTTCCCCTCT ACGAACTTTC CTGGAGACAG GGCATTTACG TTTGACCCCC TGCGATCTTC TCTCCCACCC TCTTTCCCCG GTCAAATCCG CGCATCATAC GCACCTCCAC CTCCAGTCCC TACTGCCGGC TCCGATTCTG GTGGCTCCTC AAATAATACA AACTCTACCG CTGATCAGAT GCAATTTATG AGTTTGTCAA CCGGTCCTGG GCAGAAAAAG AGGCCTAGAA GAAGGTATGA AGAGATTGAA AGGTTGTACC CTTGTGGTTG GAATGGTTGT GAAAAGTCTT ACGGGACATT GAACCATTTG AACGCGCATG TGATGACACA AAAACATGGC GAGAAGAGGT TGCCTGCTGG TAAGTGTGCA CATGGGCTGT TTTGAGACAA CGTTGACTGA TGAATCTGCA GAGTTCAAGG AGATGAGAAA AGCTTGGAGG AAGAAGAAGA GAGAGCACGC TTCAGCTCAA GCGAGCTCTC AATACATGAC CAACGCTGCT GCCTGGCAAC AAAACTACCA ACGTCTTTCG TTCGCCTCTA CGACATCTGC CCCTGATTCT GACTGGGATC GCCGAGAGTC TTCGACCTCG ACACTCTCTG TGTCAACTGA CGGCCGACCA TCTTTTTCTT ACCCTACCAA CTACTCTTGG GGTGTGCCTC CACAGGGTAT GCCCTTCCAG GCTGGTATGA TGTCCGTCGA CTCTCGTCCT TCCACATCCA GCAGCAATGT CTCAAGCGTT AGCATGGACG GCAGATTCAT ATCTAATCAA CCTGCTTCCG CTTCTGCCTC TGCGATGAGT ATGGGCCCAC CTACATATCT CTACCTTTCT GGACCACCTT CTCTCCCAGG AGGCGTTGTT CCCCGTCATC CTTCAGCTCC TCAGCATCTA CCAATGCCGC CTCCGAACCC CGTAGATGGT TTCCGGCCAG CTGATGGGGA TCATCCAACC CCCACCGTAG GTAATCCTTT CCCGATCGGT CCTTCTGGGG CAGGTGTCGT AGCAGGAGCC GGAGCTGGGA CGGGCGCTGC AGGGCAGGCC GGGAAAGCGT TGAACTTTTC GACGTTGACA AGCCCTATGG AAGGAAGTTC AGGTGATTTT GGGAGTCAGT TTGCCTTCAC TAGATAAGAT GAAAGAAATG GTGGCGTGCT GCGGTGAAGA GAGAAGATGA GACACGGAAA AGGCAGTTGA ACTATCCTAT CTGGATACTC GAAGAGCGTC AATGCCCCCT TCCCCCCCAT ATACCTTGTA ATAAGCAACA GCCTCTTTTA TATCCTTTCG GTTTTACTCG CAACAAACTC CAAAACAATC CAAAGACACC AGGCGTTCTT CGAAAGTTCA TATACACGAC AATATACAAA CATTATTTCA GTTGGTTTTT TTCATTCTCT TTCTCTTTTT GTATCATCTA GCGCTGAAGG ATCAAGGAAA TGGAGGTCTG AGGAAGGGGA AAGAAAAGTA TATGATTAGG AATAGTATGG TCGGCAGTAT GTCTTTGCTC GAGTTGGCAT AATTAGTATT TGAATCGCAT GCGTGATTAG GAAAAGACAG GCATTAATTG TAGATGCGA
|
Protein sequence | MMAGSETSSP TDFAFGPPAL ASGDAAFTCT PYKPEPLKPT FASPTGVSPS TGSFTPAANA APGHSYFPDS YNYGVSSASS FSSASGSAAG GSFSSVDNLP GISKNFVRPN LADTRRPATA GGALQSRSPY SNFMMSKKNS VSGSVSQDAQ KLGERGDMRK PEGTIEEGNE MLGLGGSTEE DGDGQGSQNV SPENENTVNP QLLPTNRRSS APQYNIPGSN WGQFPPSSSH DAANSSLGLP SAPAHMPMGY LQQQQAAHHS LNSQNRPPAF AGRPQTSDGL PSYTNYHGAV TLPSAQSIAR QIPGMTDTSS FFHHSSGSAH LPFKDDRHFP STNFPGDRAF TFDPLRSSLP PSFPGQIRAS YAPPPPVPTA GSDSGGSSNN TNSTADQMQF MSLSTGPGQK KRPRRRYEEI ERLYPCGWNG CEKSYGTLNH LNAHVMTQKH GEKRLPAEFK EMRKAWRKKK REHASAQASS QYMTNAAAWQ QNYQRLSFAS TTSAPDSDWD RRESSTSTLS VSTDGRPSFS YPTNYSWGVP PQGMPFQAGM MSVDSRPSTS SSNVSSVSMD GRFISNQPAS ASASAMSMGP PTYLYLSGPP SLPGGVVPRH PSAPQHLPMP PPNPVDGFRP ADGDHPTPTV GNPFPIGPSG AGVVAGAGAG TGAAGQAGKA LNFSTLTSPM EGSSGDFGSQ FAFTR
|
| |