Gene Information |
Locus tag | CNB05310 |
Symbol | |
ID | 3255752 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1501247 |
End bp | 1504454 |
Gene Length | 3208 bp |
Protein Length | 912 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255173 |
Product | hypothetical protein |
Protein accession | XP_569040 |
Protein GI | 58263260 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0349] Ribonuclease D |
TIGRFAM ID | |
|
|
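The Gene Length field agrees with the Start bp and End bp fields if the coordinates are read as 1-based and inclusive. The record does not state this convention, so it is an assumption here, though it matches the listed values. A minimal sketch of that arithmetic, where the function name `gene_length` is hypothetical:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    # Both endpoints are counted, hence the +1.
    return end_bp - start_bp + 1


# Coordinates of CNB05310 on NC_006684, taken from the record above.
print(gene_length(1501247, 1504454))  # 3208
```

Because the gene is listed as spliced, this span covers more than the coding sequence alone, so it should not be expected to equal three times the protein length.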
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.153123 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid hitchhiking | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TATCCCATCG CACATTCATC GGTAGTTGCC GTGCGATGGC AAGCCAAAGG GTACAGTACG CGTCGTCTAG TGTTTCCCGT ACAAAAGCCT TACAGGCGCG ACCCATCTTA AACGACGCAG AGCTCGGCAT CGGAATTTGG CAGAAAAAGA CGGCTTTCTC AGACCAAAGA CCAATAACGT AAGTGGTTGG AAGAATCTGC AAATATTGTT GGCTGCTGAT CCATGACGCT TTTTTTCGTC GTGTAGTCCA TCTAGGCAGC TACCTGACGG TCGACAGACA CCCATAAGCA TCAGCAGTCA ATCCTCAGGT CCTCCGACAC CAATCTCCGT CTCATCTTCT TCACCGTCTC CGCTTGTGAA TAGGCCTCGA GCTGTGCCCC GGTTTAACTC ATCTATCGCT CGCACCGCTT TCACATCTCC AAGAGCAAGC CTTGGCCGGA AGACTCCTTC GACCGAGAAC AGTAAACCGC TTCATCCGTT CTTTTTAAAC AATAGGAGGA CCAATATCCC TACACCGACC AAGCCACGAC GCGTAGCCAA TTACACCAGT ACTGAAGCGG AGAGCTCAAG CAGCCAGCAA ACTTCAAGCA GTCAAAGTAC ATCAGAATCC GTATCTTCAA GTCAACGAAA TCAACTCGCC GAAGAAGAAG ATATTATCAA GCTTACGCAA AGTGTCGAGA AGATGAGAGT CGCTCCATCA CCTAAAACCC AGCCACCAGG TCCAAACACA AGCAGGTTGT TCGTGCGTCG ACCTCTTGTC AAGGCAGAAA GTGTACCCGA GCGCCAATTG ACATTTGGGC GACTGCCGGT CAATGGCAAC AAATCAAGTC TAGTTGATGA GCCAGCCGCC ATTCCTTTAG TACCAGATAA CCATGAGCCG GAGGCTCCAC CTGTTTATGT GTTCAATCTT CCCGATAATC CGGATCCCTC GTTACCACTC TTCGACTACA AGACTTACCC TCGACCTCCT ATGGTGGTGT ACACCCGGTC AATGTCGGAA GCGGAAGATC TGGTTGCTTG TCTGAAAGGA CCCATTTTAG GATTCGATCT GGAATGGGCA ACGTCGTATA ACAAGGTGTG GGATGCGAGC ACAGGGAGGT ACGATTTCCA GCAGTACCCC ACTGCTTTGG TGCAATTATG TGATGAGAAG ATGATCGTCC TGATACATCT TCAAGATAAA ATGGGTCAGT TATTCCGACC TTCGTACCTC AGGTGTCCCA CTTTTTACTG ATAGTACAAT TGTAGACCTT CCGGCAAAGG TCGCTGAACT TGTACGTGAC CCTAAAATCT ATAAACTCGG CGTCCAATCT ATGGGTGACG GCCGGAAACT CGTCCGCGAT TTTCCCCATC ACTTCCGACA AGGTGGACCC GCCGGGCTGT ATGAACTATC TCGGATGGCG CACGCTATAG ATCCACAAAG AGCTGGCCAT GGATCAAGGT TGATCAAACT TGCGACGCTT TGTAGAGCGT ATCTGGGGAA GGAGTTGGAT AAGGATACGA AAATTAGAAG GGGCGATTGG GCAGGAGAGC TGAATGAGGT GCAGAAAGCG TGTAAGTCGT CTAGCCTGAT TTTTCAGTTA TAGCTTCTGT GGCTGACATT CAGCTACATT ATAGACGCCG CCAACGACGT TTTTGTCTCT ATACAAATAT TCAACGCCCT CAGAAAGCTC GCCGAAGAGA GGAACGTCCC TCTTGACTTT GATAGTTGGT CATCCTCTGT CATATGTCAG CCTGAAAGAA TAGCTCCATT AGCCACTTCT GTGACGATGG CAGCAGCTAC TGGAGAGAGC GTATCTGTCA 
ATAGAGTTTT GAAACCGGCT GTTACCATGC CCTGTGGGTT ACCGTCACAA GCTATGGAGC CAGGGAGGAT ATCAAATCAA AGGCAGAGTA GCCAATCGCA GGCTCAGCCG CCGGTTGCTC TTCAAACGGC GACGACAGCA AATCATGCTC AAAGCCAATC CCAAGCACGA CCTGATGCTC AAGCTGTGGT TCAGTCACAA AATCAGCCCC AGTCACAACA ACCTACGACA CAGCTACAAT CCACGTGCGC GTCACCCAAT GTTGGGAGAC ATGTCCAAAT CACAGGGCCT GAATATTCTA ATCAAAAACC CGTATCTGTT AAACAAGCCG TAACTGCGCC ACTTACTCAG TTGTCATGCC CAGCTTTATC TCAGCTTCCG TCCACAGCCT CCACACCAAC CCAAGCGCCT GCTCAAAATC AACCCTTGCC CCACGCTCAG CCTTCGACCC GTATCATTCA TTCCAACATC CGTCCCTGTG CAAACATAAC TGGCAATGCT CATCCCTTTT TGTACGGCGC TGACGAAGAA GACGATGAAT TCGAATCCTA TGACTCTACT TATGGGTCCG TGGGTTCCTT CCAAATGGGT ACGTCTAACC CTATGTATGC GGCTATTCAG CAGCACTTAC CGCATCTCCT GCCTCCTACA GGATCTTCTA ATGCTGCCAG AGAGAGAGCG AGGCAGATGG GGCAAAGAAC GCCAGCGCAG ATACTATCAG CCATGGGTCC GGGTCCGTCG AGGTCAAAAA GTCGGACGAG CACAGGTTCA AATTCTAGCT CGTGGGGAAG AAGCAAGGAT GGTGGAGCTA GAAACGGGGG AAGCGTTGTC CTTCGAGGGA AGGTGATATA TACCCCATCT GGAGTGAAAC CTCCTCCGCT AGCAAAGATG AAATCGCTCA CCGCGTTTTT GGAGGGAAAG ACGTTTGAGC GGATTGCGGT AGAGAAAAGT ATTAAGCTTA ACACTTGCCA GTGAGTGTTC ACACCTCGGA CTACAAATGC TAGGCATAAA ACTAACGCAT TACTAATTGC AGGGGCTATG TCATCGAGGC AATATCTACG CTTGGGACTA AACACTTTGA AAATGCAGCA CTCGAGAGAA TGTGGCAATT GGCCACACCT GATATTTGGA TTTTTCACAG CAGCCAGCAA CTAATGCAAG AGATGATACA GATGTTTGGG GAGCATCCAC GGAAAGAGGA GATTGAGAGA TTGGCAAAAG AAAGGGAGGT GAAATTGAAG GAATCGGGTC GATGGAGAGA AAGCGGTGCC ATGAGTTCCT CTCATCCTTG ATAGAATATC TATTAGATAT TCAAAAGTTG ATTAAAGCTG CTTACTACGA CAGTTGTATA TTTACATATT TACTTGGGAA CCATTTGAAA TAATGGCTGT GTATTACTGT CTACCATA
|
Protein sequence | MASQRVQYAS SSVSRTKALQ ARPILNDAEL GIGIWQKKTA FSDQRPITPS RQLPDGRQTP ISISSQSSGP PTPISVSSSS PSPLVNRPRA VPRFNSSIAR TAFTSPRASL GRKTPSTENS KPLHPFFLNN RRTNIPTPTK PRRVANYTST EAESSSSQQT SSSQSTSESV SSSQRNQLAE EEDIIKLTQS VEKMRVAPSP KTQPPGPNTS RLFVRRPLVK AESVPERQLT FGRLPVNGNK SSLVDEPAAI PLVPDNHEPE APPVYVFNLP DNPDPSLPLF DYKTYPRPPM VVYTRSMSEA EDLVACLKGP ILGFDLEWAT SYNKVWDAST GRYDFQQYPT ALVQLCDEKM IVLIHLQDKM DLPAKVAELV RDPKIYKLGV QSMGDGRKLV RDFPHHFRQG GPAGLYELSR MAHAIDPQRA GHGSRLIKLA TLCRAYLGKE LDKDTKIRRG DWAGELNEVQ KAYAANDVFV SIQIFNALRK LAEERNVPLD FDSWSSSVIC QPERIAPLAT SVTMAAATGE SVSVNRVLKP AVTMPCGLPS QAMEPGRISN QRQSSQSQAQ PPVALQTATT ANHAQSQSQA RPDAQAVVQS QNQPQSQQPT TQLQSTCASP NVGRHVQITG PEYSNQKPVS VKQAVTAPLT QLSCPALSQL PSTASTPTQA PAQNQPLPHA QPSTRIIHSN IRPCANITGN AHPFLYGADE EDDEFESYDS TYGSVGSFQM GSSNAARERA RQMGQRTPAQ ILSAMGPGPS RSKSRTSTGS NSSSWGRSKD GGARNGGSVV LRGKVIYTPS GVKPPPLAKM KSLTAFLEGK TFERIAVEKS IKLNTCQGYV IEAISTLGTK HFENAALERM WQLATPDIWI FHSSQQLMQE MIQMFGEHPR KEEIERLAKE REVKLKESGR WRESGAMSSS HP
|
| |
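The GC content field can be approximated from the gene sequence as printed, once the spaces between the 10-base blocks are removed. The sketch below is one plausible way to do this; how the database itself computes and rounds the reported 49% is not stated in the record, and the function name `gc_percent` is hypothetical.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide string.

    Whitespace (block separators, line breaks) is ignored; any other
    character counts toward the total length.
    """
    cleaned = "".join(sequence.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First three 10-base blocks of the gene sequence above (not the full gene).
print(gc_percent("TATCCCATCG CACATTCATC GGTAGTTGCC"))  # 50.0
```

Passing the full Gene sequence value, including its line break, to the same function would give the figure to compare against the GC content row.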