Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB00520 |
Symbol | |
ID | 3255625 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 151427 |
End bp | 154357 |
Gene Length | 2931 bp |
Protein Length | 795 aa |
Translation table | |
GC content | 50% |
IMG OID | 638254705 |
Product | hypothetical protein |
Protein accession | XP_569062 |
Protein GI | 58263304 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0129615 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTTTCCCCCT TCTTTTCTCC CAAAAGAATC ACATAGAATA CGACACAGCC GCAAATACAA CACTCTGCCA AGTTCTGGTA ATGCTGGGCG CCTCCCATCA ATAACAACTA CCTTTCGAGA CCTCAAAAGC CTCACACGTT CGTGGAGTGG CAGCGGATGA TCGGCACAAT GTCAACCTCC CATTTGAATT CTCCTTCAGC TTCCTTTCCA CACCAAGAGC AAGAGCTATT CTCTTTGGAC TTTCTTGCTC TAACTGGGTT GGACGGCTCC ATCTCAGATA CTAACTCGCC ACAAGCCAAT TCAAGTCAAT CGAGTCACCG CCAACACGAT CAGAGCGGCG ATGCCGAGGG TCAGACGAGC AATGAAAGAC AAAACTCAAT TTTTTTCCGA GAACAAAGAA GATCTTCCAA GGATCTCTTG AACTCAATGG AAGTAGATGA ACATACGGGG AATGCTTTGC GGGGCTTGGG GCACAGTAAT GATGGGGAAA ATCAACTCCA AGATTTTGAT TCTTTGCAAG CCGCATTGTT ACAACAGCAA GTGAGTTGTC AAAGAATTAT ATGGTAATAT ACCAAGGCTA ATAACAATTC CAGCTTCAAG CTATTCACAT GCAGTCTCCT TTAGGCTTTG ATATCCAAAA TCCGACCTAC CCGCTTGGGC AATTGCTGGC TTCGCCTGCG TTTGATGAAC TGCATTCATC GCCCAATGGG CACTCTGAGC AACAAACCCT CTCTGTCCAC AATGCTCACC CGAGCTCCCG ATCCGTATAC GACTCGCCTT TATCACATCT TGCGTTCAAT GCTCACGGCC ACCGTAACTC ATTCTCTTCT ATGTCGACGA GGAGTCCCTT AGAGCAGTTG CAAAGGCAGC AGCAGCAGTT TCAAGAGCAA CTTGGATTAC TGCAACGGCA ACAGCTCAAG ATGCAGGCAA CGGCTGCTGC GGTTATGGCA GCCTCCACCT CACCATACAT TGGGCTAAAT GGTCCATCAT CGACAGGTCC TCGACCTTCG GTGACGCCCG GCATGACTCC TTCATCCTCG AACACTGGCA TGTTTTCACC CCTCACTTCT CCAGCTCTTG AAGCCACCAA CTACTCTCAC CAGTCCCATG TTAGCCGTCA CAGCCAACAG TTTTCTCCTG CCTATGGCTC GCAGCATATT GGCACATCCG GTATCCTCAA CACTGCTCTA TCTTCCCCCG CCCTCAATCC CATTGGTTCT ACAGGAGGTG CCAATCAAAC CCTTTCGCCT GCTCTCAACC CTCAAAATGA AGTGAACAGG GGTGACTCCG AATATCTTCA TGCCTTCATG GGTATGCTCG ATAGCACCAA CAGTGGGAAC AGCACACCTG GTGGTGAACC TCCACAACCG AGCTATCAGT CACCTTCCAT GACAAGCGCT TCCACAGCTG GCAATTCTAC CATAATATCA TCCCCAGCTC TCTATCCTCA GGGTGCCGGT ACCGGTCCTC ACAGACAATC CCTTCCTTTC AAATCACGCC CTTCGCCGAT GCTCAAACCC ACGCATCACC GATCGCACCA CCGCAACTCT GGCTCTGGCA ATGTCTCCAT TCCTTCCTCA CCAGCAATCC AAAAGTATCA TCCTGACGCA TCTATGCCAC CTGCTGCTAT GAACTCAGGT CTGCCTCCGC CGGCAATCGA ACACCGACAG ATACAATCCA ATCTTTCTGT CTCATCGACC TCTACTCCTT CCCCTGTCGA TCTCAGCCAT ATTATGCCAC CACCACCGGT GCCGACTGGT AAACCCAAGG CACGGAAGGG TGTCTTACCC ATGACTCCAG CTAGTCTAAT GAACCTTGGT TCCGTGGAGA AGCATGGATC TCAGTCTGTA CCGCTACCAA AGTCTCAGAC TTCGAGCGAG TCAAACTCAT CGATTGGTAC AGTCACAGCT GCTACATCTT CTGGAAGTAC AAGCAAGCCG GCTGCCGGGA AGAAAAAAAC GGGTGGTCAA GTGGGGAAGA AGACGGCAGG AAGTAAGCTT GTACCGGTGG GAACCACTAA AAGAACTTTG GCTATGCGAC CTCAGACAAC TGTTGGTGTA CGATCAGGTA AGTCACTTCA ATCCTCCGTC AACTTCCTGC ATCTGACTGA CATACATATT ATAGCTACTA AAGCAGCAGC CGCCGCTGCT GCTGCTGCCG CCATCGCCCC GGCCGAACCC GAAAACCGCA AAATATCTCA CAAAGCCGCG GAACAAAAGC GCCGAGATTC TCTCAAAGCC GGTTTCGACG AACTCCGTCT CTTACTTCCA CCCATTAACA CTGAAGCTCT AGACCCATTA TCCGGCGAGC CTATCCCAGG CTCTTCAGCA CCGAGGTTAT TACCCAAGTC TTCTCTTGTA CCAGATGATA ACCCTAATCG GGGCGTAAGC AAAGTCGCGC TTTTGAGGTT TGGGAATGAA TATATCGGTA AACTGCAAGA AAGGGTGGAT AGGAGGGATT TGTACATCGA GAAGCTGAGA GAGGAAGTTA AGCGGTTAAG AGAAGGAGGG GAAGAAGAAG ACGTGACGTT GGATAATGGC GAGGATCTTT TGGAGTACGA CTGGAGAGAA GGCGAAGAGG ATGAGTTTGG AGAATGCAAT GGCGATGACT ATAATGAAGA TGAGAAGGAA GCGGGGGAGG GGGATGAGGG ATGATATGGA TTTGGAGGAT GATGGCGGAT AGCAGGTCAA GACGAAGGGG GCTAAATGCT TTTCAACAGA GCTCGGCGCT GAAGACGATC GAGTCCAACT TGACGAAAGT CAATGGTTTG GAGGCGGGGA ATATCATTCC GAGGATGAAG GACAACCAGG AGTCAAGGAC TAGGACAATC ACAACATTTT TTTGGAGCGG ATTTGCTTTA GAATGTTGAA AATATATATA ATTAGTACAC GTTGGATTAG AGAAGGCATC GCTGTACGTA TAGCAATTAG A
|
Protein sequence | MIGTMSTSHL NSPSASFPHQ EQELFSLDFL ALTGLDGSIS DTNSPQANSS QSSHRQHDQS GDAEGQTSNE RQNSIFFREQ RRSSKDLLNS MEVDEHTGNA LRGLGHSNDG ENQLQDFDSL QAALLQQQLQ AIHMQSPLGF DIQNPTYPLG QLLASPAFDE LHSSPNGHSE QQTLSVHNAH PSSRSVYDSP LSHLAFNAHG HRNSFSSMST RSPLEQLQRQ QQQFQEQLGL LQRQQLKMQA TAAAVMAAST SPYIGLNGPS STGPRPSVTP GMTPSSSNTG MFSPLTSPAL EATNYSHQSH VSRHSQQFSP AYGSQHIGTS GILNTALSSP ALNPIGSTGG ANQTLSPALN PQNEVNRGDS EYLHAFMGML DSTNSGNSTP GGEPPQPSYQ SPSMTSASTA GNSTIISSPA LYPQGAGTGP HRQSLPFKSR PSPMLKPTHH RSHHRNSGSG NVSIPSSPAI QKYHPDASMP PAAMNSGLPP PAIEHRQIQS NLSVSSTSTP SPVDLSHIMP PPPVPTGKPK ARKGVLPMTP ASLMNLGSVE KHGSQSVPLP KSQTSSESNS SIGTVTAATS SGSTSKPAAG KKKTGGQVGK KTAGSKLVPV GTTKRTLAMR PQTTVGVRSA TKAAAAAAAA AAIAPAEPEN RKISHKAAEQ KRRDSLKAGF DELRLLLPPI NTEALDPLSG EPIPGSSAPR LLPKSSLVPD DNPNRGVSKV ALLRFGNEYI GKLQERVDRR DLYIEKLREE VKRLREGGEE EDVTLDNGED LLEYDWREGE EDEFGECNGD DYNEDEKEAG EGDEG
|
| |