Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNN01920 |
Symbol | |
ID | 3255326 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | + |
Start bp | 553618 |
End bp | 556977 |
Gene Length | 3360 bp |
Protein Length | 999 aa |
Translation table | |
GC content | 51% |
IMG OID | 638254610 |
Product | hypothetical protein |
Protein accession | XP_568670 |
Protein GI | 58262520 |
COG category | [R] General function prediction only |
COG ID | [COG0724] RNA-binding proteins (RRM domain) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.446711 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCAGAGG AAGACTTCAT CCCTCTCCAA GGTTCAGCTG GGAAGAAAGA ACGCAAGGAG CGTGTCAAGT ACGTACATCG CATACCCATA CAACCCTAGC TAACTCGACC ATCTAAAAGC ACAACCCTTT TCGTCTCCTC CCTCCCTTAT ACAGCCACCA CCACCGACCT TCTCACCCAC TTCTCATACA TCGGTCCTGT ACGACATGGT TTCGTCGCTA CCGACAGAGA ATCAGGCAAG TCAAAAGGTG TCGGTTACGT GACTTTTTCG TTGAAAGAGG ATGCCGATCG GGCAATTCAA GAGTTGGACG GAGGTTCTTT CGGTGGCAGT AAGAGGAAGA TCCAAGTGAA GTGGGCTGAT GAAAGGGTGA GTGCCGTTTT TTTTTGATAT GACATGTGCA AAAACTAATG GTTGGGGGAA CGTTACAAGG CGTCTCTGAA AGACCGAAAA GCCGAGATCA AGGTTTCAAA ACCCATCCCT GGACAGACAG ACAACAAGTC GACGGACCCC AAGGCTATCC AAACTCTTGT TCTCACTGGG TTACCATCTG ATATCACCAA GAATGTGCTT TGGAAGAAGA TTAGAAAGGT CAATGACAAG GCCGAGTTGG TGTTCCCTGT CGAGGCTCAA GAAAACGAGG AGGAGGCTCC CAAGGATACT GGTACGTTCT TCTCTGAAAC TTTTCAAACG TCTAGGAGCT AATCATATTA TATAAAGCCC ACATCGTTTT CCCCTCCCAC GGTGATGCTC TCAAAGCTCT TCCCAAGCTT CACGGCCATA CGTACAAGGG TAACATCTTG TCTTGTGTCC TCAAGAAGCG TCTTGAAAAA CTTTCCGCCA AAGGAGAAGG CAAAGCTCCC AGCCACGCTG GTAGACTGAT CATTAGAAAT TTGTCTTGGG ACGTAAGTCG CAGTACTCTT TTTATTTTCG GGAAATTACT AAAGAAGAAT ACTACAGACC ACTATCCAAG ATCTCCGAAA AGCTTTCCTC CCTTACGGTC CTATCCATTC TATCGATCTC CCTACCCTTC CCTCCAAACT CCCTCCTTCA TCCGACCCTG CCAAACCGCC GCCCCCTCCG CGTGCGCGTG GCTTCGCATT TGTCTGGTTC TTGGCCCGAC ACGACGCTGA AAAGGCTATT GAAGGCACCA ACGGTAAACC AATCAAGAAG GGTCCTGATG GTGAGGGTCG AGTGGTAGCT GTTGACTGGG CGTTGAGTAA GGAAAAGTGG CAAGAGGCGA CTAAAGGAGA GGAGAAGAAG GAAGGGGAGA AGGAAAGTTC TGATTCTGGG TCCGATTCTG GATCCGAGTC TGAGTCTAAT TCGGAATCTG GCGAGGGATC TGATGAAGAA TCATCAGGTG ATGAAGGTAC TTCCGTAGTT AGCGGCTCAG AAGAAAGCAG CAACACGGAT GAAAATGAAG AGGAGGAGGA GGAGGAGGAA GAAGAACCTG TCAAGCCTAC CCTCCCCACT GTCGATGTCG GCAGTACCCT CTTCATCCGT AACCTCCCCT TCGAAACTAC CGAACTTGAG CTTAACACCC TCTTCCGTTC TTTTGGGCCC TTGCGATATG CCAAAATTAC CATTGACAAG GCAACTGGAC GATCCAGGGG AACTGGCTTC GTTTGTTTCT GGAAGAATGA GCATGCCGAT GAAGTTATTG AAGAGGCGCA GAGAGTCGCG ATGGAAACTG GTGCCAATTC TATCCCTGTA TGCTTATTTT TTCTCGCATT ATCCAACAGA ACTACTGCTC ACACACGCAC TACAGCTTGG TGGCGCAGCC CCCAAGAATC CCTTCGCCCT CCCTTCCCTC CTTACCGCTG ACCCTTCCTC TTCTCTCGCT TCCCGTCTTG TCCTTCACGG GCGAACCCTC GACATCACCC GTGCCGTCAC CCGAGAAACC GCCTCCCAAA TGAAGGAAGA CACTGAACGT CTCCGTAACG CCGCAGACAA GCGAAACACC TACCTCATGC GCGAAGGTGT CATCTTCCCT AACTCTCCAG CCGCCGAGGG TCTGCCAGAG AGCGAGATTG AAAAACGTCA GGCGAGCTTC AACTCTCGTA AAGCCTTGTT GCGAGGTAAC CCTTCGCTGT ACATCTCCAA AACACGTCTA TCTATCCGAC AGTTGCCTCT CTTCGCGACT GATAGGACGC TCAAGCGATT GGCTATCCAT GCAGTCAAAG CTTTCGACAA GGAAGTTGCT GATGGTGAGC GAGAAGGTTT GGCTAGGGCA GAAGAGATGG ACGGTACCAT GAGTTCTGCT ATTGCCGCTC GTGGCGACAA GTCAGGCGGT AAGGGTAAGG GTAAAGGAGG CAAGAAGGAG CGTGAGACAG CTGTCATCCA ATCCAAGATT GTCCGCCAAA CTGAAAAGCT CGATCCCGTC ACAGGCCAAG GCAGGTCGAA GGGTTATGGT TTCCTTGAAA TGCGTTCCCA CAAGGATGCA CTTAAGGTGC TTCGATGGGC GAACAATAAC CCTGAAGTGG GTCCATTGAT GTGGGAGTGG TGGAAGGTTG AGCTCGGAGA TATGAAGGAG CGAGTGGAAA AGGCTTTGAC GGATGCTAGG AAGAGGGAGG AAGAGCCACA GAAGGAAACC GCCAAGGAGA GTAAGAAGGG ATTGGAAAGC GTTGAGGAGT TAGAGTCGAG GTTGAAGAAG CTTGATAGCA GATTGGAAGA AGGCGATTCA AGAAGCGGTG GTGGAATGAG AGGTGGAAAG ACTCTTATCA TCGAGTTCAG TATTGAGAAC GTCCAGGTAA GTCACTTTTG AATATCCTTT TGCAAGTCAA GCCAGTGACT GATTCACCGT GATGTGTAGG TTGTGAAGCG CCGAGTAGAA AAGATCACCA CGGCGAGAGA GAACGGGAAG CGCAAGGCGG ATATAATTGC AGCTGAAGAC ACCGATGACG ACGAATCTGC CTCTTCCAAG CCCGCTGGCC GATTTGACAA GCGCGCCAAG CGCGACGGAC GTGATGGTCG TGATCGTCGC GACAGGCATG ATGGTCCCGG TGGTCGTGAG GGGTTAAAAG GCAACTTTAG AGGCAAGCAA GGTCAAGATC GAGAGCAAGG CGGCAAGTTT GATAAAGACA GGCCTAGGGG AGGTCAGAAT GGCCCAAGAC GGGATTATAA CGATCGTGGA GGTCGAAATG AACGAGGTGG TCGCAACGAC CGTGGTGACA GACCTGGTCA AGTCCAGCCC CGACAACCTG GGAGGAGGGA CCAAAAAAGT GTGAACCAGG TGAAGAAGGA TGTCAAGACC AAGGAAGCTG GGGAGAAGAG AGGTATTGAG AAGTTCGGAA GTCAACTGGG TAGCATGATT GGGAGGAAGA GGAAGATGCG CAAGGGTGCA AAGTAGGTGG TGATTGTCCT
|
Protein sequence | MAEEDFIPLQ GSAGKKERKE RVNTTLFVSS LPYTATTTDL LTHFSYIGPV RHGFVATDRE SGKSKGVGYV TFSLKEDADR AIQELDGGSF GGSKRKIQVK WADERASLKD RKAEIKVSKP IPGQTDNKST DPKAIQTLVL TGLPSDITKN VLWKKIRKVN DKAELVFPVE AQENEEEAPK DTAHIVFPSH GDALKALPKL HGHTYKGNIL SCVLKKRLEK LSAKGEGKAP SHAGRLIIRN LSWDTTIQDL RKAFLPYGPI HSIDLPTLPS KLPPSSDPAK PPPPPRARGF AFVWFLARHD AEKAIEGTNG KPIKKGPDGE GRVVAVDWAL SKEKWQEATK GEEKKEGEKE SSDSGSDSGS ESESNSESGE GSDEESSGDE GTSVVSGSEE SSNTDENEEE EEEEEEEPVK PTLPTVDVGS TLFIRNLPFE TTELELNTLF RSFGPLRYAK ITIDKATGRS RGTGFVCFWK NEHADEVIEE AQRVAMETGA NSIPLGGAAP KNPFALPSLL TADPSSSLAS RLVLHGRTLD ITRAVTRETA SQMKEDTERL RNAADKRNTY LMREGVIFPN SPAAEGLPES EIEKRQASFN SRKALLRGNP SLYISKTRLS IRQLPLFATD RTLKRLAIHA VKAFDKEVAD GEREGLARAE EMDGTMSSAI AARGDKSGGK GKGKGGKKER ETAVIQSKIV RQTEKLDPVT GQGRSKGYGF LEMRSHKDAL KVLRWANNNP EVGPLMWEWW KVELGDMKER VEKALTDARK REEEPQKETA KESKKGLESV EELESRLKKL DSRLEEGDSR SGGGMRGGKT LIIEFSIENV QVVKRRVEKI TTARENGKRK ADIIAAEDTD DDESASSKPA GRFDKRAKRD GRDGRDRRDR HDGPGGREGL KGNFRGKQGQ DREQGGKFDK DRPRGGQNGP RRDYNDRGGR NERGGRNDRG DRPGQVQPRQ PGRRDQKSVN QVKKDVKTKE AGEKRGIEKF GSQLGSMIGR KRKMRKGAK
|
| |