Gene Information |
Locus tag | CNL03980 |
Symbol | |
ID | 3254739 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | + |
Start bp | 91855 |
End bp | 93112 |
Gene Length | 1258 bp |
Protein Length | 315 aa |
Translation table | |
GC content | 47% |
IMG OID | 638253870 |
Product | conserved hypothetical protein |
Protein accession | XP_567954 |
Protein GI | 58261088 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5160] Protease, Ulp1 family |
TIGRFAM ID | |
|
|
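The coordinates and lengths listed above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive coordinates and that the gene span includes the stop codon but no untranslated regions. The function names are illustrative and not part of any database API. Under those assumptions, the bases not accounted for by the coding sequence would be intronic, which is consistent with the record marking the gene as spliced.

```python
def gene_span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of coding bases implied by a protein length, optionally counting the stop codon."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values taken from the Gene Information table.
start_bp, end_bp, protein_aa = 91855, 93112, 315

span = gene_span_length(start_bp, end_bp)
cds = coding_length(protein_aa)
non_coding = span - cds  # intronic bases, if the span holds only exons and introns

print(span)        # 1258, matching the listed Gene Length
print(cds)         # 948
print(non_coding)  # 310
```

The 310 bp figure is only an estimate of total intron length and depends on the assumptions stated above.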
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.401732 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTCTCAG GGAACGTAGG TGCTTTAAAT CGGAGGAGAA CGCTTAGAAA AGGTGGGTAT GAAAGGATTC GTTGGTGGAT AAAGGTACTG ACAATGGATG GAAAGTCGAT GATAGGCTAG CGGGTATACG GACGGTTCTG CGCAACGATC AAGCCACCAA GTCATTTGAC GAAGTTGTAC AATCAAAACC GCAGCAAGAG GTTTACAAAC CTAAACAAAA ACGGGAAATA GGCGTCAAAG CCCAAGAGCA AGCCTCCAAG TCGTGCGTCC AAACCCGCCA CCTCCCCGCA AAGCTAATAA CCTACCGTAT AGATCCAGTT TTGAATTCGT CTCCATTCTC AAGAACCTCA AAGCACTGCA GCTCGCCAAA GAAAAGGCCC TCAAACCTTC CGTGCCATCC AAACTGTCTC CTCAACAAGA ATCCAAAGTT GACGCACACC TTCGAAATCC CAAATTCAAA GTCACTCTCA ACGTCTCTGA AGTGGAAGCT GGAAGTCTCA GGAGGCTCAA GCCTAGTACG TGGTTGGATG ATGAGGTGAT GAACGCGTAC TGCGATTTGA TGTGTAGTCG GTTCAAGGAT GGGAAGGCGG GGAGAAAAGT TCATTCTTTG AATTCCTTTT TCTATGGCAA GCTTGTGGAT CAGGGGTACG CCGCTGGACG GTTGAAGCGA TGGACTAAAA AAGTGAGCTT GTGCCCTATG CTCGTCCTGT CCATCCCGCT AATCCTGGCA CGCTATTTCA AGATCGATAT CTTCTCGCTC GATGTTCTCA TCTTCCCTAT CAACCAAGGT AACATGCACT GGACCGCATG TGCCATTAAT TTTGCCAAGA AACGGATAGA GTACTACGAC TCGATGGGAG ATTATGGGAA TGCGAGGAAA CAAGTGTTTA GAAAAGTGAG AGGATATGTG GAGGCTGAAC ACAAGGAAAA GAAAGGAAGG GCAATGGATT GGGAAGGATG GCATGATTAC TTCAACAAGG TGTGTATCGC CAAATCTTAC CAATCTCATG CGCCATCAGA CTCATGTTTT TTTCTTTTCT TTCTTTCTTT CTTTTAGAAC ACACCACAAC AGAATAACGG TTCAGACTGT GGCGTCTTTT CATGCCAAAC ATTAGAGATG ATCACTCGCG GTCGGGATAT TGTCACCCAG GGTTTCGAGT TTACTGCGAA GGACATGCCG TTCATGAGGA GAATGATGAT TTATGAGATT GGGGAAGGCA AATTAGAGAA GAGGACCTGG GGTTCGCCTG CGTTATAG
|
Protein sequence | MFSGNVGALN RRRTLRKATK SFDEVVQSKP QQEVYKPKQK REIGVKAQEQ ASKSFEFVSI LKNLKALQLA KEKALKPSVP SKLSPQQESK VDAHLRNPKF KVTLNVSEVE AGSLRRLKPS TWLDDEVMNA YCDLMCSRFK DGKAGRKVHS LNSFFYGKLV DQGYAAGRLK RWTKKIDIFS LDVLIFPINQ GNMHWTACAI NFAKKRIEYY DSMGDYGNAR KQVFRKVRGY VEAEHKEKKG RAMDWEGWHD YFNKNNGSDC GVFSCQTLEM ITRGRDIVTQ GFEFTAKDMP FMRRMMIYEI GEGKLEKRTW GSPAL
|
| |
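The GC content field can be recomputed from the gene sequence shown above. This is a minimal sketch, assuming the sequence is pasted as displayed (space-separated blocks of ten bases) and that GC content means the fraction of G and C among all bases. The helper names are hypothetical, and how the database rounds its reported percentage is not stated in the record.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace between ten-base blocks and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First two blocks of the gene sequence, used here only as a short demonstration.
prefix = "ATGTTCTCAG GGAACGTAGG"

print(len(clean_sequence(prefix)))   # 20
print(f"{gc_content(prefix):.0%}")   # 50%
```

Passing the full Gene sequence string to `gc_content` gives the value to compare against the listed GC content, and `len(clean_sequence(...))` gives the value to compare against the listed Gene Length.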