Gene Information |
Locus tag | CNN01520 |
Symbol | |
ID | 3255445 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | + |
Start bp | 441032 |
End bp | 442417 |
Gene Length | 1386 bp |
Protein Length | 316 aa |
Translation table | |
GC content | 47% |
IMG OID | 638254567 |
Product | conserved hypothetical protein |
Protein accession | XP_568626 |
Protein GI | 58262432 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG5539] Predicted cysteine protease (OTU family) |
TIGRFAM ID | |
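
As a quick consistency check on the coordinates above, the gene length can be recomputed from Start bp and End bp. This is a minimal sketch that assumes the coordinates are 1-based and inclusive; the record does not state the convention, and the function name `gene_length` is illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, ASSUMING 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the Start bp and End bp fields of the record
print(gene_length(441032, 442417))  # 1386
```

The result matches the reported Gene Length of 1386 bp, which is consistent with the inclusive-coordinate assumption. Because the gene is marked as spliced, this span includes introns, so it is longer than three times the 316 aa protein length.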
| |
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 0.999443 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGGA TAGTAAGCAG TGACTTCCGA AATTGCAACT TTTATCCTTA TTATTACCCT GTAGTTAAGT ATGGTTATCC ACCCAAGCCA CTTCCATCAA CGGCGGGGCC GTTGTCTTCC ATCCCCATTG GCAAGGGAGA GCAGCTTATT GTCAGCTCTA TACCATCGGG TGGCCCTTCA AAGAAGGTAC CAGTTGTTGC CCAACCTTCA ACAACTACTA GCTCAGCTCC TGCTGCTTCA AGGCCTACCA ATGCGTCACC AGTTCTTGCT GCCCCCTTAG TGTCGAATGC TTCTCAAGGC GAAGGTGTAG AAAGCGGCGA GAGCGTTGCC GTACCGGGCA GAGACGCTGG ATACCTGCAG CTGAGGGTTG TACCGGATGA CAACTCTTGT CTTTTCAGTG CTATTGGTAT TGTATTTGAG GGCGGTATCG AGGCAGCTCA GAGGCTGCGT ATGGTGGTGG CCAACGCCAT TAAGGATGAC CCTTTTACAT ACTCCGAGGT TATGCTTGGG TAAGTGATCG GGATCGTATA AGTCAGGATA ACTTTGAGCT AATGTGGATG TAAGCCAACC GATCGATCAG TATGTGAAGC GGATCCAAAA GCCGCAGACA TGGGGAGGAG CTATCGGTAT GTTTCTATAC ATTTCCTGTA CAGATCTCAT GCATGGCATT GTTGACGTTG TTCAGAACTC TCAATATTTG CCAAACAGTA AATCAAAAAA ACCATTACAT ATCAGACTTT GCTTACCAAA AACCACAGCT ACAAGACTGA AATTGCCTCG TTTGACGTAG CAACAGGGCG TTGCGATAGA TTCGGCCAAG ATGAATATGA CACACGGTAC GTTCGTGTAA CTTGGGGTCC GTTCTGGTGG CTCATGGCGA TCTGCTCATT TAGTTGCATT CTCGTCTACT CTGGTATTCG TGAGTCCTAA TTGACCTTAG TCGCCTTGCC ATCTCTGATT TTTTAACCCT CCACAGACTA CGACGCCATC AGTCTATCAC CTCTCCCTGT TTCCCCAGCT TCTTTCCACA CCACAATATT CCCTGTAACT GATCAAATCA TTCTTACTAC TGCGGACAAG CTCGTCTCAC AACTTCGAGC TAGGCATTAT TATACCGACA CTGCAAACTT TGATCTCAGG TGTGCGATAT GCAAGAAGGG TCTGAGAGGA GAAAAAGGTG CGAGAGAACA CGCCATGCAG ACTGGTCGTG AGTTATCTCC TGTCAACTAT TCGTGACCTC AAGCTAATTC CGCCAGATGT CGAGTTTGGC GAGTACTAAG GTTCCATCAA TTATCCTTCC GGCACAATAT ATGAGCGTTA TTCTGTCTTT TATGTAGTGT TAAACTACCA TAATATCCAT CTAATGAAGT ATATCAAATC CGTCTG
|
Protein sequence | MSRIVSSDFR NCNFYPYYYP VVKYGYPPKP LPSTAGPLSS IPIGKGEQLI VSSIPSGGPS KKVPVVAQPS TTTSSAPAAS RPTNASPVLA APLVSNASQG EGVESGESVA VPGRDAGYLQ LRVVPDDNSC LFSAIGIVFE GGIEAAQRLR MVVANAIKDD PFTYSEVMLG QPIDQYVKRI QKPQTWGGAI ELSIFAKHYK TEIASFDVAT GRCDRFGQDE YDTRCILVYS GIHYDAISLS PLPVSPASFH TTIFPVTDQI ILTTADKLVS QLRARHYYTD TANFDLRCAI CKKGLRGEKG AREHAMQTGH VEFGEY
|
| |
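
The sequence fields above are printed in space-separated blocks of ten characters. The following sketch shows one way to turn such a field into a contiguous string and compute its GC percentage. `clean_sequence` and `gc_percent` are hypothetical helper names, and the example input is only the first two blocks of the Gene sequence field, not the full record.

```python
def clean_sequence(field: str) -> str:
    """Remove all whitespace from a block-formatted sequence and upper-case it."""
    return "".join(field.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First two ten-base blocks of the Gene sequence field (illustrative subset)
blocks = "ATGAGCAGGA TAGTAAGCAG"
seq = clean_sequence(blocks)
print(len(seq))         # 20
print(gc_percent(seq))  # 45.0
```

Applying the same two helpers to the whole Gene sequence field would give a value to compare against the reported GC content; how the database rounds that figure is not stated in the record.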