Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC06820 |
Symbol | |
ID | 3256557 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 1994159 |
End bp | 1996565 |
Gene Length | 2407 bp |
Protein Length | 604 aa |
Translation table | |
GC content | 50% |
IMG OID | 638255902 |
Product | Arylsulfatase precursor, putative |
Protein accession | XP_569926 |
Protein GI | 58265540 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG3119] Arylsulfatase A and related enzymes |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TGTACATAAG GACCTGGCCA TATTTTCCTA TTCAGACAAC CCCGAGGGAC GACTCTCAGA ATGCTAGGAC ACAAAGCGCT CCTCCTTGCT AGTGTAGGCT CGCTTCTGCT CTCTGCGGAC GCTCTCGCTA TCAACAAGAA GCCCAACATC ATCGTAATCC GTGGGTCTCA ATTGCGTGCC TGGCGGTGAT TTCGCTAACG ATTACCAAAA AAAGTTACGG ACGACCAAGA CGTCAGTACG CTTGCAAAGC GTGAATACCT TCCTCGCATC CACGAGCATC TCGTCGACGA GGGTGTCCTC TACGACAACT TCTTTGCGCC CGTATCCATA TGCTGTCCCA GTCGAGTGTC TTTGCTCCGG GCTCAGTACG CTCACAACCA CAACGTCACT TTTGTCAGTG CTCCTTGGGG TGGTTGGGAT GTCTTCAACA AGCTCGGTTA TGTTGGGCAT ACCTTGCCTG ACTTTGTTCA GGCTGCTGGG TACAACACTT ACTACACCGG CAAGTTCATG TAAGCCTTGC ATGCTAACTC CCGAACCAGT AACTAATCCG CTGCATAGGA ACGACCATAC CGACGCCAAT TGCGAGTCCT TGCCAGTTTC TGGGTTCAAC TCTTCCGACA TTCTTGTCGG TGAGTGACTC CCTCATCCAG ACCTTGGGTA AAATAGCTTA CAAGTCGCAG ACCCTTACAC CTATGACTAC TGGACTCCCG GTTTCTCTCG AGACAGTATG TACAATCACG CCTGGACGAA GAAAAAAGAA AACCAATGGC TAATAATTAC CGCAGACGGT CCTGTCAAGG TTCACGCCGG CGAATACTCT ACGGACCTCG TCCACGAAAA GGCAGTCAAC TACCTTGAAG ATGCTCTTCA GGAAGACCGA CCTTTCTACC TTACCGTTGC TCCCGTCGCT TGTCACTCCT GTACGCAATC ACATGACTAC CAAGTACTCC AAGCTAATAC CACTTTCAGG GCTCGACTAT AAGCAGCAGG GTGACATAAA AAAATTCGTT ACCGACATTC CTGCAAGTCA CCCCCGACAT GCCAGGCTTT TCCCTGTCGA GCAGATTGAA AGGACTGAGA ATTGGAACCC TGATGAGCCT TCTGGTGTTT CCTGGGTAAA GGAGCTCCCC AAGCTTGTAA ATATTTGCCC TCGGCGATGT TGAGAACAAA ATGTTGACGG TGCCTAAATA GAACTCTACT GAAGAGGCTT ATCTTGATGA GTTCTTCCGT GGAAGACTTA GGGCGTTGCA AGCTGTCGAC GAGCTAGTGG AGGATATCGT TGACAGGCTT GAAAAGGCCG GCGAGCTCGA TAACACATAT ATTTTCTACA CCGGTACGTG AACCCCAAAA TTACTAGAAA TATATCTGAT CATATACTTC ACAGCGGACA ACGGTTATGC TCTTGGCTCC CATCGCCGTC AGGTAAGTAC TCCCCGCTCC AGTCACAAGT TCGTGCCTGA TCTTCTACTA TCAGCCCGGT AAGACCCTCG GTTTCGAAGA AGACATCCAT GTCCCCTTTA TCGCCCGAGG TCCTGGTATC AAAAAAGGTT TCCGCGACAG TCTCTCATCT TACGGCATGG TCGACCTTTC TCGGACCATC CTCGACATTG CCGGCGCCAA CCCGGATTAC ACGGACGACG GACGAAAGAT CAACCTCCAC CAACATGGGG AAAAGAACCT TGAGCACCAG ATCGCCCGAC ATGCTATCAG CGAATACTGG GTTTTGGGTG CGGATGAGGG CGTGTTTGGC GGGCACACTC GACTGAACAA CAGTAAGTAG TCGGAAGAAC TTCAAATGAT CGTCCTTACA CAAAAAGAGT TTCAAACTAA TAATATGAAG CCTACCGAAC TCTCCGTATC CACGATGAGC ACGACGGCAA GTCTCACTCT CACTCGTATT CCGTATGGTG TACCGGCGAG CGAGAACTCT ACGACCTCGA AAAGGATCCT AAGCAGATTA ACAACCTCCT TTCTCCTCTT AACGAACTCG GCGCTTTCGC TCCTTTCAAC TCTACCGCTT CCAATGGCGA ACCTGTCCTC GTCCGCCATC TCCAACATCT CCTTAACCGA CTTGACGCCG TCCTGCTCGT TCTCAAACGA TGTACCGGCG AAGCATGCCA CAACCCTTAT CGCGAACTCT TCCCTTCATC CCAAGCTACC GGCGGGGAAA TCTTCAAGTT CTCTCAGATT CTTGAAAGCC GATTCGATGA CTTTTTCAAA GACCTGCCCA AGGTTCAGTT TGATAAATGT GCATTAGGAT TCCAGGAGGA GCTGGAGAAG CCTGATTGGA AGAAGGAGTG GGCGTATGGT TCGGATGCGC GTGATGCGTC TATTGGTGGG GGGATTGTTT TCCAGGATTT TTGATGTAGA CGAAGATGGA ATAGAAGTAG TATAGTTGAA GTAGTTT
|
Protein sequence | MLGHKALLLA SVGSLLLSAD ALAINKKPNI IVILTDDQDV STLAKREYLP RIHEHLVDEG VLYDNFFAPV SICCPSRVSL LRAQYAHNHN VTFVSAPWGG WDVFNKLGYV GHTLPDFVQA AGYNTYYTGK FMNDHTDANC ESLPVSGFNS SDILVDPYTY DYWTPGFSRD NGPVKVHAGE YSTDLVHEKA VNYLEDALQE DRPFYLTVAP VACHSWLDYK QQGDIKKFVT DIPASHPRHA RLFPVEQIER TENWNPDEPS GVSWVKELPK LNSTEEAYLD EFFRGRLRAL QAVDELVEDI VDRLEKAGEL DNTYIFYTAD NGYALGSHRR QPGKTLGFEE DIHVPFIARG PGIKKGFRDS LSSYGMVDLS RTILDIAGAN PDYTDDGRKI NLHQHGEKNL EHQIARHAIS EYWVLGADEG VFGGHTRLNN TYRTLRIHDE HDGKSHSHSY SVWCTGEREL YDLEKDPKQI NNLLSPLNEL GAFAPFNSTA SNGEPVLVRH LQHLLNRLDA VLLVLKRCTG EACHNPYREL FPSSQATGGE IFKFSQILES RFDDFFKDLP KVQFDKCALG FQEELEKPDW KKEWAYGSDA RDASIGGGIV FQDF
|
| |