Gene Information |
Locus tag | CNC02180 |
Symbol | |
ID | 3256623 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 618841 |
End bp | 620495 |
Gene Length | 1655 bp |
Protein Length | 388 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255439 |
Product | sulfonate dioxygenase, putative |
Protein accession | XP_569491 |
Protein GI | 58264670 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.450194 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTATCG CCGTCAACGA CAACGTAGAC AAAGACACCC TGGGCAGACC TTTCTCTCTC GCCAAGAGCA CTCGAGGTTG GTTTTGATAC TCTATTAAAT AAGCTTCTCG CTGACCGAGA TACTTAATCC TCTTTTAGAG CGTTTGGTTC AAGCCGGTAT TGACCTTTCC AAGGGTTACC CAGAGTACCC AATCAGGCCC AAGACTATCG AACTCGCCAG TGACATTCGC AAGGAAGTGG GTATTGTTTT TTTCTTTTTC GCTTTTGGGA TGGGAATTGC AATTTAGTGA ACTAACGTCT ATTACTCAGG GATGGGAGCA CAAGGACCCC GGAGCCAGGG CTGACAAGGA AAAGAAAGCA TTGTTTGGCG CCGCCAAAGA AGTTATCCAC TTGTCCCCTC ACCTTGGTAC CGAGTAAGAT TTGATCTACA CTTCTTCTTT GTCTTTGTCA CTGACACTTG TCACAGGATA GTCGGTCTTC AACTTAACCA GCTTAGTGAC CAGCAGAAAG ATGAGCTTGC GCTCCTGATT GCTGAGCGCA CTGTGGTATT CTTCCGAGAC CAGTAAGTAA TGTCTCATCA CTCATTTTAT ACGCAAGACC ATGAGTTGTC GCCGACACTA TTCCTACCTT CAGAGACCTC ACTCCTCAGA CCCAACTCGA GCTCGGCAAA TACTTTGGTA CTCCCGAGAT TCATCCATCA GCCGCCCGAG TTCCTGGGCT TCCTGGAGTC TCAATCATCA CAGACGAAGT CCTCAGGTCC ACCGGTCGTG TCCCCGACTA CAAAAACCCC TTTGCCACCC AAAAGTGGCA CACCGAGTAA GTTTTACAAT CTTTTGAAAG TTTACAATGA CTAAACCTGG TCGATTAGTT TGACTCACGA ACCTCAACCC CCTGGAGTCA CTCACCTTCA TCTTGACCAT CTCCCCGGAG TTGGCGGTGA CACTCTCTGG TCTTCGGGCT ATGCTGCGTA TGACAAGCTT TCTCCTGCTT TCCAGAAAGT CCTCGATGGT CTTGAGGGCT TGTATCGATC TGCTCACAGC TACCCCAATC CGGTGACTGG CGAGCTTGAG CCCATCATCA ATGCTCATCC TATCGTTCGC GTCCACCCGG CGACTGGATG GAAGGCTCTT TTCGTCAACT CTCGATACAC TATTGGTATT AAGGGCTTCG AGCAGTCGGA AGCCCAAGCT ATTTTGCAGA AGGTAAGTTT ATTGATTTGC ATTTGACCTC GCCGTCAACT GATCATATTT CCGAAATGAT CCACAGCTTT TCCAAGTGTA TGAGCAAAAC CCAGATACCC AAGTCAGGTT CCGATGGACT CCTCGATCCA GTGCTTTGTG GGAGTGAGTC CTACCTTCTT TCTTGTTGTT TTAATTCATT TCTGACCGAC GCATTTTAGC AACCGAGTTT GTACGCGCCT TTGTCATTTG ATCACAGGTG CCCACGTTTA CTGACATTTA TCCCTAGCCA TCCACTCCGC TGTTTATGAC TACCTCGACG AGGGGTCCGC CGAGCCCCGA CACGGGACAC GAGTTTCGAG TTTGGCTGAG AAGCCCATTC CCGTAAGCGA AGGAGTTGGT AAAAGCCGAA GAGAAGCCTT GGGTCTCAAC ACTGGTGTCG TGCACGAATT GCCTATTGAT ACTCACGCGT ACTAA
Protein sequence | MPIAVNDNVD KDTLGRPFSL AKSTRERLVQ AGIDLSKGYP EYPIRPKTIE LASDIRKEGW EHKDPGARAD KEKKALFGAA KEVIHLSPHL GTEIVGLQLN QLSDQQKDEL ALLIAERTVV FFRDQDLTPQ TQLELGKYFG TPEIHPSAAR VPGLPGVSII TDEVLRSTGR VPDYKNPFAT QKWHTDLTHE PQPPGVTHLH LDHLPGVGGD TLWSSGYAAY DKLSPAFQKV LDGLEGLYRS AHSYPNPVTG ELEPIINAHP IVRVHPATGW KALFVNSRYT IGIKGFEQSE AQAILQKLFQ VYEQNPDTQV RFRWTPRSSA LWDNRVSIHS AVYDYLDEGS AEPRHGTRVS SLAEKPIPVS EGVGKSRREA LGLNTGVVHE LPIDTHAY
| |
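The numeric fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database: it assumes the Start bp and End bp coordinates are 1-based and inclusive, and that the gene span runs from the start codon to the stop codon, so that any non-coding bases are intronic (consistent with "Is gene spliced: Yes"). The helper name `gc_fraction` is hypothetical and is shown only to illustrate how a GC content figure could be recomputed from the space-grouped sequence text.

```python
# Values copied from the record above.
start_bp = 618841
end_bp = 620495
protein_length_aa = 388

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length_bp = end_bp - start_bp + 1

# Coding bases: one codon per residue, plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# Assumption: the gene span has no UTRs, so the remainder is intron sequence.
intron_length_bp = gene_length_bp - coding_length_bp


def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case.

    Hypothetical helper; the record prints the sequence in space-separated
    blocks of 10, so whitespace is removed before counting.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


print(gene_length_bp)    # 1655, matching the Gene Length field
print(coding_length_bp)  # 1167
print(intron_length_bp)  # 488
```

Passing the full Gene sequence field to `gc_fraction` would give a value to compare with the reported GC content of 49%. The result is not quoted here because it has not been recomputed.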