Gene Information |
Locus tag | CNC06800 |
Symbol | |
ID | 3256697 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 1987977 |
End bp | 1989387 |
Gene Length | 1411 bp |
Protein Length | 376 aa |
Translation table | |
GC content | 51% |
IMG OID | 638255900 |
Product | taurine dioxygenase, putative |
Protein accession | XP_569951 |
Protein GI | 58265590 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | |
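The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the record itself. It assumes that the start and end positions are 1-based and inclusive, and that the gene span runs from the start codon through the stop codon with no untranslated regions. The record states neither assumption, and the variable names are illustrative.

```python
# Values copied from the Gene Information table.
start_bp = 1987977
end_bp = 1989387
protein_length_aa = 376

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the coding sequence is one codon per residue plus one stop codon.
coding_length_bp = (protein_length_aa + 1) * 3

# The gene is marked as spliced, so the remainder would be intron sequence.
implied_intron_bp = gene_length_bp - coding_length_bp

print(gene_length_bp, coding_length_bp, implied_intron_bp)  # 1411 1131 280
```

Under these assumptions the span matches the listed gene length of 1411 bp, and roughly 280 bp of the gene would be intron sequence, which is consistent with the "Is gene spliced | Yes" entry.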
|
|
Plasmid Coverage information |
Num covering plasmid clones | 36 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCTGTCG CCCTCGCAAC TTCCCTCACC GACCCTGTTG AGTCCATCAA GGCTGGCCTT GCTGCCGTCG ATGTGAACAA GGAGACGCCC TTCGACCTTC GAGCATACTC TCACTTTGAC TCCACCCCCA GTATTGGTAC TGAGTTCCGA GACTCTCCTT CCAAGGATGG CAAGCCGGTA TTGAGTATTC GAGACATCCT TGGGAATGAT GAGCGATTGA AGGCCCTCGG GCGGCTTGTG TAAGTATGCT GAACGGCTCA TCGCAGGAGT TGCGACTAAC TGAAGACGTA GTTCTGAGCG AGGTGTCGTC TTCTTCCGTG ACGCCACGAT CTCCCCTGTA GAGCAGAAGG ATCTTATAGA GGCTCTTGGT GCCCTTGGAG GAAAGCCCAA GACTAGTGGT CTACACGTCC ACCCTCTTAC TCTTGGCGGA AGTGAGCTTG GTGATGAGAT TAGCGTTATC TCCAACCAAT TCGTTTTCGA CAAGAACTTC CAGAAAAGTG ATGACACTGT TTTGAAGAGA CCTTTCGGCA ATACTCTCTG GGTACGTTGA GGGTTATTAT GGCATTACCT GGGAATTAGC GCTCATACTT CACCGCAGCA TTCTGATATC ACCTTTGAGC CCCACCCTTC TGACTACGCC ACTCTTCAGA TCAGAACCCT TCCCGAAGTC GGTGGTAAGT ACATTCGCTT TAAGAAACAT CAGCTGTTCC TTCACTCACA TGACTCAGGT GACACTCTCT GGGCTTCCTC TTACGAGGCG TACGACCGTC TTTCCCCCGC ATACAGGACC TTCCTCGAAG GTCTCACTGC CACCCACGTC GGTCAACATT TCATTGACAT GGCCCGCAAG ACCAACGCCA CCTTGCGAGA GCCGCGAGGT GCTCCTGAGA ACGTCGGCCA GCACTTGAGC GCTGTCCATC CTGTCATCAG GACCAACCCC GTCACCGGCT GGAAGGGTCT CTTTGTCAAC CGAGTCTTTA CCAAGAAGAT TAACGAATTG ACTCCTCATG AATCTGACCG TTTGCTCGGA TTCCTTTATG AGCACATCGA TGGTAACCAC GACCTTCAGG TCCGATTCCG ATGGGAGGAG AACAATCTTG TAAGTCCCAA GTTGATTAAG GTATTTTGGC GGGAGAATTT GTTGACGTGA ATTCGCTGCA ATAGGCCATC TGGGACAACA GATGCACTTT CCGTGAGTAC CATTTACCGT TCACTTACCC ACACTAGTGC TGACCGCGTT TTAGACTCTG CCACTTATGA CCTGGACAAG AACGTTCGAG TTGGAACCAG GTCTGTTTCC GTGGGTGAGA GGCCATTCTA CGACCCCAAG TCCGTAAGCC GTCGTGAAGG TCTTTACAAT GAAAAGGCAG AGCTTGAAAA GGCAAACGAG GCAGTCGGTG ATGGGTTGTA A
|
Protein sequence | MPVALATSLT DPVESIKAGL AAVDVNKETP FDLRAYSHFD STPSIGTEFR DSPSKDGKPV LSIRDILGND ERLKALGRLV SERGVVFFRD ATISPVEQKD LIEALGALGG KPKTSGLHVH PLTLGGSELG DEISVISNQF VFDKNFQKSD DTVLKRPFGN TLWHSDITFE PHPSDYATLQ IRTLPEVGGD TLWASSYEAY DRLSPAYRTF LEGLTATHVG QHFIDMARKT NATLREPRGA PENVGQHLSA VHPVIRTNPV TGWKGLFVNR VFTKKINELT PHESDRLLGF LYEHIDGNHD LQVRFRWEEN NLAIWDNRCT FHSATYDLDK NVRVGTRSVS VGERPFYDPK SVSRREGLYN EKAELEKANE AVGDGL
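The sequence fields above are printed in space-separated blocks of ten characters. The following sketch shows one way to strip those separators and compute a GC percentage that could be compared with the "GC content" field. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the demo uses only the first three blocks of the gene sequence, so its result is not expected to equal the value for the full gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove the spaces and line breaks used to print the sequence in blocks."""
    return "".join(raw.split()).upper()


def gc_percent(raw: str) -> float:
    """Return the percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three blocks of the gene sequence from this record, as a short demo.
demo = "ATGCCTGTCG CCCTCGCAAC TTCCCTCACC"
print(len(clean_sequence(demo)), round(gc_percent(demo), 1))
```

Passing the complete "Gene sequence" string to `gc_percent` would give the figure for the whole gene.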