Gene CNC06800 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC06800 
Symbol 
ID3256697 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp1987977 
End bp1989387 
Gene Length1411 bp 
Protein Length376 aa 
Translation table 
GC content51% 
IMG OID638255900 
Producttaurine dioxygenase, putative 
Protein accessionXP_569951 
Protein GI58265590 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGTCG CCCTCGCAAC TTCCCTCACC GACCCTGTTG AGTCCATCAA GGCTGGCCTT 
GCTGCCGTCG ATGTGAACAA GGAGACGCCC TTCGACCTTC GAGCATACTC TCACTTTGAC
TCCACCCCCA GTATTGGTAC TGAGTTCCGA GACTCTCCTT CCAAGGATGG CAAGCCGGTA
TTGAGTATTC GAGACATCCT TGGGAATGAT GAGCGATTGA AGGCCCTCGG GCGGCTTGTG
TAAGTATGCT GAACGGCTCA TCGCAGGAGT TGCGACTAAC TGAAGACGTA GTTCTGAGCG
AGGTGTCGTC TTCTTCCGTG ACGCCACGAT CTCCCCTGTA GAGCAGAAGG ATCTTATAGA
GGCTCTTGGT GCCCTTGGAG GAAAGCCCAA GACTAGTGGT CTACACGTCC ACCCTCTTAC
TCTTGGCGGA AGTGAGCTTG GTGATGAGAT TAGCGTTATC TCCAACCAAT TCGTTTTCGA
CAAGAACTTC CAGAAAAGTG ATGACACTGT TTTGAAGAGA CCTTTCGGCA ATACTCTCTG
GGTACGTTGA GGGTTATTAT GGCATTACCT GGGAATTAGC GCTCATACTT CACCGCAGCA
TTCTGATATC ACCTTTGAGC CCCACCCTTC TGACTACGCC ACTCTTCAGA TCAGAACCCT
TCCCGAAGTC GGTGGTAAGT ACATTCGCTT TAAGAAACAT CAGCTGTTCC TTCACTCACA
TGACTCAGGT GACACTCTCT GGGCTTCCTC TTACGAGGCG TACGACCGTC TTTCCCCCGC
ATACAGGACC TTCCTCGAAG GTCTCACTGC CACCCACGTC GGTCAACATT TCATTGACAT
GGCCCGCAAG ACCAACGCCA CCTTGCGAGA GCCGCGAGGT GCTCCTGAGA ACGTCGGCCA
GCACTTGAGC GCTGTCCATC CTGTCATCAG GACCAACCCC GTCACCGGCT GGAAGGGTCT
CTTTGTCAAC CGAGTCTTTA CCAAGAAGAT TAACGAATTG ACTCCTCATG AATCTGACCG
TTTGCTCGGA TTCCTTTATG AGCACATCGA TGGTAACCAC GACCTTCAGG TCCGATTCCG
ATGGGAGGAG AACAATCTTG TAAGTCCCAA GTTGATTAAG GTATTTTGGC GGGAGAATTT
GTTGACGTGA ATTCGCTGCA ATAGGCCATC TGGGACAACA GATGCACTTT CCGTGAGTAC
CATTTACCGT TCACTTACCC ACACTAGTGC TGACCGCGTT TTAGACTCTG CCACTTATGA
CCTGGACAAG AACGTTCGAG TTGGAACCAG GTCTGTTTCC GTGGGTGAGA GGCCATTCTA
CGACCCCAAG TCCGTAAGCC GTCGTGAAGG TCTTTACAAT GAAAAGGCAG AGCTTGAAAA
GGCAAACGAG GCAGTCGGTG ATGGGTTGTA A
 
Protein sequence
MPVALATSLT DPVESIKAGL AAVDVNKETP FDLRAYSHFD STPSIGTEFR DSPSKDGKPV 
LSIRDILGND ERLKALGRLV SERGVVFFRD ATISPVEQKD LIEALGALGG KPKTSGLHVH
PLTLGGSELG DEISVISNQF VFDKNFQKSD DTVLKRPFGN TLWHSDITFE PHPSDYATLQ
IRTLPEVGGD TLWASSYEAY DRLSPAYRTF LEGLTATHVG QHFIDMARKT NATLREPRGA
PENVGQHLSA VHPVIRTNPV TGWKGLFVNR VFTKKINELT PHESDRLLGF LYEHIDGNHD
LQVRFRWEEN NLAIWDNRCT FHSATYDLDK NVRVGTRSVS VGERPFYDPK SVSRREGLYN
EKAELEKANE AVGDGL