Gene CNC02180 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNC02180 
Symbol 
ID3256623 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006685 
Strand
Start bp618841 
End bp620495 
Gene Length1655 bp 
Protein Length388 aa 
Translation table 
GC content49% 
IMG OID638255439 
Productsulfonate dioxygenase, putative 
Protein accessionXP_569491 
Protein GI58264670 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value0.450194 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTATCG CCGTCAACGA CAACGTAGAC AAAGACACCC TGGGCAGACC TTTCTCTCTC 
GCCAAGAGCA CTCGAGGTTG GTTTTGATAC TCTATTAAAT AAGCTTCTCG CTGACCGAGA
TACTTAATCC TCTTTTAGAG CGTTTGGTTC AAGCCGGTAT TGACCTTTCC AAGGGTTACC
CAGAGTACCC AATCAGGCCC AAGACTATCG AACTCGCCAG TGACATTCGC AAGGAAGTGG
GTATTGTTTT TTTCTTTTTC GCTTTTGGGA TGGGAATTGC AATTTAGTGA ACTAACGTCT
ATTACTCAGG GATGGGAGCA CAAGGACCCC GGAGCCAGGG CTGACAAGGA AAAGAAAGCA
TTGTTTGGCG CCGCCAAAGA AGTTATCCAC TTGTCCCCTC ACCTTGGTAC CGAGTAAGAT
TTGATCTACA CTTCTTCTTT GTCTTTGTCA CTGACACTTG TCACAGGATA GTCGGTCTTC
AACTTAACCA GCTTAGTGAC CAGCAGAAAG ATGAGCTTGC GCTCCTGATT GCTGAGCGCA
CTGTGGTATT CTTCCGAGAC CAGTAAGTAA TGTCTCATCA CTCATTTTAT ACGCAAGACC
ATGAGTTGTC GCCGACACTA TTCCTACCTT CAGAGACCTC ACTCCTCAGA CCCAACTCGA
GCTCGGCAAA TACTTTGGTA CTCCCGAGAT TCATCCATCA GCCGCCCGAG TTCCTGGGCT
TCCTGGAGTC TCAATCATCA CAGACGAAGT CCTCAGGTCC ACCGGTCGTG TCCCCGACTA
CAAAAACCCC TTTGCCACCC AAAAGTGGCA CACCGAGTAA GTTTTACAAT CTTTTGAAAG
TTTACAATGA CTAAACCTGG TCGATTAGTT TGACTCACGA ACCTCAACCC CCTGGAGTCA
CTCACCTTCA TCTTGACCAT CTCCCCGGAG TTGGCGGTGA CACTCTCTGG TCTTCGGGCT
ATGCTGCGTA TGACAAGCTT TCTCCTGCTT TCCAGAAAGT CCTCGATGGT CTTGAGGGCT
TGTATCGATC TGCTCACAGC TACCCCAATC CGGTGACTGG CGAGCTTGAG CCCATCATCA
ATGCTCATCC TATCGTTCGC GTCCACCCGG CGACTGGATG GAAGGCTCTT TTCGTCAACT
CTCGATACAC TATTGGTATT AAGGGCTTCG AGCAGTCGGA AGCCCAAGCT ATTTTGCAGA
AGGTAAGTTT ATTGATTTGC ATTTGACCTC GCCGTCAACT GATCATATTT CCGAAATGAT
CCACAGCTTT TCCAAGTGTA TGAGCAAAAC CCAGATACCC AAGTCAGGTT CCGATGGACT
CCTCGATCCA GTGCTTTGTG GGAGTGAGTC CTACCTTCTT TCTTGTTGTT TTAATTCATT
TCTGACCGAC GCATTTTAGC AACCGAGTTT GTACGCGCCT TTGTCATTTG ATCACAGGTG
CCCACGTTTA CTGACATTTA TCCCTAGCCA TCCACTCCGC TGTTTATGAC TACCTCGACG
AGGGGTCCGC CGAGCCCCGA CACGGGACAC GAGTTTCGAG TTTGGCTGAG AAGCCCATTC
CCGTAAGCGA AGGAGTTGGT AAAAGCCGAA GAGAAGCCTT GGGTCTCAAC ACTGGTGTCG
TGCACGAATT GCCTATTGAT ACTCACGCGT ACTAA
 
Protein sequence
MPIAVNDNVD KDTLGRPFSL AKSTRERLVQ AGIDLSKGYP EYPIRPKTIE LASDIRKEGW 
EHKDPGARAD KEKKALFGAA KEVIHLSPHL GTEIVGLQLN QLSDQQKDEL ALLIAERTVV
FFRDQDLTPQ TQLELGKYFG TPEIHPSAAR VPGLPGVSII TDEVLRSTGR VPDYKNPFAT
QKWHTDLTHE PQPPGVTHLH LDHLPGVGGD TLWSSGYAAY DKLSPAFQKV LDGLEGLYRS
AHSYPNPVTG ELEPIINAHP IVRVHPATGW KALFVNSRYT IGIKGFEQSE AQAILQKLFQ
VYEQNPDTQV RFRWTPRSSA LWDNRVSIHS AVYDYLDEGS AEPRHGTRVS SLAEKPIPVS
EGVGKSRREA LGLNTGVVHE LPIDTHAY