Gene CNA03910 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNA03910 
Symbol 
ID3253406 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006670 
Strand
Start bp1050504 
End bp1052323 
Gene Length1820 bp 
Protein Length575 aa 
Translation table 
GC content49% 
IMG OID638252710 
Productmitochondrion protein, putative 
Protein accessionXP_566735 
Protein GI58258645 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG2175] Probable taurine catabolism dioxygenase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones36 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAGCACCCAA CACCTGAACT TGCTATCACG TCCGCCTCAT GTCCCTTCGA CTATTCCTGT 
CTCGCGTTCC ACCAAGCCAG CAATTGCCGA GTAAACTTCT GCCCCAACTA TCGCTGCGTC
GAGCGCAGAG CTCTAGTTCA GATTCACGAC CTCCGCATCC CGCCGTGCAG CAGAGCTTCG
AGGCACTCGC CTCTGACCTT CTTGCCACCC CGAGCACGCT CTCGTCCTCT GGCTTGAAGA
GATCTGCACC GGAGGCTCCG CGAACATTCA TCAAGTCGCC CAGTTTGCCT CCAGTCCAAC
TGAGTCGTAA CCCCTATGCG GGGCAGTATG CGATTTCTGA AGATGATTTG GATGACCCTT
TTGAGACTGA GATTGCTGTT GAAGAACCCA AAGGCCTTGC TGTACCTCCC AGGCGCAGAC
GCTTGGCTGG GCGGCAGAGC TCGGCAACAG CACGACCACA GAGAATTATC AATCTTTCAC
CTCATGATAC TGTCTCGGTG TATCCCGATT TTATCTCTTT AACATCCCAT GGTAGGACGG
GAGTCATAAC CAACGCACGA TTACTTGATG CTTGCCACTG CAAGAAATGC AGGGACCCAT
CTACCCGACA AATGAATACT ACTACAGGGG AAGCCGTTCG TGAATCCAAA ATAGCAAGGA
TCACCAGAGG CAATTCAGTT CGTAAAGGTG GCATTCGTAA AGACGGGCTT GTAGTGAGCT
GGGGTGAAGG AGTGAAGCAT ATGAGCTTTT TTCCTCTCCA CAGATTACGG TCGATGCTAG
AAAGAGACAT GGGCACTGTT TATCGTAGCC CAAGTTTTGT TCACCAGACT TGGGACGGGG
AATCACTTTC TCTTACCAAT CTAAGATTTC AATACTCGGA TCTATCTGAA TCTCTGTTGA
AAGTTTTAGA GCAGCTTCAG GTGTACGGTA TAGTCGTGAT AGAAGGCGTA CCTACGGACC
CTACGGATGA TAAGGAGTGC ATGCTGAGAA AAGTTACCGA TATGATCGGG AAGATTAGAA
ACACATTCTA CGGGGAAACG TGGGATGTGA AAAGTGTGAA ACAGAGCAAG AATATTGCGT
AAGTGGAACT GCTTATTGGG TCTCTCACTT CCTATACTAA TACGACATTC AGCTATACCA
ATCTCAACCT TGGCTTACAT ATGGATCTTC TTTACTTTTC ATCCCCTCCT CGCTTCCAAG
CACTTCACTG CCTCCGGAAT AAGGTTGAAG GCGGTAGCTC TTACTTTGTG GACTCTTTTC
GCACCGTCTC CGACCTACCC CGAGATCAAT TCGAATTCCT GCAAAAAATC AATATAACCT
ATCAGTACGA CAATGACAAC CATTATTTTC GCTATCGTCA TCCCATCATC AGTTCCGATT
TTGTGCGTGG TCGAAACAAT CGACATGCCG CCGTTAACTG GAGTCCCCCT TTCCGCGCCG
CTGCCGAAGC TTTAGACTTT CCCCAGCACG ATTTCGTTGC GGCCGCCAAA CATGAGCAGA
AAGTGCTTCA AGCCATTGCG GATTTTGAAG AACGCCTGAG CGACCCTCGC TATCGATACG
AATTTACCAT GCAGGAAGGG GACCTAGTGC TATTTGACAA TCGAAGAGTC CTGCACGCAC
GCACGGCGTT CCGCGACAAG AAAGATATGG AAGTAGAAGA AGAAGAAAGA GTCGAGCAGA
AATCGGAGAT GGAAAGTGAT AAGGAACCAA CTAGGTGGCT GAAGGGATGT TACTTGGATG
GGGAAGCTGT ATGGGACAAG TTGGCTACAT TAAGGAAACA GTCTTTGGAA AGGAGAGCGG
CTTCTGTGGG GGTTCAATAA
 
Protein sequence
MSLRLFLSRV PPSQQLPSKL LPQLSLRRAQ SSSSDSRPPH PAVQQSFEAL ASDLLATPST 
LSSSGLKRSA PEAPRTFIKS PSLPPVQLSR NPYAGQYAIS EDDLDDPFET EIAVEEPKGL
AVPPRRRRLA GRQSSATARP QRIINLSPHD TVSVYPDFIS LTSHGRTGVI TNARLLDACH
CKKCRDPSTR QMNTTTGEAV RESKIARITR GNSVRKGGIR KDGLVVSWGE GVKHMSFFPL
HRLRSMLERD MGTVYRSPSF VHQTWDGESL SLTNLRFQYS DLSESLLKVL EQLQVYGIVV
IEGVPTDPTD DKECMLRKVT DMIGKIRNTF YGETWDVKSV KQSKNIAYTN LNLGLHMDLL
YFSSPPRFQA LHCLRNKVEG GSSYFVDSFR TVSDLPRDQF EFLQKINITY QYDNDNHYFR
YRHPIISSDF VRGRNNRHAA VNWSPPFRAA AEALDFPQHD FVAAAKHEQK VLQAIADFEE
RLSDPRYRYE FTMQEGDLVL FDNRRVLHAR TAFRDKKDME VEEEERVEQK SEMESDKEPT
RWLKGCYLDG EAVWDKLATL RKQSLERRAA SVGVQ