Gene CNF02980 details


Gene Information

Locus tag: CNF02980
Symbol:
ID: 3258373
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006691
Strand:
Start bp: 852121
End bp: 853968
Gene Length: 1848 bp
Protein Length: 447 aa
Translation table:
GC content: 47%
IMG OID: 638257425
Product: mitochondrion protein, putative
Protein accession: XP_571373
Protein GI: 58268434
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2175] Probable taurine catabolism dioxygenase
TIGRFAM ID: [TIGR02410] trimethyllysine dioxygenase
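
The gene length in this record matches its start and end coordinates if they are read as 1-based and inclusive: 853968 - 852121 + 1 = 1848 bp. The short Python sketch below reproduces that check. The function name `span_length` is hypothetical (it is not part of any database tool), and the inclusive-coordinate convention is an assumption that happens to agree with the values listed here.

```python
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


# Coordinates taken from the record: Start bp 852121, End bp 853968.
gene_length = span_length(852121, 853968)
print(gene_length)  # 1848
```

A 447 aa protein needs 447 x 3 = 1341 coding bases, or 1344 with a stop codon. That is shorter than the 1848 bp span, which is consistent with the record marking the gene as spliced.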


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.455571
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
AGCGTGTCTA CTTTTCTACA TGATTTGCTA ATTACTCTTG TTGTACATGC TTCCTTCTTC 
ATCAACCTCA CGCTCCCTAT ATTGCACATC CATTGATCTC CATTCCTCGA CACCAACCGT
GCATGAGTCA TGTCCATTGC AAGGATTTTT GGTATGGGGC TCCGTAACAA GCGAGTATCA
ACGACTGCTT TTCGCTACAA CGCAAATCGG AATTCCAAAG TGCAGTTGCG CTCAATAGTG
CGATACAGAC ATTCATCTCC AAGTCAAGAA TTTGAGACAA ACTCCGCAGA AATTCGCGTC
AACAGTACCA CCATAACTAT CAAACAACCC GACGAGGATG ATCTTCAGTA GTGAGTCGAC
GACAATGGCA TTGAAGGCTG ACTGTTAGCG ACCATTTCTA TCTTTTCGAT CACTGCCGCT
GTCCGCAATG TTTCCACCCT CGCACCAAGC AGAGACTGAA GACTTTATCC CAGGTACGTT
GCGGTCAAAG TTTATATGCC CCAATCTCTC GTAATTTGTT ACCATAACTT ATCCTTGCAC
TACTAGATAC CTTCTGATAT ACACCCTACA GCCGTTGCGC TGAGTAGATC AGGTTTGCAT
ATCACGTGGT CGACGCCGTC TGCCCATACG TCTTTTTTTC CCGCTGGCTT CCTCAGGCGA
GCGGCATATG AGACTCAACT TTCTGAGCAT GTAGACTGTC GTGACGAGTA GGTGTCCCTC
TATTCATGGA TATTGCTGAC AACGCTAAAC TAGCCGCACA CTATGGAACT CTGAGATATC
AAAATCACCT CCTTATGTTG CGTATGATGA CATTATGTCA CAACAGGTAC ATCAGCATGA
ACAAGCTGTA CTGCAGGTCT TGAATAAAGT GGTCAGTCAA ACTGTAGCAT GGCATTCACC
TTGCCGCCGA CGAAACTATG TTGTTGACGA TTGACTCGTA GCATCAATTT GGCTTCTGCT
TCGTTACTGG AGTTCCAATA GATGCAAAGG AAACTGAGAC ACTTATTAAA TCTATAGGTC
CTATCAGACA GACCCATTGT AAGTGGAAGT ACCTTCATCT TGACCATATG CAGTATACAG
ACACTGATAC ATCTAAGACG GCGGCTTTTG GTCATTTACC GCAGACTTAA GCCATGGTGA
TCTGGCATAC AGTGCTCAAT CATTACCGGC TCACACGGAC ACCACATATT TTACGGATCC
TGCCGGCCTT CAGATCTTTC ATCTTCTATC ACATCCTTCA CCTGGGCAAG GCGGTAAAAC
TCTGCTGGCA GACGGCTTTC ATGCAGCTTC GCAACTTTCA GCCGTCGATC CTGCCTCTTA
TTCTGTTCTT TCGCGGCTCC CTATTCCAGC TCACGCATCA GGGACCAAGG GGACTCTATT
GAGACCACTG ATTAGTTTTC CGGTTCTGCG ACATGATGAA TGTGGACGCC TGGCTCAAGT
AAGATGGAAC AACGAAGATC GCGGAATTAT TGGGCATGGC TGGTCTGCTA CAGAAGTCCG
CCAATGGTAC CAGGCGGCGC AACGATTCGA ATCATTGGTA AAAAGTGAGC AAAACGAGTA
TTGGGTACAG CTTAATCCTG GAACAATGTT GAGTAAGTCA CAACTGCCCG TCTTTGGTCT
ATTGGCTGAC AGCTTACATT TCCCTTTCGT AGTAATTGAT AACTGGAGAG TCATGCATGG
ACGGTCAGAG TTTACGGGAT CTCGCACAAT GTGTGGTGCT TACATTGGCG CGGATGACTG
GTATTCTCGG CGGGCAGTTC TGACGGAACG GCATGGAGAT GTAGGGGGAA TGGACGACGT
ATGGCGCTTC GGTTGGTAAA CAATGCAAGA ACGAAGCAGA AATCATAC
 
Protein sequence
MSIARIFGMG LRNKRVSTTA FRYNANRNSK VQLRSIVRYR HSSPSQEFET NSAEIRVNST 
TITIKQPDED DLQYDHFYLF DHCRCPQCFH PRTKQRLKTL SQIPSDIHPT AVALSRSGLH
ITWSTPSAHT SFFPAGFLRR AAYETQLSEH VDCRDDRTLW NSEISKSPPY VAYDDIMSQQ
VHQHEQAVLQ VLNKVHQFGF CFVTGVPIDA KETETLIKSI GPIRQTHYGG FWSFTADLSH
GDLAYSAQSL PAHTDTTYFT DPAGLQIFHL LSHPSPGQGG KTLLADGFHA ASQLSAVDPA
SYSVLSRLPI PAHASGTKGT LLRPLISFPV LRHDECGRLA QVRWNNEDRG IIGHGWSATE
VRQWYQAAQR FESLVKSEQN EYWVQLNPGT MLIIDNWRVM HGRSEFTGSR TMCGAYIGAD
DWYSRRAVLT ERHGDVGGMD DVWRFGW
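
The sequences above are printed in space-separated blocks of ten characters, so the spacing has to be removed before lengths or base composition can be compared with the values in the Gene Information section. The Python sketch below shows one way to do that. The helper names `clean_sequence` and `gc_percent` are hypothetical, and the input is only the first three blocks of the gene sequence, not the full record.

```python
def clean_sequence(block_text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in blocks."""
    return "".join(block_text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Illustrative input: the first three 10-base blocks of the gene sequence.
example = "AGCGTGTCTA CTTTTCTACA\nTGATTTGCTA"
seq = clean_sequence(example)
print(len(seq), round(gc_percent(seq), 1))
```

Running the same two functions on the complete gene sequence would give values to compare with the listed gene length and GC content.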