Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF02980 |
Symbol | |
ID | 3258373 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 852121 |
End bp | 853968 |
Gene Length | 1848 bp |
Protein Length | 447 aa |
Translation table | |
GC content | 47% |
IMG OID | 638257425 |
Product | mitochondrion protein, putative |
Protein accession | XP_571373 |
Protein GI | 58268434 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG2175] Probable taurine catabolism dioxygenase |
TIGRFAM ID | [TIGR02410] trimethyllysine dioxygenase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.455571 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGCGTGTCTA CTTTTCTACA TGATTTGCTA ATTACTCTTG TTGTACATGC TTCCTTCTTC ATCAACCTCA CGCTCCCTAT ATTGCACATC CATTGATCTC CATTCCTCGA CACCAACCGT GCATGAGTCA TGTCCATTGC AAGGATTTTT GGTATGGGGC TCCGTAACAA GCGAGTATCA ACGACTGCTT TTCGCTACAA CGCAAATCGG AATTCCAAAG TGCAGTTGCG CTCAATAGTG CGATACAGAC ATTCATCTCC AAGTCAAGAA TTTGAGACAA ACTCCGCAGA AATTCGCGTC AACAGTACCA CCATAACTAT CAAACAACCC GACGAGGATG ATCTTCAGTA GTGAGTCGAC GACAATGGCA TTGAAGGCTG ACTGTTAGCG ACCATTTCTA TCTTTTCGAT CACTGCCGCT GTCCGCAATG TTTCCACCCT CGCACCAAGC AGAGACTGAA GACTTTATCC CAGGTACGTT GCGGTCAAAG TTTATATGCC CCAATCTCTC GTAATTTGTT ACCATAACTT ATCCTTGCAC TACTAGATAC CTTCTGATAT ACACCCTACA GCCGTTGCGC TGAGTAGATC AGGTTTGCAT ATCACGTGGT CGACGCCGTC TGCCCATACG TCTTTTTTTC CCGCTGGCTT CCTCAGGCGA GCGGCATATG AGACTCAACT TTCTGAGCAT GTAGACTGTC GTGACGAGTA GGTGTCCCTC TATTCATGGA TATTGCTGAC AACGCTAAAC TAGCCGCACA CTATGGAACT CTGAGATATC AAAATCACCT CCTTATGTTG CGTATGATGA CATTATGTCA CAACAGGTAC ATCAGCATGA ACAAGCTGTA CTGCAGGTCT TGAATAAAGT GGTCAGTCAA ACTGTAGCAT GGCATTCACC TTGCCGCCGA CGAAACTATG TTGTTGACGA TTGACTCGTA GCATCAATTT GGCTTCTGCT TCGTTACTGG AGTTCCAATA GATGCAAAGG AAACTGAGAC ACTTATTAAA TCTATAGGTC CTATCAGACA GACCCATTGT AAGTGGAAGT ACCTTCATCT TGACCATATG CAGTATACAG ACACTGATAC ATCTAAGACG GCGGCTTTTG GTCATTTACC GCAGACTTAA GCCATGGTGA TCTGGCATAC AGTGCTCAAT CATTACCGGC TCACACGGAC ACCACATATT TTACGGATCC TGCCGGCCTT CAGATCTTTC ATCTTCTATC ACATCCTTCA CCTGGGCAAG GCGGTAAAAC TCTGCTGGCA GACGGCTTTC ATGCAGCTTC GCAACTTTCA GCCGTCGATC CTGCCTCTTA TTCTGTTCTT TCGCGGCTCC CTATTCCAGC TCACGCATCA GGGACCAAGG GGACTCTATT GAGACCACTG ATTAGTTTTC CGGTTCTGCG ACATGATGAA TGTGGACGCC TGGCTCAAGT AAGATGGAAC AACGAAGATC GCGGAATTAT TGGGCATGGC TGGTCTGCTA CAGAAGTCCG CCAATGGTAC CAGGCGGCGC AACGATTCGA ATCATTGGTA AAAAGTGAGC AAAACGAGTA TTGGGTACAG CTTAATCCTG GAACAATGTT GAGTAAGTCA CAACTGCCCG TCTTTGGTCT ATTGGCTGAC AGCTTACATT TCCCTTTCGT AGTAATTGAT AACTGGAGAG TCATGCATGG ACGGTCAGAG TTTACGGGAT CTCGCACAAT GTGTGGTGCT TACATTGGCG CGGATGACTG GTATTCTCGG CGGGCAGTTC TGACGGAACG GCATGGAGAT GTAGGGGGAA TGGACGACGT ATGGCGCTTC GGTTGGTAAA CAATGCAAGA ACGAAGCAGA AATCATAC
|
Protein sequence | MSIARIFGMG LRNKRVSTTA FRYNANRNSK VQLRSIVRYR HSSPSQEFET NSAEIRVNST TITIKQPDED DLQYDHFYLF DHCRCPQCFH PRTKQRLKTL SQIPSDIHPT AVALSRSGLH ITWSTPSAHT SFFPAGFLRR AAYETQLSEH VDCRDDRTLW NSEISKSPPY VAYDDIMSQQ VHQHEQAVLQ VLNKVHQFGF CFVTGVPIDA KETETLIKSI GPIRQTHYGG FWSFTADLSH GDLAYSAQSL PAHTDTTYFT DPAGLQIFHL LSHPSPGQGG KTLLADGFHA ASQLSAVDPA SYSVLSRLPI PAHASGTKGT LLRPLISFPV LRHDECGRLA QVRWNNEDRG IIGHGWSATE VRQWYQAAQR FESLVKSEQN EYWVQLNPGT MLIIDNWRVM HGRSEFTGSR TMCGAYIGAD DWYSRRAVLT ERHGDVGGMD DVWRFGW
|
| |