Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNL06740 |
Symbol | |
ID | 3254886 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | - |
Start bp | 873152 |
End bp | 876146 |
Gene Length | 2995 bp |
Protein Length | 744 aa |
Translation table | |
GC content | 51% |
IMG OID | 638254151 |
Product | uroporphyrin-III C-methyltransferase, putative |
Protein accession | XP_568193 |
Protein GI | 58261566 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG0007] Uroporphyrinogen-III methylase |
TIGRFAM ID | [TIGR01469] uroporphyrin-III C-methyltransferase [TIGR01470] siroheme synthase, N-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.605616 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCCCCC CAGCCTCCCA GCCACCCCTC TGGATATCAC TCTGGATGCT TTTCACGACT CTTATCGTCT CGTGGGGTAA GTCGTGCAAA ATCGTTCCTT CCCGTGGGCC ATCACTAACT CCGCCGCTAG ATGCCGGATA CTGTCTCATG AGACCACGAA GCTTGCCTGG CGGCGATCTT TCTTGGATAT GGAAACCTTA CAAGTGGGTA CATTATTTGT CTTGACGCAA TGAGCTGACA TAAACTAGCG ACTTTCCTTA TGGCGAAGTG AGCTCGGCAG TACAGGTGTA AGAGTTTAAC TGACATGGCA TAGATCGACT ATCTCTATGG GTGGAAAGCT TTCAACGAAG AAGATGGATT TACAGCTGCC CAAGGTGGGA CTTAACTTTT CGTAACTCCA ATTTTTCTAA TTTCATTTCC CACAGCCCTT CTTAACGTCG TTGAAATCTT CCTCGCCATC GGTTACCTCT ATCTCCAACA CATTTCCCCT CGAGACAAGC CCTTGCACGC CGTAGCGCCA TTAATAGGAT TTGTGGGAAC TGTTATGACT GCCTCAAAGA CAATTTTATA CTTCCTTCAG GGTGAGTTTA ATTCCTCGGT CATGTTCCTC GCGGCGGGGG TTGGTTTATG TTGTCGTTTG CCATGGCATA AAGTCGCGGA TCGCTTTGAA GGTGTAATCG GAAAGCAGAA TATTCTGAAA GTGTGACATT GGAATGATGG CATTCCGGCG ATCTTCCGGT AGAACCACCA ATCTTGGTCT CATTTTTCCC CGTATCACTT GAGGTATAGT GAGGTATAGG GGTCCGCAGG GCATGAGCAA AGAGTTCTGC AATTCTATCT TTCCGGTGAA CGGCTGTGAC GAGTGGAGCT TCAATACGAT GGCTTTCCCC GTCCATTAGA TTCAGAGCAT TTCTTGTGAA CAAGCGCTAA CGACTATCCA GAATACTTCT GTGGCTGGTG CAATGTTGGT CACAATGACA GGCGGACATT TTGGATGGTT TGGGTTATCC CCAACGGGAC ATGGATTATT CTCCCTACAA TCGTATCTAT TGTCCTCGGC CGATACATTG CCGTTGCTCT AAAACGTGAT GCCGTCTACT CTCCGGTCTC CACCTCCCTG CTGCGCGCTG AATTTCTCGC CGAGAAATTA GCTTCCACAC TTACCGAAGA ACCGACCCAA CTCCCTGCTG CCCTTCCACT CACTTTCCAC CCTCGCACCT TGTCTGTCCT TATTGTCGGT TCCAATCGCC TCGCCGCTAG TCGAGCGCTT ACTTTCCTCG AAGCCGACGC CAAGGTTTTC CTTCTCACGT CGTCTGAAGA GGTGGCTAAG GAGGTCAAGG AGCTTGAAGA GGGTGGTCGG GTTTCCCTCC TCCAGAATAC AGCCTCAGAT TCTGTCGCCT GGTCAGAACT CCTTACAAAG CATGACATTT CTCTCGCATG CGTTACCGAT ACCCTCATCT CCACTCCTTC TCGACGCTCC CTCGCTTCTG CGACAATCAT CTACCAAACC TGTTTGTCGC TTCACATCCC TGTCAACATC TCTGACCAGC CTCATTTCTC CACTTACACT TTCCCCTCTG TACACCGTTT CGCCGGTGCC GATGGCCCTT CGCACTTGCA AGTGGCAGTA TCGACTAACG GTCAAGGCTG TCGAATGAGC GGAAGGATAA AGCGAGAGAT TGTCAGCAGA CTACCTGCAG ACGTAGGCAA GGCAGTTGAT AACGTTGGGA AACTCCGGGC AAGAGCCAAA GCCCGAGCCA AGATCTCGGA AGAAGACGAC GGACCTCTCA ATACGCCCGT ACCCCAACTC CCTACCCCAT CCATTTCTCG CCGCAACTCT GAAGAAGTCA CCCAGCAGCT GAGCGACGAG GAGCAACAAC TGAGAAGGAT GAGATGGGTG TACCAGATGT CAGAGTACTA CAGCTTTGAA CATCTTGCGA AAATTTCAAA TGAAGAGATG GACAAGGCTT TAGATATTTG GGGCCAGCGA GACGAAGGCT ACCTTCCTCA CCACGACGTG AAGACAGTTA ACGCTTCAGA GAAGAAAGGC CGTATCCTCC TTATCGGTTC CGGTCCCGGT CACCCTGGTT TACTCACGAT GGCCGCCCAC ACCGCACTCC GCACAGCCAC CCTCATCCTT TCCGACAAAC TCGTCCCCGC CGAGATCCTT GCACTTATCC CGGACTCTAC CAAACTCCAC ATTGCAAAGA AATTCCCCGG TAACGCCGAA GGCGCCCAGA ATGAAATGAT GGAACTTGCT CTCGAAGGTG CACAAAGGGG TGAGACAGTG GTCAGGCTGA AACAGGGTGA TCCATTCGTA TATGGGCGTG GTGGTGAAGA AGTGCTCTAC TTCCGACAGC ACGGGTTTGA ATCAACTGTT ATCCCAGGTA TCTCCTCCGC TCTCGCCGCG CCGCTCATGA TGGGTATCCC CGTCACTCAA CGAGGCGTCG CGGAATCTCT TGTCCTCTGT ACTGGTGTGG GAAGGCAAGG GAAAGCTGTT CAGCTACCAG GCTATGTCAA ATCTAGAACT CTCGTTATGC TCATGGGTGT TGCGCGTATT TCTCAAATCA TCGAGGTCCT CACGTCCACT ACTGCCACTA CTGCCACCGG ACGTGATGGC GCTGCTTACC CGCCCCACTT GCCGATCGCC GTTATTGAAC GCGCGTCCTC TCCTGATCAG CGAGTAATTC TCTCGACGCT TGAGAAGATT CAGCCGGCGT TGAAGCAGGT GGATGAGAGA CCGCCAGGGA TGATGCTTGT GGGCTGGGCA GCGTTAGCTT TAGAAGGGAA GGGGAGGGTG GACGTACTTG ACAGGTCGGA AGATGATGAG TTTGAGATGG TGGAGAGCTG GTTAGCGGAG GGACAGGAAG GTGAAATAAA GGGATGGAAG GTTAGGGAGG GTTTGAACGA CGAATGGAGA GGGATTTTGA ATGGGATCAT ATAGCCGTGT GATAGTTAGT CGACGTTCGG ATTCATCAGC ATTATTGGCA TATTC
|
Protein sequence | MSPPASQPPL WISLWMLFTT LIVSWALLNV VEIFLAIGYL YLQHISPRDK PLHAVAPLIG FVGTVMTASK TILYFLQEYF CGWCNVGHND RRTFWMVWVI PNGTWIILPT IVSIVLGRYI AVALKRDAVY SPVSTSLLRA EFLAEKLAST LTEEPTQLPA ALPLTFHPRT LSVLIVGSNR LAASRALTFL EADAKVFLLT SSEEVAKEVK ELEEGGRVSL LQNTASDSVA WSELLTKHDI SLACVTDTLI STPSRRSLAS ATIIYQTCLS LHIPVNISDQ PHFSTYTFPS VHRFAGADGP SHLQVAVSTN GQGCRMSGRI KREIVSRLPA DVGKAVDNVG KLRARAKARA KISEEDDGPL NTPVPQLPTP SISRRNSEEV TQQLSDEEQQ LRRMRWVYQM SEYYSFEHLA KISNEEMDKA LDIWGQRDEG YLPHHDVKTV NASEKKGRIL LIGSGPGHPG LLTMAAHTAL RTATLILSDK LVPAEILALI PDSTKLHIAK KFPGNAEGAQ NEMMELALEG AQRGETVVRL KQGDPFVYGR GGEEVLYFRQ HGFESTVIPG ISSALAAPLM MGIPVTQRGV AESLVLCTGV GRQGKAVQLP GYVKSRTLVM LMGVARISQI IEVLTSTTAT TATGRDGAAY PPHLPIAVIE RASSPDQRVI LSTLEKIQPA LKQVDERPPG MMLVGWAALA LEGKGRVDVL DRSEDDEFEM VESWLAEGQE GEIKGWKVRE GLNDEWRGIL NGII
|
| |