Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF01370 |
Symbol | |
ID | 3258126 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 401651 |
End bp | 403792 |
Gene Length | 2142 bp |
Protein Length | 330 aa |
Translation table | |
GC content | 48% |
IMG OID | 638257261 |
Product | conserved hypothetical protein |
Protein accession | XP_571513 |
Protein GI | 58268714 |
COG category | [R] General function prediction only |
COG ID | [COG3491] Isopenicillin N synthase and related dioxygenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCACTGGGA CCTGTTCTAT ATCAGCTCAA CGTTCATGGT TATCCAGAAC AACCAGGAGA TCAGGCAATG CGAATACGTA CCACTGGGGA AATCCTGAGC AAGGTGGACG TGCGCATCTA CGCCCCCGGG CATAACATAT CCCCCTCTGC GTAGGTTTCA GCCGTTGAAT CTCTCGCACC AACTTCCTCA TCTACTCACT CGGCATCGAC GACCTTTTCC GCAAGATCCT CTGAGATGTT TGCAGCGAGA TGGGCGATTT TGCCGTCCTT GATGGCGATA TCAAGCTTTC CAGTATCTCC GGATGTGACT ACGATGCCAT TTTTCACGAC GAGGTCGAAC TGTTTAGACA TGTTGTTGGG TGATGGTAGT TCTTGTAATG AGGGACGCAG AAAAAAGGAT GGATACCATT GGGGTAAAAT GAGCAATCTC ATCCGCCCTC TAAAGGTCTG TAGGTCAGGT CTGCGTGAAA CCCGAAGCCG GATAAGCTTA CGGCGGAGAT AGGTTATCTC TCTCGTTATT CGGAAACTGC TCAGCCGAAC CCTTGTTCAT ATTTCAACTC TCATATGGAT CACATTCTTG ACCAACTCGA ACACAATTCA CAAACTCTCA GCTCTTGAAG ATATCAAATA GCACACTCTA GCATTAGACT ATGACCGTCA CCATTAACCC TGCTGCCTCC CTTCCCATCA TTTCGCTCGC CGAGCACAAT TCGGTTGACT CCCTTGCTAG GGCTCTCTAC GATTCTTGTA CTCAAGAGGG CTTTATTTAC GTCTGTGACC ATGAGATCCA ACAAGATCTC ATTGACCAGG CCTTTGCCAT TTCGGCGAAT TATTTTACTC ACGCCCGCCC AGAGGATAAA GTCGATCTTA AGACCAACCT TGGCTATACT GCAGTGTGAG TGTCGCCCTT GGAATCTCGC TTGGATTCAG CTGATGTCCA CTTGCAGCCG ACAAGAAAGG TAAGCTGATG CGTTGGCTAA GTTAACCATT CACTCATTCG CTTGCCCTTA CAGTCTTGAC TCCACAAGGC CCAGCTCCGG TGATCTCAAG GAATTCTTCC ACGTTGCTGA TAATCATTGG CGCGTGAGGA ACGGAGAGAG CCCGCAAGAA CTTCCTGAAG CTCTCGAATC CTCTCGAAAA GCATTAGACG ACTTTATCGA GCAGATTAAT GGTCTTGCTG ACAGAATCTT GAGGGGTTTG TCAGTGGCAC TCAAGGTGCG TCCTGATCTG AGCGATGATC AGTGTGAAAT ATGGCTAACC TGATTACCAG TTGAAGCCCG AATTCTTGAC GAATCAGCAC AGGGGAGAGT AAGTTGTGGA TTTACCCATA ATGATTTCCA TGCTGACAAA TCGCTAGACT CAACCGACTC CGTATGCTCC ACTATCCACC TGTTGAAGTG GAACAAAATG GAATCAACTC TGATAGGCAA GCCTATCCAT TTTGAAGCGC ACGACTACAG ACTGACAGAT TTTTTGTAGC AATGAAATCC GAGCAGGGGC TCATACCGAC TATGGGTCCA TAACTATCCT CTTCCAGCAC ATTGTATCTG GTTTGCAAGT TCATCGTAAC GGCTCTTGGA TCGATGTTGC GCCTAGAAAA GGCTGTGTCG TTATCAACAT TGGCGATGCT CTTGAGTTCT GGTCTGGTGG CTTGTTCAAG GTAGGTCATG GGTGATCGAA ACGTAGACTG AATATAGCTA ACTTTCCGAT CAGTCCACTC TCCATCGAGT TGTCATGCCC CGCTCCCAAG CTGAAATGGC TTCTAGGTAC TGTGAGTCAC CCTTCAAGCC CAAGGATCAG AAGATTAATA ATTGATGAGC CTCTTAGCTA TTGCCTATTT TGTTCATGCC GACAATGCTA GTATTCTGGA GCCTTTCACT GATGGAATGT GAGTTACAAT ACCCTCAATG ATTTAAAAGC TCTGAGTGCT GACATCCCTG ACAACAGTGA TGAGGATGCG CTCGACGAAA TTATTGCCCG CAAAGGACTC CCTCGAGGGA CGCGAAGGAT TACCGGAGGA GATTATGTCC AAGCTCGACT TGCTGCTACT TATGGTATGA AGGTGGCGGC CTGATGGATA TGATCTGGGG GAATGAAGTT AGTAAAAGAA CTAGAAAAGT TGTAACTATT GTCGGAGATG TG
|
Protein sequence | MTVTINPAAS LPIISLAEHN SVDSLARALY DSCTQEGFIY VCDHEIQQDL IDQAFAISAN YFTHARPEDK VDLKTNLGYT AVRQESLDST RPSSGDLKEF FHVADNHWRV RNGESPQELP EALESSRKAL DDFIEQINGL ADRILRGLSV ALKLKPEFLT NQHRGELNRL RMLHYPPVEV EQNGINSDSN EIRAGAHTDY GSITILFQHI VSGLQVHRNG SWIDVAPRKG CVVINIGDAL EFWSGGLFKS TLHRVVMPRS QAEMASRYSI AYFVHADNAS ILEPFTDGID EDALDEIIAR KGLPRGTRRI TGGDYVQARL AATYGMKVAA
|
| |