Gene Information |
Locus tag | CNE00420 |
Symbol | |
ID | 3257764 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | - |
Start bp | 106244 |
End bp | 109830 |
Gene Length | 3587 bp |
Protein Length | 812 aa |
Translation table | |
GC content | 50% |
IMG OID | 638256627 |
Product | conserved hypothetical protein |
Protein accession | XP_571126 |
Protein GI | 58267940 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.0686503 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGCGATCCTT GATATATCGC AGCGCCCTTT GTACTACTCA CCGTTTTTCT AACAAACTTA CATCGCTCTA CAGTCTCAAG AGCAGTAAAG AACCGGCCGC CACTCGCACG CCAATACAAA CTTTTTACTG GCCTAACCAC ATCGACCCAC ACAGAAGATA CGGCATGCCC ATACGCTAAC AAACTCCCGA TATCCGACAT GTTCTCAGCC CGCCGCCAGT CATCCACATC CTCCTCCAAC TCGGCCGCCA TACCTCTAAA TCACCCCACG GCACCGATTA GTAACAATCC ATCATCAAAC GAGGCGTCAC AAGGTACGGG CTCTGGGAAT GGCGATAGCC CATCAACTAG CACAGAGGGG AATGGGAAGC AACGACCAAG AAGAAAAGTC ACAATCGTTC CACCCGTAAC CGCCGCTTGC CTTGATGATC TCCGATCAGC TTTACCGACC GGCACCATTC TACCCAAATC ATCCCAAGCC AATGTACCAT CCACTCCGGG TCCTCATGTT ACAACTGGGG CAAGCACGAC AAGGACGAGC TCAAAGAGAC GGCGAGCACA TACTGTTAGT GTCTGTCTCC CTGATTCCGC CAAGTATGTA CAACTAGAAG AAAAAGATTA TCCGGCGGTG GCACAGGGTG CAAGAGCTGG TAAAAGAGGT GCTCGAAGAC AGTGGTCGCA CGATATAGAG CTCGGCATGA GTTCCACAGA TACAAACCCC GCACATCCTC CTTCCTCCCC CGTTCCCGGT GGGCGGCAAA GAAGTGTATC CACGGCCAGC CAATCAGCCA CATATCCCCA CAGGAGACGA CGTGAACGGC GAGCAAGTAG TCCTGGACCA GTAGGGTATG AGATGGAAGG GAGAGATGGG GATGTAGAGC TTGGAGATGA GTTGGTGGGT GTTCTGGATG TCGTTGATCC GCAGGTCTCG ACAGGTAGGT GATCTTGTTA CGACAAAAAA AGAGTATAGC TAATGGGAAT AGTGAATCAT TTGCAAAATA TGTGCAATTC GGTGATGGTG CCGTACCTAC CGCAGTTGTG GACGCGGCGA CCAGAGGTGC AGTTACCTTC GACGCCTAGC GATGAGGATG TGGGGTAAGT CTCGTCTCGT GTCTATCTCA CTCCCTTTTA CGTTGCTAAA AACTATTACA GATTACTAAA TAGCACTGCC ACCGTTACTC GACGGCGATC TAATACTACC CGCTCAAGGA GACGCACCCT TTCTCTATCA CGATTTGTCC CTGGTAAAAG TGCCTCTGAA TCAACAGCCT TAGCCGCTGA AGATGATACG CTTCCGGTGA CAGCACCGGC AAGCTGGGGT GGGGCGCATC CTTTGAGTAT CATGGAAGAA GAGCCTGAAT TCCAACCACA AGACTCCACC ATCCAATCCT TACCTTCCGC CTTTGCTACA AAACAATTAA TTCCTTCATC TACCACGTCA CTCCCCCTCC CAACTAAACC CACTTCCCTT ACTTCCTCTC CCACCGCTTC CACGATCTCT CTTCGCCGTT CCACCCTCTT TGACAAACAC ATCAAATCCG TCCTCTCCAC TTCTACCCGC CAAAAAATCC TGCTCGCCCT CCAAGGTCTC TGGACGTTCG TCAAAACGCC TATGGGATTT CTCACTGCCA TTTATGGGTT CGCGGTCGTG TTTTGGGGAG CGGCGATTGT GCTGTTCTTG TTGGGCTGGA TACCAACAAG TAGTAAATAT AGGCAGGATG TGTGGGTGGA GATATCAAGT CAGGTCGAGA ATGGGTTGTT TACTGTGACG GGGGTGGGGT TAATACCCTG GCGAGTGGTT 
GATACGTATC GTGAGTCATT GTTTTTGGAC CCTTTTGATC TTCCCTTCCT CCTTGCCGAC TTGCCCACTT TCACCTGTAC CTTTTGCTTT CTCGACCTCG TGGTGTCACG AAATCGAGGA CGAGCATGCT AACCGAATGA TTAGGAATGT CGGTGATCTG GACTCTTAAA CGTAGAGCGG AAAGGCGGCG AGAAAAAATG GGTCTACCTC CTATCGAGGA TGAGAACGAC TTACCCGATC CGCAAGATAT ACCCGGTTAC ATCCATGTAC GTCAATCTGT CTTTCTCTCG TCTGCGTCTC CCCTCCCCTC TTCTCACGAC TAGCGGAATA TCGTGAATGT GGACTTTTGA AATTAGTGCT GACTTAAACC GGTAAATAGG TATTGGACGA AAAAGAGACA GCAAAGTTGA GACATCATCA AGAGAAGTTT GCTCTCAGTC AAACGTGGTA TAAACCACAT GCTACTGCTA CACATCGTGC ATTCCCTATT CGCTGGGCCC TTTGGAACAC TATCGTGCGT CTTCTACCCA TCCCTTGTCC TCTCCTTCAT CGGCCAAATG TGTCGTCAAA ATGGGATGGG TGCTGATGTA AACTGGTGTC GTAGTTGATG GACGGCAACT CATTTTTCCA ATGTATACTT TGCGGATGTA TGTGGGGAAT GAGTGAGTGA AGATTCTTAT TGAATCGGAT TTGCCCTCGC GCCAGATTGA GACCTTTTCT GACTTCCTTG AACTTAGATT GGCATATAAG GCCAGCTTGG ACAACTGGGA GTCTTATTCC GCTCTCATTT TTGTGCGGTA TCGGGTGAGT TCAACGCCGA TCCTCTCCTG AGAAGACACT TCTGGTTTGT CTGATTCGTT CAATAGAGCT GCAGTTCTTA TCTATTGCGG TTCAGTCAAA ACTAAGAAAC ATCAAGCCGT CTCCGAAAAG CTCAAACATG CGATGGGCAT CCCACTCGCT ATCGGACAAC CAGCCGTGAG GCCTCCAGGC GTTGGAGAAG ATCCAACATC AAAAGAAAGG GTCAAGCACG GGGGTGGTGG CAGTGAGAAT GACATTGTTG AAGAAAGAGG AGGGGGAGGG GAGGGAAAAA GAGCCCAAGC CCAAGCCCAA GCTGAGAGAG CGCCTGATGG AGGCGTGATG GTTGGGGTTG GCTCCAGGGA TGTGAACGGA GAGCAGCTAT CTGGTGGCGA TGAACGAGAT AACGGGAAGG ACAACGAGGA TGAGACAGAG GGAGGAAAGA AGGGGGACAG TCAGGGAGCA AAACAAGACT CGCTAAACTT TCTCGCACCA CTGCTCAGGC GAAGCAGACA TCATCCACAA CGCCGGGCGA CAGTCACGTT CGGTCCAACT GTCTCAACTT CTCCTCACCA TCCCTTGCCT TTACATCCCC AACCTCTGTC GTATTCGAGC CGCCATCACC CCCTTTTACA AAAACGGCAC CGCGACAATA CAATGTCAAT GCGCTTACCC TCTCCCTCTC CCTCCCCATC CTCTTCCCCC TCTCCCCCTC TGGGAATCAA AGTGGAAACA GAAACATCAG CCAAGCCGGA TCATCAAGAA AAAGACATTC CTTCGATTGG CGTAAGAAGG GTGGAGAGCG ATAGCGATGA AAAAGGAGAT ATAATGTTGA GTGAGGAAAA CTTGAGAAGG CCTAAGGAGT CTTCTGTTTG AGACCGATTC CATGTGGCGC AGATAATGGG TTGTGACCAT CGGATGTTTG CATATGTCGT TGTTTTTTTT CCCTAAGGAT AGATTTGTAC GTGCATTTTT GTGCGTGTTA GTAAGTA
|
Protein sequence | MFSARRQSST SSSNSAAIPL NHPTAPISNN PSSNEASQGT GSGNGDSPST STEGNGKQRP RRKVTIVPPV TAACLDDLRS ALPTGTILPK SSQANVPSTP GPHVTTGAST TRTSSKRRRA HTVSVCLPDS AKYVQLEEKD YPAVAQGARA GKRGARRQWS HDIELGMSST DTNPAHPPSS PVPGGRQRSV STASQSATYP HRRRRERRAS SPGPVGYEME GRDGDVELGD ELVGVLDVVD PQVSTVNHLQ NMCNSVMVPY LPQLWTRRPE VQLPSTPSDE DVGRRTLSLS RFVPGKSASE STALAAEDDT LPVTAPASWG GAHPLSIMEE EPEFQPQDST IQSLPSAFAT KQLIPSSTTS LPLPTKPTSL TSSPTASTIS LRRSTLFDKH IKSVLSTSTR QKILLALQGL WTFVKTPMGF LTAIYGFAVV FWGAAIVLFL LGWIPTSSKY RQDVWVEISS QVENGLFTVT GVGLIPWRVV DTYRMSVIWT LKRRAERRRE KMGLPPIEDE NDLPDPQDIP GYIHVLDEKE TAKLRHHQEK FALSQTWYKP HATATHRAFP IRWALWNTIL MDGNSFFQCI LCGCMWGMSE AAVLIYCGSV KTKKHQAVSE KLKHAMGIPL AIGQPAVRPP GVGEDPTSKE RVKHGGGGSE NDIVEERGGG GEGKRAQAQA QAERAPDGGV MVGVGSRDVN GEQLSGGDER DNGKDNEDET EGGKKGDSEA DIIHNAGRQS RSVQLSQLLL TIPCLYIPNL CRIRAAITPF YKNGTATIQC QCAYPLPLPP HPLPPLPLWE SKWKQKHQPS RIIKKKTFLR LA
|
| |
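The listed gene length follows from the start and end coordinates. The sketch below rechecks that arithmetic and compares it with the coding length implied by the protein length. It assumes the coordinates are 1-based and inclusive (which matches the listed 3587 bp) and that the coding sequence ends in one stop codon; the function names are hypothetical and not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 106244
END_BP = 109830
PROTEIN_LENGTH_AA = 812


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return abs(end_bp - start_bp) + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (3 per residue).

    Assumption: one stop codon terminates the coding sequence.
    """
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(START_BP, END_BP)
cds_len = coding_length(PROTEIN_LENGTH_AA)
# Remainder of the gene span that does not encode protein
# (introns and/or untranslated regions; the record marks the gene as spliced).
noncoding_len = gene_len - cds_len

print(f"gene span: {gene_len} bp")
print(f"coding length (with stop): {cds_len} bp")
print(f"non-coding remainder: {noncoding_len} bp")
```

Under these assumptions the span matches the 3587 bp in the table. The coding length is shorter than the span, which is consistent with the record marking the gene as spliced.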