Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNM01040 |
Symbol | |
ID | 3255217 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006682 |
Strand | + |
Start bp | 311991 |
End bp | 315142 |
Gene Length | 3152 bp |
Protein Length | 882 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254255 |
Product | hypothetical protein |
Protein accession | XP_568305 |
Protein GI | 58261790 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.149993 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACGGGC CAACCCCACA CGACGAAATA CTAGCGGACC AGAGCCTTGC GCCTCTGAAA CGTAACCACG CATGTCGGCA ATGTAAAAAA CGCAAAACAA AATGTGACGG TGCCCATCCC GTGTGTTCAC CATGTCTTCG ATCGCATGCG CATGCCGCTC GTTCTGCGAA TCGGAATGGA ACAAGTGTGC CTGTGCTTGT TTGTACTTGG GCCGACGGCG AAGGTGGGGA GAATGGGAGT CCTCCTATTC AGGAACCCAT GCAACGCCTT TCTAGGCCTG GGTCTGCATC TGGAGTGAAA AGACCAGCAG TCTCTCAAGG TTCAAGACCA ACTCGCGACG AAGAGAATGA GATTCTCAGA CGGAGGATCG GTGAGTAGCT GAAAAGGACT TTTGTGTGGT CGACATTAAC TTTTACGTTA CTAGCCGATC TTGAAGCCAA ACTCGTCAAT CTATCATCCG CCACCAGACA ATCCGGGCCT GAACCCATTG GACCAGAGCT CTCGATGCCA GAAAACACCC TTTCACCACC ACGCTCCGAC AGTATCGTAG ATAATTGGAT CACAGAAGTC GGACATATTA TCAGATATGA TCTCAATCAT TTCGGGACCT TTGGGAATAA GGAGGCGCCT TCGAGCAAAT CTTCTCAGCC CTCCGTTTCT TCTTTGAAGG CCCCGCCTGG GGGTAATGGG GGTGGTATCG GAACTACCCA GACACGGGAG AGCAGCGGTT CCGGCACTCG CCTCAATATC GGCACTATCC CTAAAGATGG GTTCTCCAGC AGCTTTGGAT TAGACGACCT TTTCGTCATA CCTGCTGATT GGCCACGCGG TCTGCCATCA CCTTGTCAGT CAAACATTCT TTTTTCGAGA CCGTATGTCG CTAATACACT TCCGTAGTCC TCCTAGAACA CCTCGTCGAG ACCTTCTTCA ACCACGTCCC TCAAACTCCC CGAATGCTCC ATCGCCCAAC TCTTCTCACT CGTATCAAAC TATCCCCCAC CTCTGGCAAC TTCCCCTTCC CCGGTCTGCT CCATGCCATC TGTGCAACTG CTTCCAGCCA TACCGCATGG GTGAATAATC TCTCTCCTCA TCAGATCGAA GCTGCTGTGC AGAGGCATGT CATCACCGGT ATGGATTTGA CTAGTATCGA AGATTTCGGG TTGGCTCAAG CCGAAATGGC AAATAGGTCA GTTGACCTTG TTGCTTCTGC TTGTGTGATG GGTGGAGGGG ACCTGATCTT CCAAGTCACT CAAACCTGCG TGAGTTCATC TCCACCAATT AACTGAAGGG TTTTTACTCA TAATATCTAT CTGTAGATCC TTCTTAGTGA TATTTACTTT TGCAAAGGTT TCCCTCTCAA AGGATGGTTG CTTGGTGGCC AACCGGCACG ACTTATCAAC ACTCTCCAGC TCAGTGACCG CAACCCGCGG AAGTCATACA AGGAGCCTCT CTTGCCGCGT CCTCGAAATT CAAGAGAGCG CGAGGAACGA TTGGCTACTC TGTGGATGGC GTTTATCAAT GATTCTGGCT TGGCTTGCAA CAGCACTTGG GTGCCTAGTA TGTCTCTTGC CGATATAAAA TGCAATTTAC CTACTACCGG TCAGGAATGG TCAAAGGTAC GCACGTTTCG ATTTGCACAA TGACTTTTTA ATGCTGAGCA CAAAATACAG CTGGACGATA TGCTGGAAAA TCCCCAGAGT CCAGAGTCGG GAGATCTCTT CACATCGTAA ATATCTTGTG ATCCTGTTGC CGACTAGCTT ACTGAGGTTT CATCTTTTAT CTAGTCATCC CATGGAAGAC GCATTCGTGT TGGTTATCAA GTCTACTATT CTGCTGAGCG AAGTTGCACA GTAAGCCCTG GCCGATGAAG TTGTCCCACA AGTCGGATGA TGACTAATTA CTTGTCTAGA TGGCTTCGCA ACTGGTCTCA ACGAACACAG GTCCCAGGAG ACGAACTGGC AGGACCTGAA ACGGAATCAT TCAAGACTGT TGTCCGACAT ATTGAGGACT TCATGTGAGT GTACCATTAT ATCCATGAGT CTAGGTCTAA TTTTCCATAC TTCCCAGTTC AACCATCCCT AATGCGCTGA AGAACGTATT TAAACTTGTG GACTCTGCCA ACTCTGGCGG CCTCAACGTT AATCTTCTTT CACTGCATAT TTTCCCCAAC GTCGCGCTCG CTCTCATGTT CGAACCTTTC ATTGAATGGA AGCCGTCGAA TCAGTGTCTG AAAGCTACGC AACAAGCGTA TGAAGCCATC CTCGGTGTCC TGCACCTTAT CCCGAGTAAC TTGGATGTCA CTATGGTCTT TACTCCCCTG ATCGCTTGGT GAGACAGTTA GATATTACAA AGAGGTATAA TGTACTGACA GACGATGAAT AGCTCTTTAT ATACTGTTGG ACGAATCATA GCGGATTATG TCAAGTATAC CATGAGGTCT CATCAATACA GTCTGGCCGT CCGTTACCGT GCCGATCTCA CCACTATCCA GAACCGTGAG TAGATACTCT GTCCGACGCA AACATAGTAT TCAGCAAACT TATCGCGACC TTGCAGTGCT CGAACGGTAT GGCCAACGCC ACTCTCTCGG TAGCGCCATG TCCCATTTCC TCGAGAACTA TGTCCAGTAT CTCGGAAACG AGTGCATGGA CCCTGCAGCG ATGTGCAGCA AACTCGAACG TCAATTGGCT TACCCTACCA ATAACGGTAC TTATGTCATG GGCGCGGCCA ACGATGGTTA TGCTCATCTT AACGACCCCG GAAGCTCTTG TTCGGGACCC AACGACCCTC CTTCAACCAA ATCATTCTCT GATTCATGGT CAGCCAGCGG TCCTAGTCCA AGCGTCTCTA CGCCTGCTAT ATCAAAAACG AACGAACCCT CTCCTGCACA AGTATATTCC ACGGCTGAGG GGAAAAGTCA AGATCCAATA TCGAATTGGG ATTGGGGAAG AGAAGCCATC AAGATGATGG GTGTGGATGC GAGTAGATCT GTTTCGGGGC TAGCGGCGTT GGATGGTATG CCAATGTATA TGGGCGAGAG GTCGAGCTTG AATATGGATG GTCGGTTACC GGTTGGACCG TTTTCGGATG TAGCAGAGAT TGGGGGATTT GAGGGCTTGC ATTGGAAGAC GGGCAATAGT GATACCATTT AG
|
Protein sequence | MNGPTPHDEI LADQSLAPLK RNHACRQCKK RKTKCDGAHP VCSPCLRSHA HAARSANRNG TSVPVLVCTW ADGEGGENGS PPIQEPMQRL SRPGSASGVK RPAVSQGSRP TRDEENEILR RRIADLEAKL VNLSSATRQS GPEPIGPELS MPENTLSPPR SDSIVDNWIT EVGHIIRYDL NHFGTFGNKE APSSKSSQPS VSSLKAPPGG NGGGIGTTQT RESSGSGTRL NIGTIPKDGF SSSFGLDDLF VIPADWPRGL PSPFLLEHLV ETFFNHVPQT PRMLHRPTLL TRIKLSPTSG NFPFPGLLHA ICATASSHTA WVNNLSPHQI EAAVQRHVIT GMDLTSIEDF GLAQAEMANR SVDLVASACV MGGGDLIFQV TQTCILLSDI YFCKGFPLKG WLLGGQPARL INTLQLSDRN PRKSYKEPLL PRPRNSRERE ERLATLWMAF INDSGLACNS TWVPSMSLAD IKCNLPTTGQ EWSKLDDMLE NPQSPESGDL FTSHPMEDAF VLVIKSTILL SEVAQWLRNW SQRTQVPGDE LAGPETESFK TVVRHIEDFI STIPNALKNV FKLVDSANSG GLNVNLLSLH IFPNVALALM FEPFIEWKPS NQCLKATQQA YEAILGVLHL IPSNLDVTMV FTPLIACSLY TVGRIIADYV KYTMRSHQYS LAVRYRADLT TIQNLLERYG QRHSLGSAMS HFLENYVQYL GNECMDPAAM CSKLERQLAY PTNNGTYVMG AANDGYAHLN DPGSSCSGPN DPPSTKSFSD SWSASGPSPS VSTPAISKTN EPSPAQVYST AEGKSQDPIS NWDWGREAIK MMGVDASRSV SGLAALDGMP MYMGERSSLN MDGRLPVGPF SDVAEIGGFE GLHWKTGNSD TI
|
| |