Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH03040 |
Symbol | |
ID | 3259122 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 249746 |
End bp | 251782 |
Gene Length | 2037 bp |
Protein Length | 610 aa |
Translation table | |
GC content | 53% |
IMG OID | 638258181 |
Product | hypothetical protein |
Protein accession | XP_572474 |
Protein GI | 58270636 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACCTCGC CCGATACACA GGAACGAGAA TGCGCGTCGA CACGAGCTGC GGCACCAGCT CAACTTACTG CAGACTCGCA CCTGACTTCA TTATCCCCCC CTTCTGCTTC CGACTCACCC TTATCCTCGC TCCCTTCATC ATATCAGACA CTCCCTGAAC ATCCGAAATC CCCTCGTGAC GATCCCATTG CATACGGCGG CCACCCCAAA AACAAAGACC GACCATATCC TCTCTCTCTC GTCCCCTTCG ACCTCAGCGA ACCTTATGAA GCCTCCGAAT ACCTCACTAT CCGCTCCTAC GATGAGACAT GCCAAGGCGA CTATCTGGGC CAACCCTGGG CCAATGAGTA CAGGGACGGG GTGATCGATG ACGAGCAGAT GTTCGATGAC AGTGTTTTTC GTCGCTATCT TGATCGGTTC ATCCAAGAGA ACTACAACCA CATGACGAGG GCTACCAAAT TCTTTCATCA ATCAAATTTG AAATCGACCT TCTACGACAG ACATGACAAA GAGTGGGCCA AGCTAGATCT GGTGGATGTG TACAGACGCC GGACGGGGAA AGTCCTGGGA GTCGTGAGTC ATGGCTTGTG CAAAAGGTGA CACAATGCTT ATTACGCGTT CAACATAGCC CTCTCAATCG CAAGACACCT CCAAATCAAA AAAGGATGGT GCCACTACTC CTGATGCCTC TCTTATCGGT GTTGTCAAAG ATCTCAGCTG CTCATCGGGC TGGAAACGAA CGCTTTATGC GGTCATCGAA TTGAAGTGGA TGAAACTAGC GACTTTTCTT ACCGGCGAGG CTAAAAGACA GAACGACGAG AAATCTCTCA GATACCTCTG CCAGGAAGGC GTGTTTCAGA CTATGTGGTA TGTCATATTA GGCTACGCCA TCTCGGGTTG TATTTTCGGT CTCTCTATAG TCAACGAATA CTTTTATCGG GTTGTGTATC TCTCTCGAGA CTCGACCCCA GACATTCCCG TACTTGCCCT AGAGGCAGAT AGCGAGTTTT TGGAGAAATC CAGACGACAT TTTGGATATC TGCCAGATGA TTACTCTGTC GAGGAACTCG CAGAGCTGCA AGACTTTTGG TCGTCGCCTC CCAATTGTCT GATCAGCGAC CGTGCCAACG CCACTTTGAA TAAAGAGGCA AGGTATCACC TCGATGCGAC CATTCTCTTG TTCCTCGCTC ATGCAGCGGC ACTTCCAACG CAACGCTTCC TCAACGACCT GCCCCTCCCT TTTGCTCATC ATGTTCCTGT TGATGCGACC GCTGATTCAG CCACCGACAT GAGATTGAAA GGATTAGAAG TTGGGCGCAG GCGACACAGT CGTCGTTCGA CCAAGAGGAA CAAGCGCACA TTGGCGGATT TGTATGATGA AGAGAAAGAT GAAGAGGACA AGCCAGGGGA CGACAAGCCA CCTGGCAAGG ATAATGATGG CTCGCACGGC GGAAACTCTG GCCCTGGAGG CGATAACTCA CGTGGCGGAG GGTCGCGTCT TGGTGGAGGG TCTCGTCCTG GCGGTAGGGG TGCAGGCGGA GGCTCCTCTT CTCGTCGTGC TGAGGCGTTT GACAGTCGAA CGTCCACTGC ACCGCAAGAG TTCAGGAGAG GCCTGGAGAG GCTATCCGCT CCTAAGGAAA TGTTCCACAT GAAGACGTCC ATCATGGCCT CCCTCCTCTC CAATGACCGT ATGCACCTTT TAATCTTCCA CGTTGAAGCT ACGCTTACGA ATCGACTGTA GGTGCCAGAT GCTCTAGGGC TCCCTCCTCC GTCGACAGTG ACTCCTCTGG ACAGTTGGAC TCGTCGTTTG ACACGTCCTT CGGCTCCAAT AAGGCAGGTC TTATCCTTGA CGATCTCCGC CACGATCCCC CCCCAATAGT CAACAAGCCC GACCCTGTCG ATATCGACCT TGAAGATATC GACCCAGAGT CGGGCGAGCT TACGTTGGCG GCCTTTAAGG ACCGCCTAAC GATGCTCGGG GTGCGGGTGA AGCTGGTCAC TCGGGACCAG ATGGGCGTCT TGTTGGCCCG GGGATGA
|
Protein sequence | MTSPDTQERE CASTRAAAPA QLTADSHLTS LSPPSASDSP LSSLPSSYQT LPEHPKSPRD DPIAYGGHPK NKDRPYPLSL VPFDLSEPYE ASEYLTIRSY DETCQGDYLG QPWANEYRDG VIDDEQMFDD SVFRRYLDRF IQENYNHMTR ATKFFHQSNL KSTFYDRHDK EWAKLDLVDV YRRRTGKVLG VPSQSQDTSK SKKDGATTPD ASLIGVVKDL SCSSGWKRTL YAVIELKWMK LATFLTGEAK RQNDEKSLRY LCQEGVFQTM WYVILGYAIS GCIFGLSIVN EYFYRVVYLS RDSTPDIPVL ALEADSEFLE KSRRHFGYLP DDYSVEELAE LQDFWSSPPN CLISDRANAT LNKEARYHLD ATILLFLAHA AALPTQRFLN DLPLPFAHHV PVDATADSAT DMRLKGLEVG RRRHSRRSTK RNKRTLADLY DEEKDEEDKP GDDKPPGKDN DGSHGGNSGP GGDNSRGGGS RLGGGSRPGG RGAGGGSSSR RAEAFDSRTS TAPQEFRRGL ERLSAPKEMF HMKTSIMASL LSNDRLILDD LRHDPPPIVN KPDPVDIDLE DIDPESGELT LAAFKDRLTM LGVRVKLVTR DQMGVLLARG
|
| |