Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK00960 |
Symbol | |
ID | 3254559 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | - |
Start bp | 299770 |
End bp | 302700 |
Gene Length | 2931 bp |
Protein Length | 881 aa |
Translation table | |
GC content | 59% |
IMG OID | 638253586 |
Product | conserved hypothetical protein |
Protein accession | XP_567784 |
Protein GI | 58260748 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCATGCCAGA CGCACTCCCA GACGACGCCT GGCTCACCGC CTTCCTCCCC ACCCCCCCCC GCCCCCTCCT CGCCGCCCTC CAGAAACGGC TCCACGCATC CACAGCCCAC CTCGCCGCCC TCGCAGACAT CTACAAGCAG CGCGCAGCCA TCGAGGCGGC CTACGCAGAC GGCCTCCACA AGCTCGCACG CACAGCAGAG CAGGGCGGCC TCACGGGCAA GGCCGGCAAT GACTGGCCAA GGGGCAGCGC CGAGGCCCGC CTGTGGGACA GCGTCGTCTC CGAGCTCGCA GAGGTGCGTC TCTCCGCCCG TCCGTGTGCG CCGTGCGACT GACCACTGCT GCAGACATCC GCCTCCCACT CGACCCTATC CGCCATGCTG AGGACAGACT TTGAGCAGCC GATCCGCGAG ATCCCCAACA AGGTCGTCGC ATGGCGCCGC ATCGGCGACC AGGACGCCAA CCTCGACAAG ACGCTCAGAG ACTATGAAAA GGTCTCCGCC AAGCTCGAAA AGGCGTCGTC CAAGTCCAAG TCGAGCAAGG TCGACGCCCT CCAGTCCGAC ATCAACAACA TCACCCACGC GCTCTCCTCG CTCTCGCCCA TGGTGTACAC GACGTACCAG CGACTGGACG AAGAGCGGCT CCGTGCGCTC AAGGAGATCA TCGTCAGGTG GGCCACCGTC AAGGGCGACA TGGCTTCCAG AGACGGACAG AGGGCGGAAG CCATCGTTTC CCACCTCCTC CAGTGGGAGA CGAGCGACGA AGTCATGGAC GTCGGGCGAA AGCTGGGTGC GATCGGCGGT GCTCGCGTCC CAGAGCGATC CGCCTCGGTC GCTACTTCCG CCACCACTCG TGCGTTTCTA ACCTACTTGT GTTGACGAAT CACTAACAAC AAGCGGCCCG CGGTAGCCCA ATCGAATCGC CGACTTTCTG CCGTTACATC TACCACCGCA ACCCACGGCG ACTTTTCGCC CCGTCCTCCC GCTGCCCGCG CCAACGGTTC TTCGACCAAT GTCAGTCAGG GTACGCCCGG CTCTTCCTTC ACCGGCGGTT TCAAATCCAT GCTTGCACGG TCAAAGACTG TGGGTGGAGG CAGGAACAGG GGCGAGAGTG ATGCTACTTC TACACGAAGC GGTACCAGAG GCGACAATTT TGAAGCCATC GGAGAGGAAG CTCCCCAGCT GAGAGAATCT TCAACTGTAA ATTCATCTTG CCTATCTACC GCCGAACATT TACTGATTTA TTTATTTATT TATTTTTTGG GTAAAAAGGC ACCTCCAGTT GACGAAGAGG GATTCTCCGT CGCTCCTTCC GACCGTCACC GAAACCCTTG GGAAGACCCC AACGAGCTTA TCCCCACGCC GGCTGGTCAG ACTGTTCCAT CTCAAGCCCA AGCTCCGGTC GTCCCCACCA AAGAGGCTCC CGCTTCCGCA CCTGCATTCA GCCAACCATT CGACTCTTCT CCCAACGCTT CCGACGAAAA CCTCTCCACT CCCACATCTT TGCAGCAGCA GCCTTTCAAA AACCTCTCCC TCGCACCTCT GCCTATCCAG GAGAGCGAAG AAGAACGTCG AGCAGCTTTG GAAAAGATGC AGAAAACGCT GCAGCTTCCG CCTTCTCAGC CTTCTAGACG GTCTACGATT GCGCGAGGGA GGAGGGATGT GAGGAACACG ATGTTTGCAG GGTCGACGGA TGAAGCGACC GCAGCTTCCA ATGCGGTCTT TGGTGCCGGT GCCGTCGCTG CAGGTGGATT CGCAAACGGG GCGTCTAAAT CGGCTGAACC AGAGGAGAAA CTCGTCGATT CACCTACCTC TACCGTCCAC ACCGGTGTAT CCCTCCCTCA CGACATGCCG TCTCCATCAC CCATCGCCGC CCGTCGAACT TCCCTTTCAT CCGTCACTTC CAACAATCCC TTTGATTCGC CCACCATCGG GCTCGGCGGT ACGATGACCC CGCCGATCAA CCTCCACACC ACAACGCCTC TCTCCGCCGC CGCCGCCGCG GACCAACCCG GTTTGCGCGC ACACGTCAAC GAATCGATCA ATGTCATCTT CCGCAACAAG CAGGTCCAGC GGATCCACAT TACAGGCGAG ATCCACCTCA GTCTTCGAGC AAAGGATGCT TCGTCGTCGT CGCCTTCCGC CCTTCCCGGT GGACCGATCC ATATCCGTCT CGCCGCGTTT GAGCACCTCG AGAAAATCGC GCCCAACCCG GCGTACCTCG CCCAAGTGCC CGATAAGCCG GGAGAGTATT TCCTAAACTC GGACGTCCTC GCTGCTGCCA CCACCGCCAG AGGGTCCGCC GCGGGCCCCC TGCTGTTCAA GTACGTCGTC CATGTCCAGC CGGGCAAAGA ACTGGCTACG GCCCCGTTGA TCCTTGATCC GGTGTTCCAA TGCAAGTCGG GCGAGACGAG GATGATTTTA CATTACAGTG CGAACCCGTC GTCTCCCCTC GCCACCACGG GCACACAGCT GGGTAGCGCC ACCGTTGTAG CCGCGTTTAC GCCCGGTGGA CCGAGCGTGA GCAACGTCCA GGCGAAACCG GCAGGAGGCG TGTGGTCCCC CTCGACGAGG AGGATGACGT GGAAAATGGA TTCGTTGTTG GGCACCACCG GAGGCAAGGT GATTGCCAAA TTCACAAGTG AACCGGGACA AGAAGCCTTG GTACCGCAGG TCGTACAAGT GTCGTGGGCG GCGGAAGGGT CGTTGATCAG CGGTCTGGGG CTCGACGTGG TGGATGGAGA GTTGGAAGGG AATCAGTGGG TGTTTGAGGA GATTCGGAAA AGTACGACGA CGGGCAAGTA TTTGGCCGAA CCGGTTGTTA CTCCATAAGT TTTATATATA TATTACACCC TCGAAAAGTA GTAAAAAAAA AAACACAATT GGTCATAATT TCTTTCACTT TTTCCTCTTG CAATGCAGAT GGTCTTTGTT A
|
Protein sequence | MPDALPDDAW LTAFLPTPPR PLLAALQKRL HASTAHLAAL ADIYKQRAAI EAAYADGLHK LARTAEQGGL TGKAGNDWPR GSAEARLWDS VVSELAETSA SHSTLSAMLR TDFEQPIREI PNKVVAWRRI GDQDANLDKT LRDYEKVSAK LEKASSKSKS SKVDALQSDI NNITHALSSL SPMVYTTYQR LDEERLRALK EIIVRWATVK GDMASRDGQR AEAIVSHLLQ WETSDEVMDV GRKLGAIGGA RVPERSASVA TSATTPQSNR RLSAVTSTTA THGDFSPRPP AARANGSSTN VSQGTPGSSF TGGFKSMLAR SKTVGGGRNR GESDATSTRS GTRGDNFEAI GEEAPQLRES STAPPVDEEG FSVAPSDRHR NPWEDPNELI PTPAGQTVPS QAQAPVVPTK EAPASAPAFS QPFDSSPNAS DENLSTPTSL QQQPFKNLSL APLPIQESEE ERRAALEKMQ KTLQLPPSQP SRRSTIARGR RDVRNTMFAG STDEATAASN AVFGAGAVAA GGFANGASKS AEPEEKLVDS PTSTVHTGVS LPHDMPSPSP IAARRTSLSS VTSNNPFDSP TIGLGGTMTP PINLHTTTPL SAAAAADQPG LRAHVNESIN VIFRNKQVQR IHITGEIHLS LRAKDASSSS PSALPGGPIH IRLAAFEHLE KIAPNPAYLA QVPDKPGEYF LNSDVLAAAT TARGSAAGPL LFKYVVHVQP GKELATAPLI LDPVFQCKSG ETRMILHYSA NPSSPLATTG TQLGSATVVA AFTPGGPSVS NVQAKPAGGV WSPSTRRMTW KMDSLLGTTG GKVIAKFTSE PGQEALVPQV VQVSWAAEGS LISGLGLDVV DGELEGNQWV FEEIRKSTTT GKYLAEPVVT P
|
| |