Gene Information |
Locus tag | CNK01820 |
Symbol | |
ID | 3254585 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 531742 |
End bp | 533572 |
Gene Length | 1831 bp |
Protein Length | 571 aa |
Translation table | |
GC content | 56% |
IMG OID | 638253675 |
Product | conserved hypothetical protein |
Protein accession | XP_567661 |
Protein GI | 58260502 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCACATCG CATACATTCT CGGCCTCGTA CCCCTCGCTT TCGCCGGTGT CATCAAGCAC GACCCTCCCA AGTTCCAGCC TATTCAGTCT ACCAGGATTG TGCGGCTGCA CCCAAACGGG GACAAAAGCA AGTGCGTTGA CCTCCTGGGT AATACTCGCC AGGATGGTCA GCCCGTGCAG GTGAGCCCAA GCAATGACAT CACACCATAC TGTGCCACTT TTGCTGATAT CGCTCAGATT TGCGACTGCG ACGGTACCCC GGCTCAGGAC TGGGTCCTCA ATGCCGGCCG CGGTCAGACC AAGGTCCAGC TCGCCGGCAC CAGTTTCTGT CTCGATGCCA CCCACCCTTA CGCAGCCGAC GGGACCAACA TGAAGATCTG GAAGTGCTTG GACGTCCAAC AGCAAGACTG GTATTGGACG AGTGATAACA GAATCGTTCT CCGCGACCAG GGCAAGTGCC TCGACTGGGC CACTGGGGAT CGGTCTGATT TCAACCAGCT GCAGGTCTGG CGGTGCAGCA CGGATAACAA CAATCAGTGA GTCACCTGCT TGCAGTGGCC TGAGTTGATG GGCTGACAAT AATCTCACTA TTAGGGTCTG GACAACGGGA CCGGACTACG GTGGGAACCA TGGGGGTGAT GCTGGTGGGA ACCCCGGAGG TAATCAAGGC GATGATTCAA GAGGCAAAAC CAATACTGGT GGAAACCCCG GAGGTAATCA AGGTGGTGAT TCAGGAGGGA AAACCAATCA CATCATTCCC GACCCCCCAG GGCCAGACCC CAACAGCGAG CCCCTCAACC CCGCCCTTGA AGCCATTGTT AACGTCACCG AGGCAGCTGG ACCCTGGCCG CCCATGATCA ACTTCGACGG CGATTACAGC AATGACGACG TGACCGTATC GGACCAAGTA CCGTTCGACT ACTGTATCGG GGAGGGGTCT GGTAACCCGA CGGATGATGA AGGACAGCAG CAAGGACAAA ACTTCACAGC AAATGTAGCT GGGATAGGGA GAGACTTCTG CCTGGACAAT TTTGGCAATC CTGACATTCG GAACACCATT TCTTTCGACA ACAACACCAG CATTGGGAAC GGAGCGGACA CTGGGCGAGC CCTTCACAAG CGGACATTTG CGGATTCAGG GGCGACGGGT ACGCCCAACC GGTGGAGACG AGGGTCGGTG ATTTCCATTT GCGTCGAGAG GAACAACAAT TATCTGGTTC CATATGCGTC CTCCCCCGTT CCCATCCGAG CATCGGCTAT CGTCGCATCC GCCATGGTAC GTGCAATCAA CTTCTGGAAC GCAGGTCTGA ACAAGCGATT CGTCTCGTTC GAGTTTGTGG AGAACTGCAA CGACGCCGTG TTCCATACTC TTGCTGTTGA CCAGATCAAG TCTGCCAAAG AGCCTACTGT GCTCGCGACT GCCCCCTTCC CTCCTCGGGG TGAAGAGGGT GCTAGGAACC GCAACATCTT CGTGTGGAAT ACGGCTTTCG AGGCCAACTT TCAGAACGTC CTTACCTTTA TCATGTCACA TGAGCTGGGG CACACTCTTG GCCTGGCGCA TGAGGACTGC AAATCCAGAG ACCAACCTTG CGAAGTTATC ACTGACAAGG TGGCTGGGTC AGTCGTGGAA AGCCGTATCT CCGGCAGCAC CACACAGCTG TTCAATGGCC CCACCCCGCT TGACATAGCA GGGGCGAACG AGTACTACTC ACTTGCAGCG GGACCCAACA CCCCGGAGAA CATCGTACTC TGGCCTGCGA CGAGGGGTCC GTTTATCAAC TACCCGCCGC TACCGAAATG 
CAAGTGGTTC CTCGGTATTT GCTATTACTA G
|
Protein sequence | MHIAYILGLV PLAFAGVIKH DPPKFQPIQS TRIVRLHPNG DKSKCVDLLG NTRQDGQPVQ ICDCDGTPAQ DWVLNAGRGQ TKVQLAGTSF CLDATHPYAA DGTNMKIWKC LDVQQQDWYW TSDNRIVLRD QGKCLDWATG DRSDFNQLQV WRCSTDNNNQ VWTTGPDYGG NHGGDAGGNP GGNQGDDSRG KTNTGGNPGG NQGGDSGGKT NHIIPDPPGP DPNSEPLNPA LEAIVNVTEA AGPWPPMINF DGDYSNDDVT VSDQVPFDYC IGEGSGNPTD DEGQQQGQNF TANVAGIGRD FCLDNFGNPD IRNTISFDNN TSIGNGADTG RALHKRTFAD SGATGTPNRW RRGSVISICV ERNNNYLVPY ASSPVPIRAS AIVASAMVRA INFWNAGLNK RFVSFEFVEN CNDAVFHTLA VDQIKSAKEP TVLATAPFPP RGEEGARNRN IFVWNTAFEA NFQNVLTFIM SHELGHTLGL AHEDCKSRDQ PCEVITDKVA GSVVESRISG STTQLFNGPT PLDIAGANEY YSLAAGPNTP ENIVLWPATR GPFINYPPLP KCKWFLGICY Y
|