Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNK00010 |
Symbol | |
ID | 3254597 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006680 |
Strand | + |
Start bp | 905 |
End bp | 4960 |
Gene Length | 4056 bp |
Protein Length | 1047 aa |
Translation table | |
GC content | 53% |
IMG OID | 638253494 |
Product | hypothetical protein |
Protein accession | XP_567769 |
Protein GI | 58260718 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCGTCG ACTTGTCCCA CCCCGACATC GCAGCCGCAC GTGCATCCAT CTCCAACCCG TCGGACCCTG CCGCCTGGCT TCTCCTGCAG TACGCGAGCC CCCCCTCCAC CACGCAACTC CCGCAGCTCC TTTTGCTCGA CTCCGGCCCC CTCCCGGTCT TGCCGGCATG GCAGCACCAT CTCCAAAACA CACACCAAAG TATCTTATTC GGCTACGCCG AGATTGCAGA GAAGGGTCTG GTCCTTGTGT ATCTCACGCC GGATGTAGGG TGAGTCCTGC CCAGCTCTCG CGCTGAAAAC AGACGCTGAT ACGTCTCGCA ACGCAGGGGT GTAAAGCGGG GTATGTCTTG CATCCGTCCC CCAGTAACTG GAAAGTACAG CTAACCACAC ATCCAGCACG AGCTATCGTG CATTCTCGAG CTTTCGCGAG TTTGTTCCCT GTACGTTTCT CTGCTCCCGA ATCTCCCCTC ATCTACTGAT CCTTTCTGCC TTTTTCCCCT TGTTCTCGTT GTCACTTCAC AAAAAAGGAC TATTCCGCAA TAATTACCAT CTCCCACCCT TCCGAACTCA CCGAACAACT CATCACCGAT CAACTCGCCC TTAACCTTCC TTCTGCCACT TCCCTTCCTG CCTGTGTGCC CGTGGCCTCG ACGCGGTACG TCGTTCCAGG GGCGGATGTC TTGGTACCAG ATCCACTATC ACCTCGAGCT GCTGGAATGG GCAGGGATTT ACCGAGTCTA CCTTTTCCCA CATCAAATCA ACAGAGTTTA CAAACGACGC AGCTACAGAC CTCACAGTTA CCGCCATTAC CACCTTTACC TCCTTCCCCC ATCCCGCCGT CACCTGTGTC GAAAGATGAG ACTGGAATCA ATCGTGTAAG CAATGCGAAC GAGATGAATG AGGACAATCA AATAGAGAAA GGAATGGATC TGGAAGAGGG AGGCGAACAG GAGCGCGAGG ATGGACTGGA GGATAGTCAT GGATCAAGGT TTGAAAATGG GAATGGAAAC GGAAATGTGA GCACAAACAC AAACACAAGC AGCAGAAAAC CAGTCCCCGC TCTCTCTCTT TCACCTTCCC CTCCCCGCCC ACCTCGCCAT TCGCAACCGC GTACACCACC CAAACCCCAG CCTATCCTCC TCCCCACCTC CGCTCATGCC CCTGCTACCT CTCTTGATCT TGTACCTGCC CCAGCTACCC CAACTCAAGG AACAGGAGGA ACGGAAACCC GCAAATCTTC AACTCCGCTC TCCCTAACAT CCCGCCTCAA AAACACATTC CACCGCTCGT CACTCAAAGA CAAAGATTCC CCTACCGTTG TCCCCTCCTC TCCCACCGCC ACACCTTCAT CCCCTTCCGT TCCGTCTTCT ACCAACAATG CCAAGCCAGA TGGTCTAGGT AGCCCGACCT TTAATCCCAA TGCGACTTCG ATGCCTCATT CGCCGAGCAC GCCAAGTGCA GGAAAGTTCA AAGCTAGTTC TCTAAGCAAA GTATTTGGGA AACGAAAATC CTCTGGTACC TCCACATCTA CGCCTTCCGG GGCGGGGACG TGGGCTGGGC GAGGTGAGGG ACCAAAATTG ACAGAAGAGC CAGAAGGGCA GAGGATCGAT GAATTTGGGA GACCGCCACA GGCGGAAGAA GGTCTTCAAG TGTTTTCCCA GTCGACTGGA ATAGACCCAC CGGCTGCACC TGCATCAATG CCGATTGGTT CTCCATCCTT GGCAACAGCA CCTACGCCCC CACCAAAATC TCCTCTTCCC CGTCCTTCCG ATCCTTCCGA TCCTTCCGAT CCATCCGGTC CCACGGACGA GCTTAACTCC TCTCCCACGT GTTTGGAACC AGGGCCAGGG CTCGCCACAC CCCTTCCGCC TAGAGGTAGC TCACTGCGTC CTCAACATTC CCCTTCACCT TTATCTTCTC CTGGAGCACA AGAAGGACAG AGCATCAGGG CCGGAAACAC CGCCTTTCCC CTCGCAGGCG TCCCACTCTT AGGTCCCGAA GCACCCCTGC CCCACCTCCC GCCTTCATCC CCTTCCTCCC ACAGTCACCA CGGCCTTGAC AGTAATCTGG AAAGAGTGGG AACACCATAC TATACCCCTT CACTTATGCA ACCTTCCCCT TCCTTTCAAC CGCAAGAAGA GGGAACTCAA CACCAGCCAC ATACTTATCC TTCTCGCCTC CACCCTCACA CGCAAATCCC GTCTCATTCT CATCCATCGG CAACGGCAAC GACGCTGCAA CCCCCGCTGT CACGTTCTAC ATCTCTTTCT CCCTCTGTCC CACTCCCCGC CACATTGGCC ACAACAACAT TGGACCCAGT TTATACAGCT GATACCCAGG GTGCTTACGC GAACGCGGAG GAGGATAGGG AGGAGATCGA GGCTGAGGAA GGAAAGCAAC GGGATAGGGA TAGGGAGAGG GATAAGGAAA GGGAAAGTGT TCTGCTAGCT TACGATCGGC CTGAGATTGA GCCAGAGTCG GATCCGAGGC AAGTGGAGAG TACGACTGAA ATCATGGAGG GAATGGAAAA GGGTTTCTTC CCACGTCAAG GTGAACAAGG TAGGTTTGAT CATTATTACA CATGCCGAAG ATGATTGCCT TCTTTTCTTG ACGCTTACCT TTTTTGTTCC CTATAGTGTC TGACGCTTCG GCTCCCAGCC CCGTTTCCCT ACGCGCACCT GCGCCTGCAT CTGTGGATGA GTACGACAGC CCATCTCATC CCTCGGCTTT ACAGCCTCCC TCGCCCCTTC ATCCCGCCGC TTCTATCCAT GCTACCGTTC CCCATCTCGC CGAGGGACAA GCAGAAGAAA TGGAGAAAGA AACTCCCGTG GAGACGGAAG AGGCAAAAGC GGAGGTAGAA GGTGATGAAC AAACTGAAAA AGAAGAACAA CGACAACGTC TCGAACGTGA ACGCGAACGC GAAGTCCTTG AAGCTCAACA AATCGCGCAA TGGCAAGCCG CCGAGACGGC TCGCTGGGAA GCTGAAGAAT CCGCTCGTCT CGAACAGGAA GAAGCGGCGC GTCTTGCCGC GGAGGAACAA GCCCGACGAG ACGCGGAGGA AGCGGAACGG TTTAGGTTAG AAGAGGAAGA AAGGATGAGA GCAGAGGAAG AGGTGAGGCG TCAAAAGGAG GCGGAAGAGG AGGCTCGCCG GCAGGTGGAA GAAGAACAGA GGCGCCAACG GGAGAAGGAG GAAGAGGAGA AGAGGCAGGT GGAGGAAGAG GAGAGGCGAA AGAGGGAGGA GGAAAGGAGG AAGGAGGAGG AGGAAAGGAG GAAGAGGGAA GAGGAGGAAG AGCGAGAGAG GAAGCGTATG GAGGAAGCGG AGGAGAGGAA GCGGACAGTG AGAGATGGAT TGGAGAGGGG CAAACGAGAA GGAGGAGTTA TGTTAAAAGG CGTAAGTTCA TCTTTTATAT GCTTTCAACC CAACCTCCTC CATCCGAAAA TCAAAAGAGG CTTTAAGAGA GCGCTGACGA AAAGCGGGCA ATTGCAGTGG GTCACGGTAC AAACGTGCAA ATCCCTGACG TGGCGAAGAC GCTATTTCCA ACTGTTCCCT CGTGAGATGC AATTGTTTAA GAACGAGAAT GTGAGTATCA TCAGCTTCTA TAATTTTATT TTTTTTGCAG TCATCCTTTC ATACCCTGTT TTGGTTCCAT TCTCTAAATT CGTGGGTTCT TGCTGACTTC TAAACTATCA CAATCTCAGG ACACCAAGCC GATCCAAACC ATTCCCCTCT CCAAATCCAC TGGCATATCA GAAACGTACG AAGAATCTCA AGTGAAAGAT AGCTTCATGA TCACGTCTGG CGATGAAGGC GAAGGTGGAG AAGCGTTTTT CTTGTTTACA GACTCGGCGG AGGATAAGGA GTTTATCTTG GACGGGATGA GGCTTTGTAT TGGGTGATCT TTTTGTTATT TTTATTATAT TTCTTGGAAA CTATTTCTTG GAAACTAGAA TGGGGGTTAT ACGAAGGAAG AAAAGGTGAT CAGAAGTTGG GCGCCGCGTA TTTTTTTGGT GCCAGTATCT GATCACTGGG TGCTGGACAT GCAGCCGTTA TGGTAA
|
Protein sequence | MSVDLSHPDI AAARASISNP SDPAAWLLLQ YASPPSTTQL PQLLLLDSGP LPVLPAWQHH LQNTHQSILF GYAEIAEKGL VLVYLTPDVG GVKRARAIVH SRAFASLFPT SQLPPLPPLP PSPIPPSPVS KDETGINRVS NANEMNEDNQ IEKGMDLEEG GEQEREDGLE DSHGSRFENG NGNGNVSTNT NTSSRKPVPA LSLSPSPPRP PRHSQPRTPP KPQPILLPTS AHAPATSLDL VPAPATPTQG TGGTETRKSS TPLSLTSRLK NTFHRSSLKD KDSPTVVPSS PTATPSSPSV PSSTNNAKPD GLGSPTFNPN ATSMPHSPST PSAGKFKASS LSKVFGKRKS SGTSTSTPSG AGTWAGRGEG PKLTEEPEGQ RIDEFGRPPQ AEEGLQVFSQ STGIDPPAAP ASMPIGSPSL ATAPTPPPKS PLPRPSDPSD PSDPSGPTDE LNSSPTCLEP GPGLATPLPP RGSSLRPQHS PSPLSSPGAQ EGQSIRAGNT AFPLAGVPLL GPEAPLPHLP PSSPSSHSHH GLDSNLERVG TPYYTPSLMQ PSPSFQPQEE GTQHQPHTYP SRLHPHTQIP SHSHPSATAT TLQPPLSRST SLSPSVPLPA TLATTTLDPV YTADTQGAYA NAEEDREEIE AEEGKQRDRD RERDKERESV LLAYDRPEIE PESDPRQVES TTEIMEGMEK GFFPRQGEQV SDASAPSPVS LRAPAPASVD EYDSPSHPSA LQPPSPLHPA ASIHATVPHL AEGQAEEMEK ETPVETEEAK AEVEGDEQTE KEEQRQRLER EREREVLEAQ QIAQWQAAET ARWEAEESAR LEQEEAARLA AEEQARRDAE EAERFRLEEE ERMRAEEEVR RQKEAEEEAR RQVEEEQRRQ REKEEEEKRQ VEEEERRKRE EERRKEEEER RKREEEEERE RKRMEEAEER KRTVRDGLER GKREGGVMLK GWVTVQTCKS LTWRRRYFQL FPREMQLFKN ENDTKPIQTI PLSKSTGISE TYEESQVKDS FMITSGDEGE GGEAFFLFTD SAEDKEFILD GMRLCIG
|
| |