Gene Information |
Locus tag | CNL03950 |
Symbol | |
ID | 3254758 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | - |
Start bp | 82145 |
End bp | 85590 |
Gene Length | 3446 bp |
Protein Length | 1075 aa |
Translation table | |
GC content | 55% |
IMG OID | 638253867 |
Product | hypothetical protein |
Protein accession | XP_568210 |
Protein GI | 58261600 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.215848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGAGTC CGAACACTGC AAAGCGGGCT GCTGCTGCCC TTGACAGGTG AGTCCCTGCT CGTGCATGCA CCGTGAAAGA GAAGAGGGGC GCTTATGCAT GGAAACAGCG AGGCCAAGTC AGAGTCAGAC CGCCTTCGTA ATGTCCTCGG GTCGCAGCCG CCATGGAGTA AAGAAGTAGA GCTCCATCGC CAGCAGTCAG TCTTTGCCTC TTGCAACCAA ACCCATCCAG CTAACATGCA TGCGCTGGAA GATGTCGTAA CGCCCATCTC ACGCTCCTCT TCTCCCACCC ATTCTCACCG TATTCGCAAT CCCTCGACTC GCTGTGGCTT CACACCACAT ACGTCCTTAT CCAGGCTTAC CGCGACCTCA TCTCCCGGTT AGAACGTTTC CCACCTGCTG CCTCGAATAG CAACGGGGGC AGCAAGGGTC GACGAGGAGG AGGAGGCGGA GGTAACTCGG AGCTGAAGAA GGCGCTTACG AGGTTCAGAC AGGTGCTCGC TAGTGAAGAG ACGTTCTATC GTTCGCTCGC CGCTCGGCTC GTACGTTTCT ACAATCTAGG TGAAATTTCA GGCGTAGATG AGACTCTCAA GGCCGTCAAG CTCCCAACAG AGTACATGCC CACTAATGAC AGCGGCGAAG ATGAATCGTC GTCATACGCC GCGCGATTTG CCTCGTTGCA AGAGAAGAAG GACAAGATTC CCTTACTCTA CAAGGGCCTG ATCTGCCTAG GTGACTTGGA ACGGTACAAG AAGCAATACA AACAGCCTGT CAACAATAAC CGCCATGCGC AGGAGCGAGA GAGGCAAGCG GATAAATTTG AAGTCGCTGA GAAGTATTAC TTGGCAGCTT GGCGCTTGAT GCCTGACGAT GGTTTGTTGC TTCCTCTTTA CAACTTGGTT CATGAACTGA CAATGTATAA AAGGTGCGGC ATGGAATCAG CTCGCAGTCA TCTCGACATA CGTCCACAAC GACTATTCCA CCACGTACTA CTACTATCGT GCTCTCGCAG TCAAGAATGC TTTCCAAGGC GCCGACGGAA TCTTGCAGAG ATTCTTCGGT AGAATCTTTG ACAAGTGGCG AGCAAAGAGA AAAGACGGTG ACGGCGAAGA AGGTGGTGAA GTGGGAGATG GTGTGGAAAA GTGGAAAGAG GAGATGGTTG TGCTCATGGC AATCCTCTAT CTTAAAGCTG GGTGAGCCGC CTGTTTCCGT TTCGTTGGAC ACATGTTGAC AAGATTACAG ATTCACATAC ATCCCGACTA TTCTCCCACC TCTCCTCACT TCGCTGAAAG ATTTTTTATC TGAACGACGG CTCCCCACCG AGTCTATTGT TCAGCTCACT TCCATCCTCC TCGGCTCTCA CTTCCGCGCC CGCTCCACCT CTGGCCTGGA GCAAGACCCA AATTTACTCA AACGTTCCTT CGAAGCCGAA GGCAAGACGC TCGAGGTCGC GTTAGGAGTA TGGAAGATCT ATCTGGAAAT CGCGCGCGAA GAGATCGACG AGGCTAGAGC GAGCTTGCGA AGAGGTTTGG AAGATAGTGC ATTGTTGGAT GATGAAGATG AAGAAGAACT GGAAACGGAT GAGATGCCGC AACTGATTTC CGCCGTCTTG CGACGTATCC TTCCTTCCCT GCGTATCATC TCCAAATGGC TCAAGCTAAA CACTACCTAC CTCTCTGGTC TCGCATCGCC ATCAAGCAAC GATCAAGTCT CTTCGCCGGA GCTCAGGGCG GCGATATCCT CTTTCTGGAA CACTTACAAC GCCTTTTTCC AATCCTCTTC CAGACTCTTC ATTCTTGAAC 
GTCTTCCATC GATCACGAGG CCTCTGGAAG AAGATATTGA TATGCGAGGA TTCGCGCCTC TCCAAAAGGG CAAGACAGCT GATATGGACT CTTTTGCAGC TGGTGCGACC TCGGCGACGG ATTCAGCGGA TGCAGAGGGA GATGTGGATG GTGGACAAGA GGGGAGGGAG GTACACCCGA ATGAGGAGCA TCTGATGAGG TTGGGGGATA TACAAGTGGA CGGGTTGTTG ATCGGTCAGA GTTTGGGGTT TACTGATCCT CTTCATACCT TTACCACCGG AACCTTTGCC ATCCCCGAAG CAACAGAGTT ACCAAACGTG CCGCGGGTGG AAGAACATGA AAGGGAACAT GATCTGCAGT CAATCTCGAC CAACACTGAG GATGATCCGG TCAACCTCGC TATGCGTGCA AGTCTTGGCG TTGAGAGTGT TGGTGAGGAA GACGAGGATG ACGAGGAGGT TATTGTGTGG AGCCGTGGAA GGGAGAATGC GGTGGATACC AGCGTATACC AGCAGGTCCA AACCATTCCT GCTCAGCTCC CGGCCCCAAT GGTCCCTCCT CCCAAGCACC GCCTTCAACA TCAGCAAAGC CAAACAGCCA TGGATCTTTT ACAAGATCTC CTACAGTCTA CCCCGCCCCT GGCGCACTCG TCGGGCAACA ATTCGCCTGC TCTTGTAGCT TCTTCACCGT ATGCCCATCA GTCCCCGTCT TATACTCAGA CTGGATTTCC CGGCATGAAC AGAAACATGA GCGGACCTAG TGCTGGTATT CCTTCTCCCC ATAATACGCA AATGCCCCCC GGCCTGCCTA TACTCGGCCC AGCCCCTGTT CCGCAATCTG TACGGCCTGT ACAGCATTCT GGACAAGCCA AGAATCCCGG CCAGGTACCT TCTCTCTTCA TGGGACCTCA GGGGAATAGT ATTTGGACTA TGACTCGTGA AGAAAGCCAG CAAGGTCGCG CAAGGAGAGC TGGTAGTGGC GCTGGCACAG GCCCAGGTAT GGGTTGGGTA GGGGGTGATG AAGGTCAGCA AGGTTCCGTT GCTTCCCAAG GCCCTCCTCC TCCTCCTGGT TTCGGAATCC CTCAAGCACC TCAGATTGCT CAACCTTTGG CGACGCAGAT GCATCAGCCT CAACCTCAAG CGCCTGTGCC GATCCAGCAC ATGGGGTTAT CCACTGCTTC TGCTACTCTC CAGGCCCAAG CCACTTTCCC CCGCTCTTCG CCTGGAGCCG TTGCGCCTCC AAACCTGACA CACCGGCCCC CCTCTATCCC CAAGCTCCAC CATGAATCGC CAGCGCGACA GACCCCCGTC CAAGTCCAAC ACCCTACGCG TCAACCGCGC GGACCGTCGG CCAACCCTTC GGGGACTTGG GGGGGCTTGT CCAACATGAA TATGGGGGGT ACAGGCATGG GCATGGGCAT GTCCAATGTC CCGTTGCCTG TACCCAAGAG TGAAGGGGAG GCAGCAGTAC CGTATTATAT GCGGCCTGGA GTGTTTGGTG GAGGTGCAGC GACGCCCGCG GGGGCGGGTG GGATCAGTGG AGTGGGTGGA GTAGTACCCG CAAGTGGAAG GGATGGGGCA GGGCAAGGGA TATGGGATGG CACGAGGGGT TTCGGGCAGG GACAAGGCTG GGTCAGCGGC TCGTAA
|
Protein sequence | MQSPNTAKRA AAALDSEAKS ESDRLRNVLG SQPPWSKEVE LHRQQCRNAH LTLLFSHPFS PYSQSLDSLW LHTTYVLIQA YRDLISRLER FPPAASNSNG GSKGRRGGGG GGNSELKKAL TRFRQVLASE ETFYRSLAAR LVRFYNLGEI SGVDETLKAV KLPTEYMPTN DSGEDESSSY AARFASLQEK KDKIPLLYKG LICLGDLERY KKQYKQPVNN NRHAQERERQ ADKFEVAEKY YLAAWRLMPD DGAAWNQLAV ISTYVHNDYS TTYYYYRALA VKNAFQGADG ILQRFFGRIF DKWRAKRKDG DGEEGGEVGD GVEKWKEEMV VLMAILYLKA GFTYIPTILP PLLTSLKDFL SERRLPTESI VQLTSILLGS HFRARSTSGL EQDPNLLKRS FEAEGKTLEV ALGVWKIYLE IAREEIDEAR ASLRRGLEDS ALLDDEDEEE LETDEMPQLI SAVLRRILPS LRIISKWLKL NTTYLSGLAS PSSNDQVSSP ELRAAISSFW NTYNAFFQSS SRLFILERLP SITRPLEEDI DMRGFAPLQK GKTADMDSFA AGATSATDSA DAEGDVDGGQ EGREVHPNEE HLMRLGDIQV DGLLIGQSLG FTDPLHTFTT GTFAIPEATE LPNVPRVEEH EREHDLQSIS TNTEDDPVNL AMRASLGVES VGEEDEDDEE VIVWSRGREN AVDTSVYQQV QTIPAQLPAP MVPPPKHRLQ HQQSQTAMDL LQDLLQSTPP LAHSSGNNSP ALVASSPYAH QSPSYTQTGF PGMNRNMSGP SAGIPSPHNT QMPPGLPILG PAPVPQSVRP VQHSGQAKNP GQVPSLFMGP QGNSIWTMTR EESQQGRARR AGSGAGTGPG MGWVGGDEGQ QGSVASQGPP PPPGFGIPQA PQIAQPLATQ MHQPQPQAPV PIQHMGLSTA SATLQAQATF PRSSPGAVAP PNLTHRPPSI PKLHHESPAR QTPVQVQHPT RQPRGPSANP SGTWGGLSNM NMGGTGMGMG MSNVPLPVPK SEGEAAVPYY MRPGVFGGGA ATPAGAGGIS GVGGVVPASG RDGAGQGIWD GTRGFGQGQG WVSGS
|
| |
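The length fields in this record can be cross-checked with simple arithmetic. The sketch below is only an illustration, not part of the database record: the function names are hypothetical, and it assumes that the coordinates are 1-based and inclusive (which matches the reported 3446 bp) and that the gene span runs from the start codon through the stop codon (the listed sequence begins with ATG and ends with TAA). Under those assumptions, the difference between the genomic span and the coding length would be the total intron length, which fits the "Is gene spliced: Yes" entry.

```python
# Values copied from the record above.
START_BP = 82145
END_BP = 85590
PROTEIN_LENGTH_AA = 1075


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


def gc_percent(seq: str) -> float:
    """GC content in percent; whitespace between 10-base blocks is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


span = gene_length(START_BP, END_BP)      # genomic span in bp
cds = coding_length(PROTEIN_LENGTH_AA)    # coding nucleotides incl. stop codon
non_coding = span - cds                   # assumed to be intron sequence

print(f"span={span} bp, coding={cds} nt, non-coding={non_coding} bp")
```

To check the reported GC content, the full "Gene sequence" field can be passed to gc_percent as a single string; the blank-separated blocks do not need to be joined first.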