Gene Information |
Locus tag | CNH01190 |
Symbol | |
ID | 3259129 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 838605 |
End bp | 843469 |
Gene Length | 4865 bp |
Protein Length | 1464 aa |
Translation table | |
GC content | 51% |
IMG OID | 638258364 |
Product | conserved hypothetical protein |
Protein accession | XP_572309 |
Protein GI | 58270306 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00756] pentatricopeptide repeat domain (PPR motif) |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AGCACATCTC CGCCACCGTT GTCTCCCTCC CCCGCTGCAC TGCAAGAAAA GAATGACAGC ATAGAAGAAC CAACACGGAT CAGGACAACT TCACTTCACC CCCATTCGAC ATTCGTTAAC ACCTTCATCT TGCAGCCATG TTCACAAAAG CAACGTAAGT ACTCCCAAAC ACAGCTCCCA GCTCCAGCTA AATTCCCCTA GTTCCCACCT CCGTCCCTTC ATCCGTCTCC CCTCTTCTCA CTCCCCTGCC TCTCCAGACC ACTTCACCGC CAACCCTTCT CTTTTACACC ATCTCCCTCA GCATGGCACG AGTAATTCAG TAATTGTTCA GGGACAACAT TCTGCCCAGG CCGGTGGTGC GTCTGGACAT GCTGGACGGA CAGGCTATGG TGGCGGCGCA GGTGCCGGTG GAGGATATAC CGTGAGTGGG GAAGTCTGCA AAATGCATGG TGGCAAGAAA AGCTGATCAG ACGGTAGGGT CACGCGCGAG CTTTCCTCTC TCTCCCTCAA ACAGCTTCTT TCGATCCTTC TTCCACCCTC TCTGACGAGC GAAAAAAGGA CGCTTCAAGT TCATCCTCTT CAAATACATC TCTACTCCTT AAACACCGTC TGTCCAAGAC ACCACGAATA GTTCACCTCA ATGATTCTGT TCGGGCGCGA CGAGCAATTG AAGAACGAAA AGGGGGTAAG GATTTTACGG TCGTTGAACT CGAAGATGTT CCCAGAATCC ACAGACGAGA TTCTATCGCC GAATATACCT CTGGCCGATT ACCCTTAACG TCTACTTCAA CCCGATCACT ATCCCGTTCC CATACTTCTT TCGACATTTT CCAGGTCGGT ATTCCTCAGC CTCGATCAAT AGGTCTTCGT GCTCTTTCGA CAACATCTAA TGCACCTCGA GCTCTTGAAG CAGAGGATAT CGAGGCGGTT GAAGAATTGG AGTTGATGAC TCCTCGTGCT GGGCGAGAGG TTAGGACGAT CGGTCAGCCC AATCGAGTGT TGATGGACCT TGCTGGACGA GATTTGCCCT CAACAGTCAC AGCTGGTTGG CAGTTGGGAG GGATCAGGAG GAATTCCACT GCTGCAGTTG AGCGGCCATC GCTCGACCTC CCTCCTGCTG ATCTTTCCAG TATTGCGACT CACGAGCAGA AGGAGGACAA ACAGCGTGAA GAGGCGATTC TCCAAGCACT CCGCCAAGCG AAACAGAATG GAAATGCCGA ACTCGTGAAA AACCTCATCA CCCATTATCG TTCTCCTCGA TCGCTTCCAC CTTTCTCTCC AGACAGTCTT CCTGGTGTTC CAGAATTGTC AAAGCATTAC CCCCTTCCAT CCGGTTATTC TATCAGGATT TACAACGCCT GCCTCATTGC GGCTCTTGCG ATTCGAAGTC CTGGTCAATC CATCGCGCCA ATCCTCGAAA TTTACAACGA GCTTTTAGAG AAAGACATGA TCCCTGATAG TATGACCTAC GGCGCAGTAA TCCGAGCTTT GGCTATGAGA GAGCGAGACG TTAGAGGGAG TCAGGAGATC TGGGAACGCC AAAAGACTTG GGGCTTGTGG CGGTCCGAAA TTAGTGGAAA TCAGACTTGG GATCCTGAAG TAGCGGCTGA AAACGACGCA AACATAGAGG CATACCTCGC CGAAAACAAT GTTTTGTCGG CAGTCAGGCT TTTCCGAGCG GCGGCTTTAA CAGGCCAGGC TGGGCGATTT GTCGTGTCGA TACATGGCAC TGTGTTAGAC GCTTTGAGCA AACTGGAGAA GCCTGATGTA ACTGTGATGG AACAGTTGGT CGAGCACGCC 
GAGTCCCATG AAGTGCCAGG TATAATATCG CTTTACAAGC ACCTTTTCTT GGCCTACGCT AAAATCAAGG ACGCTTCTGC TCTTAGTCAA GTATGGAACA AGTACCGAGG TCTGGCCAAG ACCGCAAGTC GAGAAGACTG GGCCGCTGCG TGCGGTCAGC GTGAAGCCTA TGATGTCAGG ATCGAAGGAG CCATGCGGGA GCCTTGGGAT ATTGTGATAA GTGCTTTCAT TGAAGTTGGG GAGTTGAGCA AAGCGTTTGA AGTTTTTGGA GAAATGGCAG AGGTCGCAGA AAAGAAAACT AAGGCTGAGG TGCCGCCTGC CACCCACAGG ACGTGCGGTG TACTTGTAAC TGCCCTTGCT CGAGCTGGTG AATTCGACCT CGCCTTAGAG TGGTTCAACA GCTTGCAAGC CTCTGCTATT GTTACTCAGA ACTCGCCCCA TCGACTCACC CTTGAACACA CCATTGTCCT TGCAGAACAG CTCCTCCTCA AGGGACGTTG GTTTGACGCT GCTGACGTCG TTATTGGGCT TGGGGGGATT CGTGATGAGT TGGCTCAACC AAGCGCCCAA CACAGCCTGC GCACCGTCGC GTGGAAGACT AACTTTGCTC TTATTCACCA CGCCGCTCGC TCGACTCCTG AAGAAGCTGC TCGCACGCTC GCCCGTGCGC AGGAAGTACT TGCTGCCGTT CCCATTCAAC TCGATCTGCG AACTGTCGTC CTCCACCTTA CACTCCTCGC TCAGGCTAGC GATTATACCA ATATGATTCG CGTCGTTGAT CTCATCTCTG CCCGAAAGCT TCCTGCCAAC ACCATCCAAA GACTGAATGA TGCTTGCGAC CAAATCGCTA AGGAAGACAT ACCCTTCAAG TACCGAATCG AGTTGATCAA GGCATTTGGC CGGCATCACG CTGATTTGAC GCCATTCGTG GCGGAGAAGC TCATCCAGGC ATATATCTCA GAGAAAAGCA AGCCTGTTGA TGAGCGCGAT CTGAGCAAGG AATCGCTCTA TATTCTTCTC GATGCCTTCA CTGTCGTTGA ACAAAAGCAA GTAAACGAGG GGGAATATGA TGAGGCGCTT GAGCAATTGA TGCAGGATCT CGCGGCGACC GACGTCAGTG AGGATATGAA GAATGCGACC TCAAACCAGG CCATCGGCTA TTTAGTGAAA AACGTGATAT TCAGGTTCGG TGCTGAGCGA GCTCGATCCA TGTTGGGTGC GGTCTTTGGC GAGCAAGAAG CCACTAACTT GGTCCAACCC GTTTCTCCCA CTTTTATCCA ATCCGAAACT GTCTCTGGTA GTGAACGTCT TTCCGAAGCC GCCTCTCCCA GTCAATCTCA TGCCTCCTCG GCTACCTCTG CCCCTGCCCT TTCTTTCTCA CAAACGCTTA CTTCCTCGAT CGAGCGCTTC ACTTACCGTA ATCCCACCAT CACTCCTCTA GAAGCTTACG CACTCCTTCG AGACGGTCTT TCCGTCTCCC AAGTCCCTCG ACTCGAAATC ATTCTTTCCT TGATGGATCA TCTTGCCCGT CGCAACGATG AACCTAAAGT CCGGGAGCTT TACGAGCTGG CCCAGGTTGT ACTCAACTCT TTGGTCAGAC CAGATGTTCA AGCCCAGAGT TGGTACTCTG CCGAGAACGC TATGCTCATC GCTTGCTGTC ACCTTGGTTA TCTCGAAGAG GCCGGTCTCC ATCGAGCTCG TATTGTCCAA GCTGGCTTCA CGCCCAGTGC TGATGCTTAT GCTACCATGA TTGCCTCTTC AAAAGACACC ACCGATGATG CGCTGGTCGC TAGAGAATTA TGGGAAGAAG 
CTCTCGGTAT GGGTGTGAAA CCTCATCTGT TCTTGTACAA CACTGTTATC AGCAAATTAA GTAAGGCCAG GAAGGCGGAA AGTGCGTTGG ATCTGTTCAT GAGGATGAAG TCCGAAAGGA TTAGGCCATC CAGTGTTACT TACGGAGCTG TTATCGTAAG TTTTGCCTTT ATATCTGCAG TTTCTTGAAC TAACAAAGCG TTCAAGAATG CTTGTTGTCG AGTTGGCGAT GCTTTGTCCG CAGAGACTTT GTTCGAGGAA ATGACTTCCC AGCCCAATTT CCGACCTCGT GTCCCTCCCT ACAAGTAAGA CATACTTTTT ACGCTTGTTG AGTTTCTGTT GACATCAAAT CCAGCACTAT GATGCAGTTT TACCTTCAAA CCCAACCCAA CCGAGAACGT TTCCTCTATT ATTATGGTGC CCTTCAACGC GCTCGCGTCC CTCCTTCTGC CCACACCTAC AAGCTCCTTA TCGACGCCTA TGCCACTCTC CCTCCTATTG ATATCCCTGC TATGGAGCTC GTATTCGCTC ATCTTATTGC CGACAAATCT GTACGAGTTC AAGGTACTCA CTGGGCGAGC TTGATCTCCG CGTACGGTAT TCACGCGGGT GATGCCAAGA GGGCCAAGGA GGTGTTCGAC AAGATTCCAG AGAAGGGTGG AGACTACGAA GCAGTTGTAT GGGAGGCATG GTTAAATGTG CTCAGTCAGC AAGGTACTAT CGAGCAATTA GAGGAGGCTC ACACTAGAAT GCTTGAGAGT GGTGTACAAC CGACGGCTTA CGTTTACAAT GTTCTCATTA ACGGCTATGC CAGAGCTGGT AACATTGGGC GTGCCCGTGA GGTCTTTGAG TCCATGGGAG ACAGTATCAC TGGTGTCGCC GCTCCCAACA ACCATCCTGT TCTTCTCACC TCATCTGGCC ACGCCAAGCC GTCAACACAG GCTGCGACGA CTGGTGTGGT CTACCGAGAA CCGTCAACGT ACGAATCAAT GATTCGAGCG GAGATCACAT GTGGAGACCA GCAGAGGGCA ACAGAGGTTT TGCAGAGGAT GGAGGAGAGG GGTTACCCAA TGGCGGTTTA CATGCGGGGC AAGGCAGCAT TGGAAGGAGA GCCTCCAAAG TTTTAGATTA GATCATTCAC CACTCATATT GGTTCATTCA CTTGCATTTA TACCCAATTT ATAAGATAGC TGTTGGGAAA GGAGAAGTAT ATAGCTTCAG GAAAGACGTA TTTAATATCC AGGGGATTGA TGTAC
|
Protein sequence | MFTKATSHLR PFIRLPSSHS PASPDHFTAN PSLLHHLPQH GTSNSVIVQG QHSAQAGGAS GHAGRTGYGG GAGAGGGYTG HARAFLSLPQ TASFDPSSTL SDERKKDASS SSSSNTSLLL KHRLSKTPRI VHLNDSVRAR RAIEERKGGK DFTVVELEDV PRIHRRDSIA EYTSGRLPLT STSTRSLSRS HTSFDIFQVG IPQPRSIGLR ALSTTSNAPR ALEAEDIEAV EELELMTPRA GREVRTIGQP NRVLMDLAGR DLPSTVTAGW QLGGIRRNST AAVERPSLDL PPADLSSIAT HEQKEDKQRE EAILQALRQA KQNGNAELVK NLITHYRSPR SLPPFSPDSL PGVPELSKHY PLPSGYSIRI YNACLIAALA IRSPGQSIAP ILEIYNELLE KDMIPDSMTY GAVIRALAMR ERDVRGSQEI WERQKTWGLW RSEISGNQTW DPEVAAENDA NIEAYLAENN VLSAVRLFRA AALTGQAGRF VVSIHGTVLD ALSKLEKPDV TVMEQLVEHA ESHEVPGIIS LYKHLFLAYA KIKDASALSQ VWNKYRGLAK TASREDWAAA CGQREAYDVR IEGAMREPWD IVISAFIEVG ELSKAFEVFG EMAEVAEKKT KAEVPPATHR TCGVLVTALA RAGEFDLALE WFNSLQASAI VTQNSPHRLT LEHTIVLAEQ LLLKGRWFDA ADVVIGLGGI RDELAQPSAQ HSLRTVAWKT NFALIHHAAR STPEEAARTL ARAQEVLAAV PIQLDLRTVV LHLTLLAQAS DYTNMIRVVD LISARKLPAN TIQRLNDACD QIAKEDIPFK YRIELIKAFG RHHADLTPFV AEKLIQAYIS EKSKPVDERD LSKESLYILL DAFTVVEQKQ VNEGEYDEAL EQLMQDLAAT DVSEDMKNAT SNQAIGYLVK NVIFRFGAER ARSMLGAVFG EQEATNLVQP VSPTFIQSET VSGSERLSEA ASPSQSHASS ATSAPALSFS QTLTSSIERF TYRNPTITPL EAYALLRDGL SVSQVPRLEI ILSLMDHLAR RNDEPKVREL YELAQVVLNS LVRPDVQAQS WYSAENAMLI ACCHLGYLEE AGLHRARIVQ AGFTPSADAY ATMIASSKDT TDDALVAREL WEEALGMGVK PHLFLYNTVI SKLSKARKAE SALDLFMRMK SERIRPSSVT YGAVINACCR VGDALSAETL FEEMTSQPNF RPRVPPYNTM MQFYLQTQPN RERFLYYYGA LQRARVPPSA HTYKLLIDAY ATLPPIDIPA MELVFAHLIA DKSVRVQGTH WASLISAYGI HAGDAKRAKE VFDKIPEKGG DYEAVVWEAW LNVLSQQGTI EQLEEAHTRM LESGVQPTAY VYNVLINGYA RAGNIGRARE VFESMGDSIT GVAAPNNHPV LLTSSGHAKP STQAATTGVV YREPSTYESM IRAEITCGDQ QRATEVLQRM EERGYPMAVY MRGKAALEGE PPKF
|
| |
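The coordinate fields in this record agree with a 1-based, inclusive coordinate convention: 843469 - 838605 + 1 = 4865 bp, which matches the Gene Length field. The following is a minimal Python sketch of that arithmetic, plus a simple GC-percentage calculation. It assumes the inclusive convention and assumes GC content means the share of G and C among all bases. The helper names `gene_length` and `gc_percent` are hypothetical and are not part of any database tool.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length from 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def gc_percent(sequence: str) -> float:
    """Percentage of G/C bases; whitespace between 10-base blocks is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# Coordinates taken from the Start bp / End bp fields of this record.
print(gene_length(838605, 843469))  # 4865, matching the Gene Length field

# First two 10-base blocks of the gene sequence, used only as a small demo input.
print(gc_percent("AGCACATCTC CGCCACCGTT"))
```

Running `gc_percent` over the full Gene sequence field (with its three wrapped lines joined) would give a value to compare against the reported 51%, subject to how the database rounds.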