Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNL05700 |
Symbol | |
ID | 3254893 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | + |
Start bp | 584972 |
End bp | 588026 |
Gene Length | 3055 bp |
Protein Length | 897 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254046 |
Product | hypothetical protein |
Protein accession | XP_568104 |
Protein GI | 58261388 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATTTCGTACG CTCACAGCTT GTTGACCCAC AGCTCCCGGC GCGCAGCTAT GCCCAAAGAG GACGCTCAGT TACATCCTCC GCCACTCAAG CGGGGAGATG CCTGCCTCTA CTGCAGGAAG CGCAGAATTC GTTGCTCAGC CACAAAGCCC ACCTGCACTC ACTGTGCAAA AATCGGCCGC GAATGTGTTT ACGATGTCAG GAAACCCACA AGCAGAGTAC AGCAACTTGA GGAAAAGGTC GCTCAGCTGG AGAGCTTACT GAAGAATGGA GCAATGAAGG GTGATGGGGC ACCATCCGGA TCAGGACTAC AGGCGAGCGG ATCTACGCCG CCTCTGCCGC ATCAGTCTTC AGGCTACACT CCCTCAGAAG CCACAGCTTT ACCGCAGCAA ACATCCACGT CGACCTCTTA CTCATTATTG AACAATGAGA CAATTGACGT CAACTTGTTT GGAGGCAAAT ATCCTGCTAT ACCGCCGGCT GGCGACGATT CCAATTTGTT TCCCAGTTTC GGAGGATCCA TCTTTGGAAG CATGGGCTCG ATAATGTCCC AGCCTCAACC TCAAGCAGAA CAAGCTTTCG ATTTTTCTAC CCTCGATCCT ACATTCATGA ACATGATCAA CTCCTTTCAA ACTTCTACTG GTCTAGCCGA ACCCGTACCG CAACAGGAAC AACAACAATT ACCCACGGCC TTCGGCCAGT CTCCAGCGCC GCTGGCCTCT CCATCTTACC TCAACCCTGC CCATGCCAAT ATTAATTCAC ACGCTGCTCA ACCCTTCCCC TCTTCCACTG CTACTGAACG CATGCCACAA TCGTACACTT CTCTGCCCAA ACTTCCTTTC CTCAATAACA ATTTTTGCAC GTCTGTCGTT TCAGAGGCTC TACCTGAAAA TCCCTCGGTC GTTGCGGAAT TAGCATCGGC GAACGCAAAC GTGCATGAGG ATATGGCTGC CCTTCTCAAA GCCGCTGCTG CGTCTAACGC GGAGCAGTGG GCGACGGGGA TAGTACAAGG GTCGGATTTC GAAGGGGGGC AGGAGAATAT GCAGCTTGTT GGGGGTTGGT TTGATGCTAA TGATTTACCA AAGGTCGCAA GGGATCATCT GTCAGTTTGT ATCCATGTTG CGGGGTTGCG GTTCGCTAAT AATATCAGTT TGAATATGTT CTTTTCTGGT ATGAGATTAT TTGGTCAAGA GTTCCATGTC CCTCGGTTCA TGGCTAGGTA GGTCTACCCG TTTCAAATTT GACCAATGCC TAACATTTCT TAGCCTTTCC CTTCCGCTCT CAAAACGTCC ACACAATTGT CTTCTTTACT CTATGTATAC CATGGCCTCT CGCATATCCA CCTCTCAACC CATCCGCAAC CTCGAACCCC ATTTCCACTC CATCGCATGC CGTCAACTCG AGCTCGCCAT CGCCCAAGTA GATAAGCTCT TGGATGCGAT CCGAGCGAGC TCCATCCTTG CCGTATACAA ATACAGTATA GCGCGGTACC ATGAAGCGTG GATGATGTCT GGCCAAGCTG CGAGACTGGC GATTGCCTGC GGTCTGCACC AGATACAATC GTCCGTTTGG AAATCGTATA ATAACGAAGC ACATGAGATG ACTGCTGATT TCGGGGGGCT GATGAGACGT CGGTCATATA TCCTACCACC ACCTGTGGAT GCAGTAGAGC ATGGAGAACG GATATGGGCT TTCTGGTCGA TCTTCGTGGT AGACCGATGT GGATCAATAT CCACACAATG GGTGCCGGCG ATACCGGACG ATGCGATTAT AACGCCTTTC CCTAGGCCGC TACACGAATA CGAACTCGGA TTAGTGACAG AGGCCGATAA TATCTCCATC TCTTCCATTT ACGCTCCATC TCCTCCTCGT TCCCGACCAC TACGCTACGA CTATGCAGAT CTGGTTAAAA TCCGTTTACG AGCATTAACC CTTCTGGAGC GAGCTTCCAA ATTGATGTAC CTGCCTCCCG AACCGGGATG GGATAAAAAC ATTGAAAGGC ACAGATCTGA ATCTTCTGGA TCAACGTTTT TCGTGCCCAA GAGACCTGAT GAGATGTATG AATACCTCCC TTCTCCGTCT GGGAGCGGAC CTACCTTTTC CTCAGGATCA AATCATACCG ACACAGCAGG GGATTTCAGG AAGAATAGAG GCTGGACGAG GACAGCAAAA GTGAGGAATC CGAAAGCATA TAATGAAGTT CGCGAGGCTT TACTACTTAT TGAAGAGGAT TTACCAGAGG AATGGCGGAC GAATTGGTTG GAATGGGATG GGAAAGTGCA GGCGTGGCAT TTTAATGGTG CGAGAAAGGA TATCATCTCG CTCGTATGTA TTTTCTCTCT CGCTTGTACG GCTGTGCAAG AAGCTGATAG GAATGATGAA CGGTAAAAAT AGCATCTGGT CCTGGGTTGT GCATGGATGT TCATCGAAGA TGTCTATGTG TTTGGTGCTG AGAATACGAC AGCAGTAAAT ATTGCCAAGC GACTAACCGT GACAGTGAGG TATCTTGCGC AGCAAGCCGC ACATTCAGAT TTAGACGTGT TCACAGCTAT GATGTGGGTA ATCTGTTTGC AACGGGAAGC AAATGTTTGA AGGGGAAAGA GCTGACTGAG GAACCGTAGT TGGAGTTTCA TGTCGAAAGT TTTGATCAGA GAGATGAAGC GGCGCGAGTC TTCCGGTGAT AAGCTAGGCA AGTTTCCCGA TCTCTTTTAC TTCCTCGCAT AACCTTTGCT CGGCAGGCAA CTGACTCTTT ATAAACAACA GGTGCTGCCG CATTAGAACT CGACATTGAC ACGCTTGTTC AAGCACTTAA ACAATTTGGC AGAGGGTATG CTATGGCAGT GATGCAAGCT ATGCGGCAAG AGAGGTACAA GAATTCTAGC TGGGAAAATG TGGAGTCCAT GGAAGGGGAT AATGAGGGCT CGAGTGATGA AGAGACATTT GGTCGGAAGG AGGGGCTGGA ATGGGCGGGA CGGCATTTCG GAGGGGCAAC ATCTTAGCTT TTAAGGGCTG TTTATATGTA TATAATAATC TAAAGATGTA TCATATTATA TAACT
|
Protein sequence | MPKEDAQLHP PPLKRGDACL YCRKRRIRCS ATKPTCTHCA KIGRECVYDV RKPTSRVQQL EEKVAQLESL LKNGAMKGDG APSGSGLQAS GSTPPLPHQS SGYTPSEATA LPQQTSTSTS YSLLNNETID VNLFGGKYPA IPPAGDDSNL FPSFGGSIFG SMGSIMSQPQ PQAEQAFDFS TLDPTFMNMI NSFQTSTGLA EPVPQQEQQQ LPTAFGQSPA PLASPSYLNP AHANINSHAA QPFPSSTATE RMPQSYTSLP KLPFLNNNFC TSVVSEALPE NPSVVAELAS ANANVHEDMA ALLKAAAASN AEQWATGIVQ GSDFEGGQEN MQLVGGWFDA NDLPKVARDH LSVCIHVAGL RFANNISLNM FFSGMRLFGQ EFHVPRFMAS LSLPLSKRPH NCLLYSMYTM ASRISTSQPI RNLEPHFHSI ACRQLELAIA QVDKLLDAIR ASSILAVYKY SIARYHEAWM MSGQAARLAI ACGLHQIQSS VWKSYNNEAH EMTADFGGLM RRRSYILPPP VDAVEHGERI WAFWSIFVVD RCGSISTQWV PAIPDDAIIT PFPRPLHEYE LGLVTEADNI SISSIYAPSP PRSRPLRYDY ADLVKIRLRA LTLLERASKL MYLPPEPGWD KNIERHRSES SGSTFFVPKR PDEMYEYLPS PSGSGPTFSS GSNHTDTAGD FRKNRGWTRT AKVRNPKAYN EVREALLLIE EDLPEEWRTN WLEWDGKVQA WHFNGARKDI ISLHLVLGCA WMFIEDVYVF GAENTTAVNI AKRLTVTVRY LAQQAAHSDL DVFTAMIWSF MSKVLIREMK RRESSGDKLG AAALELDIDT LVQALKQFGR GYAMAVMQAM RQERYKNSSW ENVESMEGDN EGSSDEETFG RKEGLEWAGR HFGGATS
|
| |