Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNL04140 |
Symbol | |
ID | 3254787 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | + |
Start bp | 127562 |
End bp | 129462 |
Gene Length | 1901 bp |
Protein Length | 518 aa |
Translation table | |
GC content | 48% |
IMG OID | 638253887 |
Product | expressed protein |
Protein accession | XP_567969 |
Protein GI | 58261118 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.215775 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CCCAATCCCT TGATCTGCTT CACCATGTCA GACAACCAGC ATCTGCAAGG CCAATCTAGT GTCCCAGAAA CGCCCGCAAA TACGTCAGAA GTTGATACAT CGGGCGCGCC GGCCGCAGCA AGCAGCGTCG GGCCCGCTTC GAGCGAGGGG ACAGATCAAC AGGACCCTGT CGTGTCGACT TCTTCTTCAA AAGTACAAGA TATACTACAG CAAAGCATCA ATCGCGACAA CTTTCAGGAA GAAGTTGGAC AGGTGATGGG CACCATAAAC AGTTGGTGGG GGGGCGTCAA GAAACAAGTA CGATCCCATC CTACTCAGTA GTGTCTATTG TGCTGATCAA GTCCATGTCT AGTCCGTGTC TACTTTGGCG ACTTTAAAGG CCGATATAGA TAAAACAGTG ACCCAAGCCC AAGCTGACTT TGAATACCTC AAGGCAGCTA AAATTGAAGT GGTACGCAAA GACGCCACTT CCGAGCCCAC TAGAGTGTCA GCCAAGGAGG ATCAAGATAT CAGTGCAGAT ACTTTGAAAG AAGAACCGAT AAGTGTCCAA AACGATCAGG ACAAGGGAAA AGGGAAAGAA ACGGCACAGT CATCGGCAAC GAATCAGACG AGCCCCCCTG CTTTCTTTAC GAAACTCGCT TCTTCAACTA GTCAACTTCA GCAATCACTC CTATCCGCTG TACAGTTTAC ACTCGATGCC ACAACTGCCA ACTCGGCACT GTCCAATCCA AATGCCTTCC GTCAGCAACT TGTGGATAAT CTACGCCTGG CCTCCGCTCG AGAGAACTTG CAGCTTTCAG TCAAACAAGC TGAAAAACTG GCTGAAGAAT ATCTGCGCAA AGGAGACCAG TGGGTCAAAG GTGCTGAGAA GTGGATGGAA GAGGCTGTTA AAGTCGTACC TCCAGAGGGA GAAGAGACTC ACGTGGTCAA CATCGGCTGG GATGGCGGAG ATTGGTACTC CTTTTCAACT TCTGATAATA CCCCTCTGCA CATATCAACG ATTGACAATG GTGCCCCTGG TCCCTCAGCT GCTGGTACCC AGGTCAAGGT TCTGGCCAGC TCTCGTAAGG ATGCACTTTT GAAGCGCCTT CGAGAAGATA AGCAACTTTT GTTGGTTGAT CCTGAAGGTG AAGGGGAAAC TGAGAAAAGG AAAGCAGAGT TCCGTGACTG GGTTAAGACA CAATGGGAAG CACAAAAGAC AAATGGGCGA CTGGAGGATG AGGGTCTTGT GGGTCATATT AGGATGGAGC TTGGTAAGTG ACTTCCATGT TGGGATCCCT GAGAATATGA TACTTATATA AATGTCTTCT TACCCACTAC TCTCATCAGT GCCTGAGTAC CTCACAGATG AGCAATTCTG GCAACGTTAT CTATTCCACA AACATATGAT TGAAGAGGAA GAGCAGAAGA GGAAACTGCT CTTGCAAAGT GAGTGAGCAG TATTCCTGTC ATGCTCCTGT TGCTAATTTT TTTTTCTTTG GGTAGCTTCT CAACAAGACC AGTCAGATGA TTTCAACTGG GATGATGAGC CTGAAGAAAC TACCCCCCTG GGTGATGGGC AGGCATCCCA TGGTGTAGTC ACCCCTAAGG TCAGCCCAGT TGGCAAACTA CCTAGTTCAG TGTTCAGTCA CTCAAAGGCA AAACTTGCTA CTTTGGACTC AACAAGCCCA CATGACTCGG AGGAGAGCTA TGACCTAGTC AGTGATCAGG GAGGGAAGAC TGCCAGGGCT GCCCCTCCTG TGGGAGATGA TGATTCTGAC TGGGAGTGAC AATGATTAGT TCTGCCACTT GTTATTTGTT GTTGTATATA CTAGCCAAAC CAAGCTCATG TCAAATATTT TACAGATAGG AAAAGACTAT GATAAAAAAA GTTTCAGATA GAACTGATCC CTTGGATGTT T
|
Protein sequence | MSDNQHLQGQ SSVPETPANT SEVDTSGAPA AASSVGPASS EGTDQQDPVV STSSSKVQDI LQQSINRDNF QEEVGQVMGT INSWWGGVKK QSVSTLATLK ADIDKTVTQA QADFEYLKAA KIEVVRKDAT SEPTRVSAKE DQDISADTLK EEPISVQNDQ DKGKGKETAQ SSATNQTSPP AFFTKLASST SQLQQSLLSA VQFTLDATTA NSALSNPNAF RQQLVDNLRL ASARENLQLS VKQAEKLAEE YLRKGDQWVK GAEKWMEEAV KVVPPEGEET HVVNIGWDGG DWYSFSTSDN TPLHISTIDN GAPGPSAAGT QVKVLASSRK DALLKRLRED KQLLLVDPEG EGETEKRKAE FRDWVKTQWE AQKTNGRLED EGLVGHIRME LVPEYLTDEQ FWQRYLFHKH MIEEEEQKRK LLLQTSQQDQ SDDFNWDDEP EETTPLGDGQ ASHGVVTPKV SPVGKLPSSV FSHSKAKLAT LDSTSPHDSE ESYDLVSDQG GKTARAAPPV GDDDSDWE
|
| |