Gene Information |
Locus tag | CNJ00100 |
Symbol | |
ID | 3254271 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006679 |
Strand | + |
Start bp | 20980 |
End bp | 25202 |
Gene Length | 4223 bp |
Protein Length | 896 aa |
Translation table | |
GC content | 50% |
IMG OID | 638253167 |
Product | hypothetical protein |
Protein accession | XP_567268 |
Protein GI | 58259711 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00491438 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTCGAATCCA CGCCCTGGAC ACCTCCCTAC CCTCGCCCTT ACAACATATT CCTCGCCAGT CACAAAAACT ACCTACTGGT CCGGCAAACC GCTAACAGCA CAAGAATTGA TTGCTCTTTA TCGAACAACT TATCGGTGGC GACGTTTTCG ATAAGGGCTT GTGCATTTCA AGGAATCAGT GTGAGTATTT TGAGTGATAT GGTATTGTGT TCGTCAAGGT GGAGAAGGGG CGAGTGTCTT ACTTAGTTCC GTGGACCTCG AGGCCGTGAC GTGGCTAGGG TGATTAAGAC GAGTGGTCGA CAACGTGTTG CTGACTTTTT GGACTTAAAC ACAGAAAAAG TCCACACGGC TCTCAAAAAT CCCCCGACGA CGGGGAACTA ACTTCAGTTC TATCTATTTT TATCGATTGA TACTCGGACA AAGAGGTTTT TTTGATTGAG CGAGAAAGTT GTAAGTGGCT TTCCATTCTG TGGAATGAAA GTGCGAAGAT GGCTTGTGAG GATTTGCTGG CAGCGGTAGA CTGAGGAAAT AAGAATCATA CCTTTTGTTC AACTCTCATG GCCATTGTGG GGCTACATCT CTGCTTTGCG GGCGAAAAAT CTGTCAGGGA GGAGTTTGAA CCCCTCCTTT CGCACTTTTT TTCCTTTAAC CCTTTGCTCA TGGCTTCATG ATTTAGGGAT TAGGGATGAT TTCGTTGCTC GTTTGAAGTT CAAAATCCCA CAGTTTTCCT CGAACGTGTT GTTTATGGCC CGTTCTATCC TCAACTCGAG AAACCCGATC GCTGACGCCA TACCTGCCAT TCAGTCTTCA CCTCTTCCTC TAGCCAGTTC TCCTCTCGAC ATCATGTCGC CGTCTCGACA GCAAAGTTCT GTGTTCAACG CCCTTGGGTC CTATGCCGAT AGGATCAAGG ATGCCAATGG GAACTTCATC AAATCTTCAC CTTCCAGCTC GTCTTCTGCG ACTCCTGAAT CCACCTCTCT ATCGTCTTCC ACTTCCGGTA AAAATGCGTT TTCTACGGCT ACTTCCAAAT CTGGTCAGCA GAAGCAAAGT TCCTCGCCCC AACAAGAACC GATTACGGCT GAGGACGATG GGCCATGGGA GACAGTTCAA TCTACTCGTG CTCGCCAACG TTCAGACCGT TCTGAGGAAA AGGAAAAGGA AAAGCGAGGG AGCAGTTCCA AAAACTGGAG AGACCGTACA CATCGTGATG AGAAGAATCA AGATGATGGT GAAAAGAGAA GCGGAAGAGA AAAGTCCAAA AAGGAAAAAG GGGACAAGGG CAGTAGTGCA CCCCCTGTAG GGTCTGCTAC TTCTGACGAG AAGACTGCAA AAAGTTTATC GAGCTCGACA AAGAATGCTT GGGGTGCCAC TTCTTCGTCT CAAGGTGCAA GCCCAATAGC CTCCAACCCC GTTCCCAAGC AAAAAGCGCA AAATGATAGT ACATCCCGAT CGTCTTCTGC CGCTGCTCCC ATCGGTCCAA CCACTTCGAG TATAAATGAA ATAATCAAAC AGTCTGAAGG TTCCGACGAG GATAATTGGA GAGCTAGGCC TGCAAAGGTG GAGAAGAATG GAAAAACGGA AGAGTCTGCT AGCATTACTC AAGCGCAGCC TCAGCCTCAG CGACAACTTG CCCCTCCTCC TTCAATAAAC ATTTGGGATC TCCGAAAAAA GATGTCTGTC CCTGCTTTAT TCTCTCCAAC CAGTGCCAGC TCCACAGCTA CGGTCATCCC ACCAAAGGGC GACAAAGAGA GAAGTTTAAC AAATGGCATG CTCAAGGAGG AAGGTTCCGC TACAGGGAAA 
AGCCTGTCAA AGAAAAAGTC TGCGGCGGCT GCGGCGGCTG TGGGTACTCC GTCGGTACCT CCATCGATCC ATGATGCCAC TTTATGGCCA GACATTACAC AAGCCGCAGA GGTTGCCAAG GCTGGGGAGG ACAAAAAAGC TAAGGAGAGA TTGAACAGTG AGAGTGCCAG TGTTACGGAG GAGAGCACCA TTGGGACCGG TAGTAAGTTT TCTATTCGCC TTGTGGTGCC CATGGTCAAA GTACTAAAGT TTCTATCAGA AAAACCCAAG TGGACGCCTA TACCTGCACA CAAACTTTTG GCCGCTGCTG ACCGTGCAGC TGAGCAATCC CGTAAACAGA ATCGAATGGA GGCGAAAAAA CGAGCATCAG CTCGTGAAGG TGGAGAATCC GGGCCTACAG GGTCAGGTGC GCCGGGGAAA GGTAATAAAA CTAGGAAGGG TATGCAGGCT GCCGAGGGAA AGAAGGCTAA GAAGGAAGGA GCGCAGCAGA AGGAGGGACA TGCGTCAAGT AAGGCTGGTG ATGCAATTGG TGCGGGTACA GGGAAGGCAA ATGGGGATGT GAAGGAAACC AAAGAAGCAG ACGTTCGCTC AACATCTCAA CAGGAATCAT CATCCCATCG TTCCGGACCC TCCATTTCTG CCTCCGCCAA CGCGGAAAAT GATTCTAGCT TACACGCACG GACCAAATCA ACACCAAATG CCACCACCAC CCCTCTCCCG CCCCATGCAT TCAACCTTGC TTCCAGTTCG CATCTTTCCA GATCCATAAG AGGTAGAGGA GAAGGTCGAG GATCGTTTGG TGGGGGGCGA GCAAGAGGTG GCTTCAGGAG TAGTGGCGCA TTGGGGCCCA AGGGACAGCT TGGGCATGGA CATGGGCATG TGCATGGACA TGGACAAGGA CAAGGCCTTG GATACGGTTA CGGATCACCA CCCTTGGGGG TGGCCGGTCT GCCTGTTGAG GGTATCGTCT ATGCTTCCAT AAATCCTGGT GTCGGTGCCG GCTCTACTCC CAATTTGTAT CAACGAGGTT TCGGTATGGG TTTCCAGCCC TTCTACCCTG CTGCCACCGC TGCGGCTTCG GCTGGTGGGG GGGGGCCGAC GGGCACAGCG GCTGGGGATG CGGCAGGAGT GTACGACCCG GCGGCGGCTG TTTATGGAAA TATGGGCATG TACAAGAGCG CGTCTATGCC TCCTCCGCCT ATGCCTCAGA CAGTTGTACC TAATCTTGAT CCTTTGAGAT TCTACGTCCT CGGTCAGGTA AGTTTCTCTT CCTTTGACTT TTTCTCGAGA GAAGAATGAG GCGGATCACT GATGGTGTTG TGATTAGGTG GAGTACTACT TTAGCATGCA GAACCTCGCC ATGGACTTCT TCCTTCGTCA ACAAGTGAGT ACTCTTCTCC GCGTACAAAA ATGGGAAGTT GCGAAGAAGC AATCTCTGAC ATTATTGTCT CCTAGATGGA CTCTGAGGGC TGGATCGACA TTGCCATGAT CGCCTCATTC AACCGTATAA AATCGCTTAC CCCCGAAACC TCCGTTGTCC GCGAATGTAT GACCCTCTCC AACTACCTCG AAGTCCGAGA AGATAAGGTC CGTCTGAGTG GTGCCGAATC TCATCGATGG GTATTACCAG ATGCGCCGCC CAGCAAATTT GGGCCGGACC CAAGGTCGCC CTCTTTAGCG GAAGGGGCTG AATCGTCGGA AGAGAGGGAC GGGACTGCTA GTGGGTCTCA ATCCGGCCTT GTGACTGTTG GGGAAGAAGG GGCGCAAGGA TTACAAGCGT CGCCTAGAAG GATGTTTGGA GCGCAGGATG 
TGAAGGATGC TTTGATGAAG AGTTCGGCGT TGAGTACTGT TAACGGGGAG ATTAAGGAGA AGGAAGAAGT GAAGGCGTTG GCGAACGAGA GCGAGGAGAC AGAGAAGGAT GAGCAGCAAT AGGGATGTAA TGGTGAGTAT TGGGAGTTGA TGTGAGGCTT TGGACAGAGG AGCTAACGGA CTCATGCCAT GGTAGACTGG ATTTAAGAAG AAATTCAAAT ACAGAAAAAG ATACTCCTAT AAACTGTATC AACACCGATT ATCCTCTGTC TGACCATTTA AAAGATCGAG TGACTTTCGT ATCCAACAGT CATATTTTCA ACCTTATCAT TGCGCCCCTG CGTTCCGCCA ATGCGGGCGC CTCTACACCC CATGCTTTCG CCCTTTGAAC CACAATCATG GTTACTCTGC ATTCCCATAT GGAACTTTCC CTTTTTTTAT CCTTCTCTCG CATTATATCG TTTCTTTTCC TTTTTACTCA AGTTCCGGTA ATCTTTCTGT GCATAACGAT CCTATCACAT TAACGCGGGT GGTACCAGTG TCCATTATCC GTACTACGTC TTTTCTCTAT ATCTGTTTGG TACGAAGTGT CAATCCTTTC CGGAGAAACT GTC
|
Protein sequence | MSPSRQQSSV FNALGSYADR IKDANGNFIK SSPSSSSSAT PESTSLSSST SGKNAFSTAT SKSGQQKQSS SPQQEPITAE DDGPWETVQS TRARQRSDRS EEKEKEKRGS SSKNWRDRTH RDEKNQDDGE KRSGREKSKK EKGDKGSSAP PVGSATSDEK TAKSLSSSTK NAWGATSSSQ GASPIASNPV PKQKAQNDST SRSSSAAAPI GPTTSSINEI IKQSEGSDED NWRARPAKVE KNGKTEESAS ITQAQPQPQR QLAPPPSINI WDLRKKMSVP ALFSPTSASS TATVIPPKGD KERSLTNGML KEEGSATGKS LSKKKSAAAA AAVGTPSVPP SIHDATLWPD ITQAAEVAKA GEDKKAKERL NSESASVTEE STIGTGKKPK WTPIPAHKLL AAADRAAEQS RKQNRMEAKK RASAREGGES GPTGSGAPGK GNKTRKGMQA AEGKKAKKEG AQQKEGHASS KAGDAIGAGT GKANGDVKET KEADVRSTSQ QESSSHRSGP SISASANAEN DSSLHARTKS TPNATTTPLP PHAFNLASSS HLSRSIRGRG EGRGSFGGGR ARGGFRSSGA LGPKGQLGHG HGHVHGHGQG QGLGYGYGSP PLGVAGLPVE GIVYASINPG VGAGSTPNLY QRGFGMGFQP FYPAATAAAS AGGGGPTGTA AGDAAGVYDP AAAVYGNMGM YKSASMPPPP MPQTVVPNLD PLRFYVLGQV EYYFSMQNLA MDFFLRQQMD SEGWIDIAMI ASFNRIKSLT PETSVVRECM TLSNYLEVRE DKVRLSGAES HRWVLPDAPP SKFGPDPRSP SLAEGAESSE ERDGTASGSQ SGLVTVGEEG AQGLQASPRR MFGAQDVKDA LMKSSALSTV NGEIKEKEEV KALANESEET EKDEQQ