Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC06020 |
Symbol | |
ID | 3256433 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 1775365 |
End bp | 1778480 |
Gene Length | 3116 bp |
Protein Length | 793 aa |
Translation table | |
GC content | 55% |
IMG OID | 638255823 |
Product | hypothetical protein |
Protein accession | XP_569823 |
Protein GI | 58265334 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATCGATAAC AGAACATCCA CAAACAATGA CCCAGCAGCA TCTCGCCACC TTCACCTGGG GCGCAGGCGC ACAAACAGTG AGTTCTGCCA GCCGCCCTGA CCCGTTCCTG CGCCGTCCCC ACTCCCAAAT CCCCATTCGC CGGGTAAACA AACCAGGGCG GAAATGTCCA CTTCGCGGAG CCATCCATCT CTCAAGTCGT GTCAAATGTG GGCTTCATCT GCTGACTGTG TATTCCCGCT CGTTTCTTCC CGCACTTTGC TTCTGAATGC ATCACTCAAC TTGACTCCGC ACCTGCCGCC AACTAATAAA TAGGTTTGTG TCGCCGGTAA CTTTAACGAC TGGTCTGCCA CTGCCACTCC TCTCAAGAAA CAATCAGATG GAAGCTTCCT GGCCGACGTT TCCGTTCCAT GGGGAGAGAA GCAGGCGTTC AAGTATGTGG TCGATGGAGA GTGGAAGGTT AGAGAAGACG AGGCAAAGGA ATGGGGTGGG TCTGTTCCTT TCTCTTGGCC CTGAAAGAGC TGACAAGGGA TTGGATTAGA TGCCGCTGGA AACATGAACA ATGTCTACAC TGCTCCCGAA GGCCCTGATG ACAAAAAACC TGTGGACAAG TCAACGGCTA CTGGTGCCAC TGGTGCTTCA ACCTCTGCTG GCGCTGGCGC TGGCGCCGGT GCTGCCGCGG CTGCCTCCGC CCATGCCAAG GACCCCGCCA CGAAGGATGC TACTTCCAAA CCTCCAACCA CCGCCGTCCC CCCTGCCGAC ACTGCCGACA CTGCCCCTGC CTCAACTGGT AACAAGAAGA CCGATCCTGA GGCTCTGATT GCTGCTGCTG CCCCTGGTGC TGCCATTGGT GCTCCTATCT TGGGCAAGCC TGTCGCCGGT GCAGAGACAG CTCCCACCTC TGGCATCACT ACGACCACCG CTATTGGTTC TGGTACCGCT GCTGCCACCG AGACCAAGAC CAAGGACAAG GCCCCCGTTG AGAGTGCTAC TCCTTTCGAC AACAAGGCTG CAGCCGACAA GGCTGGGGTC GCTGAGAAGG GTACTGCCAA GGACACTCTT TCCCCCGGTC CTGTCCCTGT CGCTATGGTC GCTGAGAAGG GCACTGCCAA GGACACTCTT TCCCCCGGTC CTGTCCCTGT CGCTATTGCC GTCCCTACAG ACAAGAAGGC TGGTCTCACT GAGAAGGGTA CTGCCAAGGA CGTGAACGCT CCTGGCCCCA TCCCGGGATC AACCGCCGCT TCTGCTGATA CCGCCGCCAA GGCTGACGAG CCCATCGACC CTGCTGTAGC TGCCACCACT GACAGTTCAG GTGCTGTCCG AGTCACCGCT GTCGACGCGA CGCCTCAGCA AATCGAAATG GTCGCTGCTG CCGCTAATGT TGGTGAAGCT CCAACCGCTA CTGTTGGAGG AGAGCATGGT CTTGCGGAGA AGGCTGCCGA GTACGGTGCG GCTGCCATGG CCACCTTTGG ATCTGTTGTA GGCGGTGCAG CTGCCGCTGT GGAGAAGGCG ACTGGTGTTG ACCTTACTCA CTCTAGTCCT GTAAGCTTTT GTCGCTCTAT TATTTTCTGC GCGAAAGCGC TAATGCGAGA CAGGCTGAAA CTAACGTACG TAACCTCTTT ACCTCTTCTT TAGCTGTCCG TCGAAGAAGC CCGCGCGCGA GGCATCGATG TCACCAACCT CGAAAAGGTC GATGCCCCAA CTGACACCAC TTCCCCTCAG GGATCCGCAC CTCCCGCTTC CGCTGTCGCC GCGCTCGACG AAAAGGTCGC CGAACTCAAG TCTGAAACCA TTGGCGCTAG TAGCACAACA GGAGTGACAG ACCAGGTGGC TATGCCATTG CCAAATCAGC AGGCTCCGAA GAGTACGTCG TTGTCGTTGC CGCCCTCGAA GGAGGTTGAT ACTGTGAGCG TGGATAAAGG GCTGAGTGAT TGGTTGGATT ACACAGCTTC TTACCCTGCC GACGGGTCTT CCACTCTCGG CCACACCGCT TCTACGACAG ACAACAAGGA GAAAGACATC AAGAATGACA TTCCTGCACA ACCTGAAACA GCAGACATGA ACCACCGAGC CGTACCTGCC CCTGTATTTA CCACAATCGC CCCTAAGGAC CCCAAGAAGG ACCGATCTCT AGGGAGCAGT GACCTCAACG ATACTGCCGG TACGAAGCCC ATCGATGCCG CTCCCGGTGT AGACGCAAAG AAGGCCAAGA GGGAGGAAGA AGCCAACCCC ACAGGCGCGA CTGGTGAGAA GCCCGAAGTG GCCGAGGCTA AGAAGGCTGC TGCTCCTGAT ACTCCCGAAG ATAACTTGAA GCTCAAGTCT GAAGACGTGG GTAAAGGTAC TGGTGCTGGA GTAGGGTCCG CTTCCGATGG CAGAGCGCAA GTCCCTCAAT CAGGTGTCAA GGCCGAAACT TCTCTTTCCT CCCCTTCGGC TGCAACTGCT ACTACTACCG AGACTTCTCA ACCTGCTAGC AGCGCGACTG GAACAACCGC CACGCCTTCC AAGACGACAG CGGCGACGAC TAACGGAACA TCTGTGCCTG CTGCTGCAGC TGGTTACGGT GGAGCTGTCG CCGGTGCCGC TGGTGCCGTC GAGGCTGGTG GTGCTGGTTC TACTACCACT GGTGCTTCAA ATACAGCGGC TCCAAGTACC CCTGCCGGGA CTAAGCCTAC TTCTACCTCC ACGCCCACGC CAGGAAAGGA GTCTAGCAGT GGACCCGGAT CTGTGAAGAA GAAGACTGGT TTCCTTGCCA AGGTCTGTCA CTTTACCATT TATTCCCGAA CGCAATGATG GAAAGGAAGA ATACTGACGA GGACTTTACC TGTAGATCAA GCACGCTCTT TCGCCCGGAC ACAAATCCAA GTAAAGAACA TTGCCCATCG ACTGTATTAC GAGCTCTCCA TCTATCGGCC TTGATAAAGT CTTTTGAACT TTTTCCCTTT ACTCATTTTT TTCTCCCACT TAATCTTTTT TCTTCTAATT CTTATTTAAT ACATGGTCTA GCGTCGTAAT AGGGGAACTG TGAAGTGAAA TATACAAAGG TCCTGGATAT ACCATGGGAT AGTGTTTTGC GTATTATACA AAAAAGTATG TGAAATGAAA AAAAAGTGTT ATTTCG
|
Protein sequence | MTQQHLATFT WGAGAQTVCV AGNFNDWSAT ATPLKKQSDG SFLADVSVPW GEKQAFKYVV DGEWKVREDE AKEWDAAGNM NNVYTAPEGP DDKKPVDKST ATGATGASTS AGAGAGAGAA AAASAHAKDP ATKDATSKPP TTAVPPADTA DTAPASTGNK KTDPEALIAA AAPGAAIGAP ILGKPVAGAE TAPTSGITTT TAIGSGTAAA TETKTKDKAP VESATPFDNK AAADKAGVAE KGTAKDTLSP GPVPVAMVAE KGTAKDTLSP GPVPVAIAVP TDKKAGLTEK GTAKDVNAPG PIPGSTAASA DTAAKADEPI DPAVAATTDS SGAVRVTAVD ATPQQIEMVA AAANVGEAPT ATVGGEHGLA EKAAEYGAAA MATFGSVVGG AAAAVEKATG VDLTHSSPLS VEEARARGID VTNLEKVDAP TDTTSPQGSA PPASAVAALD EKVAELKSET IGASSTTGVT DQVAMPLPNQ QAPKSTSLSL PPSKEVDTVS VDKGLSDWLD YTASYPADGS STLGHTASTT DNKEKDIKND IPAQPETADM NHRAVPAPVF TTIAPKDPKK DRSLGSSDLN DTAGTKPIDA APGVDAKKAK REEEANPTGA TGEKPEVAEA KKAAAPDTPE DNLKLKSEDV GKGTGAGVGS ASDGRAQVPQ SGVKAETSLS SPSAATATTT ETSQPASSAT GTTATPSKTT AATTNGTSVP AAAAGYGGAV AGAAGAVEAG GAGSTTTGAS NTAAPSTPAG TKPTSTSTPT PGKESSSGPG SVKKKTGFLA KIKHALSPGH KSK
|
| |