Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNB03820 |
Symbol | |
ID | 3256077 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | + |
Start bp | 1134122 |
End bp | 1137296 |
Gene Length | 3175 bp |
Protein Length | 737 aa |
Translation table | |
GC content | 49% |
IMG OID | 638255029 |
Product | expressed protein |
Protein accession | XP_569028 |
Protein GI | 58263236 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CAAAAGTCCT AGTTTTGACA CATACAAGTC GCCAAAGGAC GCGTGGCAGA AGGAAGAAGA GTGTGGCGCG GTCAACAACA GATCAGAGGT GAGCATCGTC TTAGAATTTA CGACACGCCG TTCTTAGATG TTACGTTTTC TTTTTTTACT AGCCAATCTT CTTCATCCTC ATCTGTTCCA TGGCATTCCG GTAGAGGCGG CTTCCTACCA TAGCACAAAG CAGGCTCTCA TATAGGGTGC GGACGCCTGA AGGTCACGGC GAGGAAGTCG CAGCAAAGTT GGTTGATTCC TCAAAAGAAA AGCATGATCA AGAAGGAAAG CCTGACGTGA TTAGTACAGA GCAAAAAAGC ACCCAATACT GCAAAGAATC CAATAATCCC GAAAGTCCTT TTGTCTGTCC CGATTATGCT TCTGGAAAGC GATTACTCAG AGATGGGAAG GGAAGGACAC CGTTCTCCCG AAAAACATCG ATCTGAAGAC AGGCGTCCCT CGGGCGAGGA GCATCTAGTC TCTAGGTATA ATAATCCTCC TATTTCTGAA CAAACAAGGA ATTCCCCTTC CCGAGAAGCC AGTACTACTG GCCACGAAGC CCAACCTCTG GCCCCTTCAT CAGCTGAGCA AAATCTGCCC ACCCATTATG TTGTGCGCCC CATACAGACA CCCGACGCTC ATCAGCAATC TGGTTCTTCT GATAGAATTG AAACGGCAAC AGAAGTGAGA CAATCCGAAC CCTCCGCTCA TCAGCAGCCC ATAGAGGGAT TGCTTTTACT TCGTCAACAA CTTCCTATAA GCCCTGCCAG GAAACGATCC AGAAGCCCTC CCCCTTTTGG CCGTATTGAT AAATTCGGGC GATCGCGCAG CAGCTCTCCA AGTGGATCAA AATCCGGTGA AGATAAGGCA AGGAGTCGTC CCAATACAGG TGATGAATTG AAAAGCAGTA TGCGCGGGAG GGAAATGCAG TCTAGATCAC CACACAATGT AACAGACTTT TTGACTGAAC CTCCTTCTTC AGCACTTCGC TCTACTGTTC ACAGCTCATT ATCTTTTTTC GAGAGAGAGC TTGTCGACGT GAAATCACCA TGTTTACCAG CCATATCTGA CTGGAAGCCG ACGGAGACAC CTCCAAGTGG GCTAATAAAG CCCACGCTTT CTTCTTTGGT GTCCGATTTT CGTAGTCTGT AAGTGACGAA TATTGGACGT ATTTGATTGT TAAGGATGAC TTTCTAACAT CACATGGTGT CAGCCCACCA CCTTTTCCCA GCGCCCCTTT ATCATCAGCC CGTTCATTCT TTGGTCAGTC ACGAGCCGGT CCCTCCTCTC AATCACAAGA ATCGTCTAAT CCACGGCGAA GGAGTGAAGA CGTTCATCAC CATCTTCCTA GCCTTGAACT GCCCTTGCAA CCTCTTGTTG AAGAGCAGAC GCTTCAACCG CCTAGACCTC CGCTCGATAG ACAGCAACAT CTTGATATAC CAAGCCCTTC TCTATCTGAT CCTGCAAAGG GTCCTGGAGA ACAGCCTCCC TCTGCCGTCA AAGGGAGTCG TGCAGGTAGG CCACATGATA GTTTCAATAC TATCGGGAGT AACGATGGGC TGACCGTGTA CCGACAGAAA GGACATCTTC TACCAAAGGG ATGAGAGAAA AGGGTGGTAG ATTTGAACAA GGTGGGAGTG GCTCAGGGAA GAAGAAGGCG AAGCATCAAT CGGCTGAGTT GCTTTCTCCA AAGGGTAGTG ATACTAAAAA TGGTGAGCGG CTTGTCTTTG AGAAAGTCAG ACGAATCAGA GGATGACTGA TGAAACTGGA TTGTAAAGTG GCAGATGCTC CAGGGCCGAG TGTTAATGGT AAGTTGATGA ATGAGCGCCA GGTCACGCCT TGTGGTTCAC AATGTCCGTA CTGATGAACC TCTTTTAGCG AGGCTTTCCA CCGAGAGCAC CGGTCATGAA AGTGCATCGG GAAGGGCCTT CGATATACAT TGGACGCCTG ATAACGTGAA AGCATATTGG ATGGGTTATG ATTCTGCTAT GCGGGACGTT AAATTTGGGA GGGGTGATGT TAGAGTAAAT ATGCCTAAAG AAGGTATGAC CTGGAAAAAA AAACATCAAC ATGAAATGAC TGATGTGAAA TAGGACGTGC AGTGGCCATG GAGGTCTCAG TTGATGAACA GCATCAGCAG CATTCCAAGA CGAGGGCTGG AGCTACAGCT GCAGAATCAC CTCAGAATCT TGGCTTCAGA GCTGCTTACA GACCTCCTCC TTCGCCTCAA ACTGTCACAC AACCACAATT GCAGACTCAG AGGTTGGCTC CGCGAATGAG ACCACATGGT GAAAACCAAC CTGAACCACT TTCGCAAGGC ACTACAATTT CCCCTAGTAT GAGAACTCTG AGTCACCCGA TTTTTGCGCC CTTTGGTTGG GAACCTTCCA ATCCTGTTGT CCATCCGCAG CAGAGGAGGT CAACGTATCC GGTACACATG CCTAATTTCG TTGCCCCTAT TCATCCTGCT TACCACCAGA TTGTTGCGGA CCCGCCAAAA AAGCAGGTGC AGGTACCTAT TGGAACACGA TTCCATCCGC CACAAATGCA TCCTCCTCCA ATTTATTCGC TCTCCTCATC GGAATCTTCC ACTAATTATC GGGAAGGGTC TGGTCTCGAG CCTGCTGGCT CTCATCGTCA GAGGAAGCGA CAGCTGATCT CTTGCTATCC TTGCAGAAAG CGCAAGCTTC GATGTGACGG CCGGCGACCA GTCTGTGAAC AATGTGAGAG AAGGAAAGTC GCCGACCAGT GTGGATATGC TGAAAGTATT AAACGACGGA GGAGGACCAA GAATGCTGAG GACGATGATA TCGAGATGAG AGATGAAGGG GATGACGAGA TCGAAGAAGG AAAGGAGGAG GAGATACAAG CCGGGCCTAG CAGAAGGGAG AACTTGGATC GCGAAGAGAG GGACCAAGTG TAAGGATATA GAGGAAGGGG GATGGAGGAA GACAAAGAGG AAGACGAGAT GCAGACGACG AAATAGTTCT TACGAAGACA GCGACACCGC TACACCAAGC CCTTGAAGGA TATGAGTCCT ACTATCCCAG GAATGACCCA TTGTCATCTG ATCTGAGACC TATAAGGATT ATCGACCTCG TGCTTGTTAA ACTGGAGTTT CCTCAGCACC GGCGTATTAA CAGAC
|
Protein sequence | MLLESDYSEM GREGHRSPEK HRSEDRRPSG EEHLVSRYNN PPISEQTRNS PSREASTTGH EAQPLAPSSA EQNLPTHYVV RPIQTPDAHQ QSGSSDRIET ATEVRQSEPS AHQQPIEGLL LLRQQLPISP ARKRSRSPPP FGRIDKFGRS RSSSPSGSKS GEDKARSRPN TGDELKSSMR GREMQSRSPH NVTDFLTEPP SSALRSTVHS SLSFFERELV DVKSPCLPAI SDWKPTETPP SGLIKPTLSS LVSDFRSLPP PFPSAPLSSA RSFFGQSRAG PSSQSQESSN PRRRSEDVHH HLPSLELPLQ PLVEEQTLQP PRPPLDRQQH LDIPSPSLSD PAKGPGEQPP SAVKGSRAGM REKGGRFEQG GSGSGKKKAK HQSAELLSPK GSDTKNVADA PGPSVNARLS TESTGHESAS GRAFDIHWTP DNVKAYWMGY DSAMRDVKFG RGDVRVNMPK EGRAVAMEVS VDEQHQQHSK TRAGATAAES PQNLGFRAAY RPPPSPQTVT QPQLQTQRLA PRMRPHGENQ PEPLSQGTTI SPSMRTLSHP IFAPFGWEPS NPVVHPQQRR STYPVHMPNF VAPIHPAYHQ IVADPPKKQV QVPIGTRFHP PQMHPPPIYS LSSSESSTNY REGSGLEPAG SHRQRKRQLI SCYPCRKRKL RCDGRRPVCE QCERRKVADQ CGYAESIKRR RRTKNAEDDD IEMRDEGDDE IEEGKEEEIQ AGPSRRENLD REERDQV
|
| |