Gene Information |
Locus tag | CNA05460 |
Symbol | |
ID | 3253284 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006670 |
Strand | + |
Start bp | 1463402 |
End bp | 1466321 |
Gene Length | 2920 bp |
Protein Length | 558 aa |
Translation table | |
GC content | 47% |
IMG OID | 638252866 |
Product | hypothetical protein |
Protein accession | XP_566868 |
Protein GI | 58258911 |
COG category | |
COG ID | |
TIGRFAM ID | |
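The coordinate and length fields in this table can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, which is consistent with the reported gene length of 2920 bp. The function names `gene_length` and `coding_nucleotides` are hypothetical helpers introduced for illustration only; they are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive start and end coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def coding_nucleotides(protein_length_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode a protein, optionally counting the stop codon."""
    return 3 * (protein_length_aa + (1 if include_stop else 0))


# Values taken from the record above.
length = gene_length(1463402, 1466321)
coding = coding_nucleotides(558)

print(length)           # 2920
print(coding)           # 1677
print(length - coding)  # 1243
```

Under these assumptions, about 1243 bp of the gene span do not encode the 558 aa protein. That is what one would expect for a spliced gene (introns plus any untranslated regions), although the record itself does not give the breakdown.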
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATATATAGT CTTACAACCA AATATCCTAT ATGGTATAAT AGTACTTCTT CTCCGTCGGA CACCGTAATC TTTACACGCT CTCTAAGTCA TATATAAGTG AGTTGTTCAT CCTGCGTCAA CGACTGTCCT TGCTACCTCT ATCCCCGGAT TCTCGCCCGC CATATATGTA TTTCCACTGA TAAATCCCAC AGTCACGCTT CCCTTCGGCT CTGCTCTGCT TGCACAGTTT GGATCTGACA ACAATTTCCA CGCAGCTTCT TCGTCTCTAT AATCCGATAT TTATCGAAGG TGAGTCTGCG TGAAACTTCT GCAGAAGGTT GGTCGTACAT AGTTGTCATG GTAGCTCTGT TGTGGCCATT GGTATTCTTT CTTCAAGGCT TAGCCAAGAG CCAAGAGGGT TGAGCTGCCT TGTACCGCGC TTGGTTCGAG GGGCCAGGCC CAAAAAAGGG AAGGGCAATA ATACTGGTCC AGCCTCGGTC TTGAGCATTT CTGATGCTAG CATACGTTGC AACAGTGTTT TATTGGCATA ACGGATGGGG AAGAGTTCTG ACATCGAAGT TAGCAATATT TTTTATCTCT CCCTGTTCTG CCTGTTTACC GCCCCTACGT GCTAGGTCAC GCAAAGAACT GGATTCAAGC CTCCTGCCGT CAAACCGTGC CGAACTATTA CGTCAAATCA GCTCTTATAG TTCAATTTTT AGATTGTTCT ATTTTGAGAG AATTGCCCTA AGGACTCCCC ATTCACGTGT CAATGTTCCC CTCAACAGGC AAAGACTCGA TGCTCATGGA GATTGCCGGC CCAACGTCCC CACTTTCAAC CAAAAAAAGG AAAAGAATGG ATCAAACATC CAGCCATCCA TTACCAACAA ACCATGAGGT CGACATGCCC GATATGGCAT CCAGAACTGG TCGAGAGGGA TGGTAAGGCC GAAGTTGCCT GATCACGGCG GAGAATGTGA CTTACAAATT CGGTGGAATA GGACATACCA TCTTGAAATC TTACAAGAGC CTCTAAGGGC CAGGGCTTGT GGTTTTGGTA ATAAGGTGAG CTGTCCACCA TGGTTTTAAA ACTAAACAAC TCAGGATCGC AGACCCCTTA CGCCCCCTCC TATTATCCGA TTGTGGATTC AAGATGCTTT AGGAAATCAA GTGGATCCCA AGTAAGTCTG CCGAAACGCT ATTTATTCAT GAAGCATGCT GAACGTACTC TTAGCATAGT TGATTCCAAC ACCATAATCC TCCAGGTCGA CTTATGTTCG GCTGACGGGC ACGAAGGAAG AAACGTGGTA AGACATCCAG TGGGACCTGG CAGTGTTCCT GCAGTTGTCT CAGTGAGCGA AAATGTTAGG TCAGGTCATG TAGATCCTTC GACTCTTCCA ACTACATCCA CAGCGGTGTC TGAACAGCCC TATAGGAATA GTAAGCATCT CCAGATAAAA GCAAAATATG GCCTGCTCAC AACTCCTCAA ATTACAGGTA GCATGGACTA TTCTTCATGG CCTGAGCAAT ATTCCTCGGA ATCTTCCGAC CGGACCACTC CGGGGTGGAC TTATCAAGAT GCCTTTGTCT CTTCACCACG CATTGCCGGA ACTCGCCCTC CCATGGCTCG ACGAGTACGC ACTCCCACTC GACCTTCCAC TGCGCCCTCC TTACGTCCTC CGGCATGGAG CGTTGATGTC TCTCAACGGC CATTGTATGA GGAAGCTCTT CCTCCGATTT CTGCACTTGC AGAAAATATT CGCGACCCCT CTGGGCGCCC CTGGTTTCCC GACCCCCACA TTAATACGCA AAGGCCGACA 
AGTAGCAGTT CTTTGCGGAG CCGCCCTCAC ACATCCCACT CGACTGATTT GAGTACAGCT CCCACCGACT ACTCATTTGG GAGACCTACG ACCACAAGTT CCACCGGTAG CTGGCATCTA TCGGCCGATT CAGAGTATAA GGGTTTCGCA CTTGAAAACC AAGCTGCTGA AGGCTCAAAA CCTGGCCCCG GAGCAAAATC CAATGATCCT AACGTACCTA TATCATCCGC GGAAACTCAA GCAACATCAC CTGAATCTTT TCTTCCCGGT TCTTTTACTG ACCGCTTCAC TCAAAACGAC TCGCCCCGTG TACCATACTC GTACCACAAC TCCCATTATC GGTCGGATGT CGCGCCGGGC AAGGAGCGGG ACTCTTTCCA TAGTCCAGAC CTAGATACTA TGAAGGAAAC CCGCTGTTTC GCACCAGCAA GCGTTGCGTC GACTTTGGTT TTGGTTGGGA AACGTCACAC CCCTTGTAAC AAACTAAAAG ATGAGCATGG ACGGTTAGGT CTATTTTTCT TCGCCACGGA TCTGGGAGTG CGGACAGAAG GGAGATTTTG CCTGCGAATG AAGATCATGG ATCTAAGCCT GTACGTATCA AAGTTCAAAA ATGATCCAAT TAACATCTTT TCCTCAGCTT CTTACGGCCT CCTAACCCAG GCGATAGCAC GCCCATATTG GCGGAAACTA TCAGTCAACC AATAGAAGTG TAGTAAGTCC ATTGAAAAGC AGGACCTATT TTGCCGGTGA ACTAATTTCG ACTACTGTCG TTGTGCTTGT GTACGACAGT TCTGCAAAAC GCTTTCCTGG AGTCATACCT ACTACTAAAC TCACTCGTTT GTTTGCTGCT CAGGGAATCA AATTGGCTGT CAGAGAAAGT CATAAACAAA AGCACCGCAG CAAAGAGAAC GTAGATATGG ATGTTCAAGA TGAAATCGAG GAAGATGAAG ATGGTCAATG AGACATGGGG GATTACGATT GTGCGAGTTG CGCCGTGCGA TGGGGTGTTA AGAAATTAGT AATATTGTAG AAGTAATATA CCGACGTTTA ATCGCAAATT CATACTGTTC AGATAATTAA ACTTTCGATA ACCAGGGTAA ACATGAAGGC AGGGCATAAT
|
Protein sequence | MFPSTGKDSM LMEIAGPTSP LSTKKRKRMD QTSSHPLPTN HEVDMPDMAS RTGREGWTYH LEILQEPLRA RACGFGNKDR RPLTPPPIIR LWIQDALGNQ VDPNIVDSNT IILQVDLCSA DGHEGRNVVR HPVGPGSVPA VVSVSENVRS GHVDPSTLPT TSTAVSEQPY RNSSMDYSSW PEQYSSESSD RTTPGWTYQD AFVSSPRIAG TRPPMARRVR TPTRPSTAPS LRPPAWSVDV SQRPLYEEAL PPISALAENI RDPSGRPWFP DPHINTQRPT SSSSLRSRPH TSHSTDLSTA PTDYSFGRPT TTSSTGSWHL SADSEYKGFA LENQAAEGSK PGPGAKSNDP NVPISSAETQ ATSPESFLPG SFTDRFTQND SPRVPYSYHN SHYRSDVAPG KERDSFHSPD LDTMKETRCF APASVASTLV LVGKRHTPCN KLKDEHGRLG LFFFATDLGV RTEGRFCLRM KIMDLSLFLR PPNPGDSTPI LAETISQPIE VYSAKRFPGV IPTTKLTRLF AAQGIKLAVR ESHKQKHRSK ENVDMDVQDE IEEDEDGQ
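The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before computing a length or base composition. The following is a minimal sketch of that cleanup, assuming plain-text input copied from this record. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first few blocks of each sequence are used as sample input.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a sequence shown in space-separated blocks."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases among all bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First blocks of the protein and gene sequences from this record.
protein_head = clean_sequence("MFPSTGKDSM LMEIAGPTSP LSTKKRKRMD")
gene_head = clean_sequence("CATATATAGT CTTACAACCA")

print(len(protein_head))       # 30
print(gc_fraction(gene_head))  # 0.3
```

Applying `len(clean_sequence(...))` to the full protein sequence allows a check against the reported protein length of 558 aa, and `gc_fraction` on the full gene sequence can be compared with the reported GC content of 47%.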