Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI04350 |
Symbol | |
ID | 3259710 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 1160390 |
End bp | 1163742 |
Gene Length | 3353 bp |
Protein Length | 966 aa |
Translation table | |
GC content | 52% |
IMG OID | 638258929 |
Product | hypothetical protein |
Protein accession | XP_573020 |
Protein GI | 58271728 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2453] Predicted protein-tyrosine phosphatase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCCCGAG CAGCTACCGC CACTCCTCCA CCTCAAACTT ATTTTCCCCA GCGGAACGCC ACTCCGTCTA TGAGGCATGC CGCGCCTAGC CCGGCGATTC ATTTTGGGCG ACAACATCAG CATTTGGGAC AGAGCCATGT GAAGAGGAGG GTGAAGGGTA ATGGTGGATC CGGTTTGAAG AAGCAGCATA TACCGAATAG CAGAGAACAG AACGAGGAGG ATCAATTGGG GATGTCATTC GAGGATCTCG GCGCGGAGCG CGTCCTTCAA AATCCAAACT CAGACAAAGC CGATATCGAT GTCGATATCG ACATTGACTT GGACCTCGAC CCTAACCTCA CTCCGGCCTC TTCCCGAGCC TCCATCGTCA GTGGCCTAAG CCAAGACCAA GGTCCTGTCC TTCTCACCCC CGTCCAATCC GATTCAGAGG GTAGAGGCCC AGTGTTAGTG TCAAATTCCG GCAGGGTGTC GTCCGCAGGT TCAGAAGGGG GCGAAGAAGA AGGCCGAGCG TCGCCGGGAT GGATGGGAAA GATGGCGATG GCGGTTGTGA ATACGGGGAT GAGTGTTTCC AGAGGGTTTG GAAATGCACA AGCGCATGGT AAAAGAGTCC AATCACAGCC ACAGCCACAA CCGCAGGCGC AAGCTCAAGC CCACGATCAG ACTTTGGCAC ACCCAGCCGA AAAGATTGAC GATCAGCAGC GACAGCGGGA ATGGGCTGAA GCAGAGAACC GGAGAATCCA TGAGTGCGCA AGGTTGTGTT CTCAGTGGCC GTTAAGTGGG TACAACTTGG GCAAATACGG TCCCGGTGGT GAGTCAACTG TCCTTCACAT GTTCGCATCC GGTAAACATG TGTTAACATT GGGAATGAAG GCGCTGCAAC GTTCTATCAA CCCCAGTCAT TCTGTAACCC CCAACACGTC GCTTGGGTGA TGCATCGTCA AGCCCAAATC GAGCAGAGGT TGGCATTGAC CGAAGGGACA TTCTTCGATT GCCGCAAAGT GAGGCAAAGG AAGAGGCAAG GTGATTGTGA GAATGGTCGA GGGTGTAGAG CTAGAGAGGA AGATGAGCAG AGTGAAGTTT CAACCTCGAC GGATGACTCG ACGTTGTCAT CTCGGTTGTC GGGATCGACA TGCTCAACCG ACTCTCGCTC CAACTCTACC GCCAATTCTT CCAATCCTTC TTCCTCGCAA GACGAAAACA ACTCTGAACC TCTGTCCCTC GTCCAACCCC AACGACCTTC TGCGCCAAGG TCATTGACGG CTGAGCACTA TGCGTATGGG CGTACAGCGC AAAAGATGGA GAGAGAGAAG GAGATGGCGG AGGATTTGAG AGAGGCGATG GCGTGCTCCT TATTGGAGTT TAGTCATCCT GAGCGTCGCG GTTCAGTGAC TTCTTCGGCC ACCAAGGATG TTAGCGCTGA TGGAAGGGAC GAGAATGGAG AGAAGTTGGA GAATGCCGAA GTGGAGGGCA GGAGAGAAAG GGGTCGCAAG ATGTTTGCCG AGTTGGAAGC TCTTGCGAGG GATGTAGGTT TGGGGCATGT TGGGGAGGGG ATGGATGTGG ATGAGGACTT GTGTGAAGAG TCTCAACCAC AGCCTCAACC CGACACGCAA GCTACCCCCT CTGCCCAAAC TCACGCCGCA AGCTATTCTT CCGAACCATC TATGCCTCGC ACCCGAACAC AACGCGCTCA GTCATGCGGC TCCAAACGTC CTATTCCTAG TACACGCTGT AGCGCCGATG GGGAAGAGGA GAAACGTCGA AAAGTTCGTC CTGTCGCTCC TCCTACTACG TCTTCAGGTT TTGACCCAAT GTGCACCGCA ACGATGGAAG TGGAAGAGGC TGTGGATGAA GCTATGGTTG TTGAGGAAGC TGTTGATAAC GGCGTCATCA TGCCCGCTAC CCAAACGCAG TCTCAGAATC AATCTCATAC AGGGGAAAGT AGGCTCAAGA CTGGAAGGGG AAAAGGAATG TCATCTTCCG TACCCGACTT GTCCAAAACT TATCCTCATT CCGAATCAGC GACGACGTAC CCTCCTCAAC GACAGCAACA GCCTGAACAA TCAAACTCTC GTGATATATA CCCTCCTCCG GCTGTATTCG GTGTGGTCGT CAAAACTTCC GAGTCACATC CCATCATCAT TTCGCCCTTC TTTCCGAAAG ATTTGTTGGG CATTTTGGCG GAACATCTCG TCCTGCCTCT TGGGGGAGCG AGGAAACCGT TGTTGTTGGG CTCGAAGCTT GATGTTCCAA GTCTTTTACT CTCTTATTCA CCAGGAATGC CCTTCCCCTC ATCCGGCTCG ATGCGAAATC AATCCCAATC TACTCAGACT CAGCCCGTGT CCAACCTTTG TCAAGCTCCT CGGGTACCAG CATTAGGTAA CCTTCTTCTG AGCAGCTGTC CTGGTAAGAG ACTGAGAATG GAAGGGCCGA GTAAAGGTAG GGGACCCGTT TGTAGGGATT TAGCGACAGA TTTGAAGAGG ATCAAGGGGG AGGGTGTTGG GTGTTTGGTC TGGTGAGTTC AAGCTTCGTT GCTTAAATCT TTTGGTTGCT AATGTGGGGT GGGGACGGGC ATTAGTTGTT TGGATGATGA AGAACTTGAA CTGCTTGGTG TACCTTGGGA GACGTATCGC GATGTCGCTG CGGAAACGGG GTTGGATGTC ATTCGGTATG TAATATGAAC TTCTTCCTTC TCTCCTTCTT CCCCTCCTTG CCTGTACATT GGCATTAGCT CATCATTACT CCTAGGTTGC CAATGCCTGA CGGCTTCACT CCCGTTTCGA TGGAACTTTT TGACTCTCAA GTATCCCTCA TCGCTACAAA ATACACTTTG CAGGGCATCA ACGTTCTTGT TCACTGTCGA GGTGAGTTTC CATCTTTTCT GCGCCTTCGA CCTCGTGATC GAGCGGGTGT GTGTTATGTG TATGAGGAAT TTGATGATTG ACACTGTTGG CTCTGCCCTC CCCCTTCTAC CCCGTCCGCC TTTTTTCCCT TGGACAACCT TCGAACACAT AATCATAGGT GGAGTTGGAA GAGCAGGTAT GACGGCTTGT GCGTGGGCTA TCAAAATGGG TTTCGTCCAG CCTCATCCTT CATTGGTCAT CGTTGAAGAA GCTGCTCGAC AACGATACAA TCTCACTCAC GGTATTTCAC CCACGTCAAA CTCCCCGACA CCTATCGCTC CCTCTGCCGC TGTGCCCGCC GAACTCGAGC ATCAAATCGT TATGAGCATG GTTGAGAGAG TCATCGCGAT GATCAGGTGT CGAAGGGGAT TGAAAGCGAT TGAAAGTTTC GAGCAGGTGG CGTTCTTGAT GCGATATGTG GGATGGTTAA GGCAGGGTGC AAGGAGCGCC TGA
|
Protein sequence | MSRAATATPP PQTYFPQRNA TPSMRHAAPS PAIHFGRQHQ HLGQSHVKRR VKGNGGSGLK KQHIPNSREQ NEEDQLGMSF EDLGAERVLQ NPNSDKADID VDIDIDLDLD PNLTPASSRA SIVSGLSQDQ GPVLLTPVQS DSEGRGPVLV SNSGRVSSAG SEGGEEEGRA SPGWMGKMAM AVVNTGMSVS RGFGNAQAHG KRVQSQPQPQ PQAQAQAHDQ TLAHPAEKID DQQRQREWAE AENRRIHECA RLCSQWPLSG YNLGKYGPGG AATFYQPQSF CNPQHVAWVM HRQAQIEQRL ALTEGTFFDC RKVRQRKRQG DCENGRGCRA REEDEQSEVS TSTDDSTLSS RLSGSTCSTD SRSNSTANSS NPSSSQDENN SEPLSLVQPQ RPSAPRSLTA EHYAYGRTAQ KMEREKEMAE DLREAMACSL LEFSHPERRG SVTSSATKDV SADGRDENGE KLENAEVEGR RERGRKMFAE LEALARDVGL GHVGEGMDVD EDLCEESQPQ PQPDTQATPS AQTHAASYSS EPSMPRTRTQ RAQSCGSKRP IPSTRCSADG EEEKRRKVRP VAPPTTSSGF DPMCTATMEV EEAVDEAMVV EEAVDNGVIM PATQTQSQNQ SHTGESRLKT GRGKGMSSSV PDLSKTYPHS ESATTYPPQR QQQPEQSNSR DIYPPPAVFG VVVKTSESHP IIISPFFPKD LLGILAEHLV LPLGGARKPL LLGSKLDVPS LLLSYSPGMP FPSSGSMRNQ SQSTQTQPVS NLCQAPRVPA LGNLLLSSCP GKRLRMEGPS KGRGPVCRDL ATDLKRIKGE GVGCLVWLPM PDGFTPVSME LFDSQVSLIA TKYTLQGINV LVHCRGGVGR AGMTACAWAI KMGFVQPHPS LVIVEEAARQ RYNLTHGISP TSNSPTPIAP SAAVPAELEH QIVMSMVERV IAMIRCRRGL KAIESFEQVA FLMRYVGWLR QGARSA
|
| |