Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG03570 |
Symbol | |
ID | 3258566 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 995090 |
End bp | 998131 |
Gene Length | 3042 bp |
Protein Length | 912 aa |
Translation table | |
GC content | 56% |
IMG OID | 638257981 |
Product | hypothetical protein |
Protein accession | XP_572056 |
Protein GI | 58269800 |
COG category | [K] Transcription [L] Replication, recombination and repair [R] General function prediction only [T] Signal transduction mechanisms |
COG ID | [COG0515] Serine/threonine protein kinase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.768082 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCCTTTGTAT ATCTACTTTT ACCACATCTC CACGTCTTCT TTTCCCTCAG CTGTACATCC ATCCTGCGCT CGCCAGAGCG ACTCCCACGC CCACGCACAA CTAGATACAA CCCACAGCAC GCAGCGTCAT TTCGTACGTA CATCTCGCCC CGGCCACCTC GCACCACCAC TGACACCACA TGCAGGATGC TGCATGCGTC AAACCTTCCA CCAATTCCCT CCCAAGCACC ACTGCATCTG AATACTTCTG GTCTTGACAG CGCAGCGAGT TCTCCAAAGC AATCTCCCGC ATCTCTTTTC ATCCAGCGGC TACATTCGTC GAGACCTACG AGTTATGAGA CCGCCTCAAA CTCATCAGGA GGAGCCGAGA GCAGTTCGCA AGGAGCGGCA GGGACAGCAG ACCTGAGCAG AATGGGGAGT CTCAGAGGAG GAGTCAGTTT TGGTGAAGCG GCGTCCAACT TGTCACCACA GGCCAATCCT GGCCAGGTCA ATACAAACAC CTCGGTGGGA TATACACCTC CTGCAGCGTC TCGCCACCCT TTGTTCTCCC ACCATGCTCT ACCGGCACGC CCGCCCTCGT TCTCTCCTGC GCATTCCATA TCGCCGTCCT TCGTTTCTGA CCCAGGTCCG TCCGGCCCAC CGTCCCCGGC CGTATCGGAT CTCTCCAATT TCTCCTCCAG CGCCTTCTCG CCTGCAAGCG CATTCCTCTC TCATTTCAGC TCGTCCAGCT CAGCATTACA ACTGGCGCCT GATGCGGAGG GCGCGAGAGT GAAAGATTAC ACTCTGGGCA AGCTGATTGG AAGAGGAGGC TTCTCAACGG TCAGAAAGGT GACTCTGCGC ACCAGTGGAA AGGTTCTGGC TTGTAAGATT GTCAAGAGAG ATGACCTGTC AGACATGTCG GGGTCGTTGG AAAAGTTCGA GGAGGAAATC AAGACATGGC AGAACTTGCC CCCACACCCG TCCCTCCTTC CGCTTCTCGA CATGCATAGG ACGCCGAGCG CAACTTTTCT GTTCACGCCG TATCTTCCCG GCGGGTCCTT GTTGGACATG CTCAAGCGGG AAGGCGGATC AGACAAGACT GCGCGGAAAT GGTTCCCTGG TGTGGTCTCA GCTGTATCCG CCATGCATGA GGGCTTCCCC GGGTTTCCGG GCGGGTTACT GCATGGTGAT CTCAAGTTGG ACAACTTTTT GGTGGATCAT CAAGGTAAAG TCATGGTCTG CGACTTTTAC ATGGCACAGA TGGTGGGCAG ACAGGAGGAT AGAACGGCAA CCATCCCGCC GCCGCTTAAC GCAGGGGTGA ACAGGCACTC GACTTTGCCG AGCAGTTTTT CAAGAGGGTC GTCAAGGATA CCTTCTCCCT ACCGTAGTCA CAATTCTCAC ACGCCACACC GCTATCCCAA CGAACACCAT TTACACGAGA ACAGCCACGC TCCTAGCCCG GCGCATACTC AGTCTTTCCC CTCTGCCTCC CTGCCATACG CACCACCCGA GCTGCTACGT GCACCTCCTT TGGGACCTTC TCTCGCTCAG GATATATGGG CGGTAGGCAT CGTCCTTCAT GCTCTTTTGA CAGGAAGGCT CCCGTTTTTT GATCCGTATG ATCCTAGGCT TCAAATGAAG ATCTTACGAG GGACGTGGGA GGAACCGCCC AACTTGGGTA AAGAATGGCT CGAGTGTCTG AAAGGATGTC TGGACGGGGA CAGGGAGAGG AGGTGGACGA TAGTAAAAGT GAAGCAAAGC GATGCGGTCT TGGGTTGGAA GGAAGTGCAG AGCAGGAGCA AGTCGAGGAG TCGGTCAAGG GTCAGAATTG GAAGAGGTAT GGGCATGGGT GACGGGTTTA TGGATCCAAT GAGGAGAGAT GGAGCCAGTC AGCCTGTGCC CATCATGTCG TCTTCATCGC TTAACCCAAG AGGCAAAAAG AGTTCGAGTG TCTCCAGATC AAGAGACAGA GGACACAGTT TTCAAGGTGG TGTATTCCCA GGAGAACATG GCCGAACACC ATTCGATCAT CCTCATACTC TACATCTCCA AGACCCACCG CCCCCACGTT CTCGGTCAGT CAGTGCATCG CGCTCTAGAT CTTCCGGTCA CCGCCCAATG TTCACATTCG ATGCGCCCGA GTTATCCAGA AACCTCGAAC AAGTTGACCT CAACCGCGGT CGGTCCACTC ACCGCGGCCT CCCCCCGGGC TCTACTTGGA ACCTCTCCGT CCCGCCTAGC AGCTCCAGCT CATCATCCTT ATCATACGCT CGTACTCAGG GTACACCCTC CAATCAGGGC ACGCCCGTTC CCGTCCCCGC TGCCGATCCG TCATACTCCA GGGCTGCTTC TCGCTCAAGG TCACAGAGCG CACACAAAGG TGGAGGGATG CCACCCCAAC CGGTACCGGT CCAGTCTTCA GGATTCAAGG GCGCGATGCC GATGGGCTTC ACGGAGAGTG GCTTAGGGAC TGCGTATAGT CGACCGGCAG GTGCGGCGGG GGTTGCGATG CCGGTACCTA TGATGACGCC CAATTCCAAA TCACGTTCCA GATCAAGACA CTCTCAACAG TCCCAAGCTA GCACATCACC CTCCGTTTCT CGCTCTCGCA GCCAAGCGCG CGACTCACCC GCTCTTTCTG CCGGCGCCGG CGGCGGTCAC ATGGGTTGGG GCGAACCACC ATATAGCCCA TGGGCTGAAA CCACTCCGGC AGTCACACCT GGTTTGGGTT CTGGTACCCC TCTCACTCAG GCGAGAAGGA GTGGTTCTCA GTTTGAGAGG TATGAATACG GTCAAGGGCT TGGTGCGGTG CATGAGGAAG ATAGAGGGAG GGATAGGGGA GTCAGGAGTA GGGAGCCGAG TATGAATAGA GAACAAAGGG ATCAGAGTAG GGGGAGGAAG GGGAGGCCTG CGTATCGGTC TGGTGAGAGC TTTGGCAGGA GTTGAGGGAC AGGAAAGGGA AAAAAAGGGG GGGGGGAAGA AGCATCTCTG TATTAACAGC ATGTGAGGGA TTAGCATAGC ATGTGTGTGT ATAATTAAGA AGCAATCTAC AAGCAATCCA TA
|
Protein sequence | MLHASNLPPI PSQAPLHLNT SGLDSAASSP KQSPASLFIQ RLHSSRPTSY ETASNSSGGA ESSSQGAAGT ADLSRMGSLR GGVSFGEAAS NLSPQANPGQ VNTNTSVGYT PPAASRHPLF SHHALPARPP SFSPAHSISP SFVSDPGPSG PPSPAVSDLS NFSSSAFSPA SAFLSHFSSS SSALQLAPDA EGARVKDYTL GKLIGRGGFS TVRKVTLRTS GKVLACKIVK RDDLSDMSGS LEKFEEEIKT WQNLPPHPSL LPLLDMHRTP SATFLFTPYL PGGSLLDMLK REGGSDKTAR KWFPGVVSAV SAMHEGFPGF PGGLLHGDLK LDNFLVDHQG KVMVCDFYMA QMVGRQEDRT ATIPPPLNAG VNRHSTLPSS FSRGSSRIPS PYRSHNSHTP HRYPNEHHLH ENSHAPSPAH TQSFPSASLP YAPPELLRAP PLGPSLAQDI WAVGIVLHAL LTGRLPFFDP YDPRLQMKIL RGTWEEPPNL GKEWLECLKG CLDGDRERRW TIVKVKQSDA VLGWKEVQSR SKSRSRSRVR IGRGMGMGDG FMDPMRRDGA SQPVPIMSSS SLNPRGKKSS SVSRSRDRGH SFQGGVFPGE HGRTPFDHPH TLHLQDPPPP RSRSVSASRS RSSGHRPMFT FDAPELSRNL EQVDLNRGRS THRGLPPGST WNLSVPPSSS SSSSLSYART QGTPSNQGTP VPVPAADPSY SRAASRSRSQ SAHKGGGMPP QPVPVQSSGF KGAMPMGFTE SGLGTAYSRP AGAAGVAMPV PMMTPNSKSR SRSRHSQQSQ ASTSPSVSRS RSQARDSPAL SAGAGGGHMG WGEPPYSPWA ETTPAVTPGL GSGTPLTQAR RSGSQFERYE YGQGLGAVHE EDRGRDRGVR SREPSMNREQ RDQSRGRKGR PAYRSGESFG RS
|
| |