Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF04750 |
Symbol | |
ID | 3258356 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 1380511 |
End bp | 1383688 |
Gene Length | 3178 bp |
Protein Length | 930 aa |
Translation table | |
GC content | 51% |
IMG OID | 638257593 |
Product | expressed protein |
Protein accession | XP_571617 |
Protein GI | 58268922 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTGCTTTGA ATTCTACTTC TACCCGTACT TCCTACATCT CATACATTAC AGGGCAAAAT CAACGAGAGT CAAGGTGATT TTATCTGTCC TTAAAGTAAT CCAGCTAACC TCTACTGTTG CGCAGCGTCC GTCCGAGCCG GCCATCCTGT CCACATCTAC TTCTTGAACA TCGCAACCCC CATCTCATAA CCCTAAAATC GTACATTGAA CTACACACAC GCTGTAAGTA CTGGAAATTC TGTCTGTGAT TTTGCCTAAT TCACCCGTTC AGCCTAGCGA TGTCCACTCC TGAAGCCGGG GACCAACCAA ATTCATCACT TTCCGCTCTT ACATCTTCTA TCGACAACAC CTCCTCCCCA GTCGACTTTC TCAACTCCCT CCTGGCACCT CTTCTTCCAC CATCTCTCCC ACCCCCTAAC AAGCCCCAAC CACCTTCTTT GCAGCCCATC GACACCGCCC TGAATGACCT CCTTACCCAA CTGTCGCTCC TGTCGCAAGA TACTGCGTCA GCGATCGAGC AAGGGATGAG CGATGTAAGC AGAACTGTCC CCAGGCTAGG CTACGACCTC CAATTTATGC GGGAGAGCGC CAATGGATTG TCTGTCAGTT TGGGGATGGT ACAAGGCCGG GTTGCGAGAC AGGCGGATTA TGAGATGCCG AACAACAAGT TCCCAGGTGG TGAAGAGAGT GAAGCTGTAA AAGCTTTCCG TGCGCTGGAA AAGATCACTC ATCTGGACAA GCTCAAAACA AGGCTCGAAT CTGCGCGGGA CACCTTGCGC GAAGCCGAAT CATGGTCTAC ACTTGAGTCG GAGATCACCA CTCTCATCAG TGAGAAGGAA TATGCAAAGG CCGGGCAGCG ACTGGCCGAG GCAAGTCGGT CAATGGTAGT GTTTAAGAAT GAGCCTGCAG AATGGGAAGA GAGAAAGAGA TTGTTGGTTT CACTGGGAGA TGAGCTTGAG CGGGTTGCAG GAGAAGCGTT GAGGGAGAGC TTGAAAAAGG ACGATGGTGT GGACGAAGTG CGAGCTTTCT GGGAGGTATT CATGGATATG GAAAGGGAAG AAGAGTTCAA GGGGTGGTAT TTCAAGGAAA GAGGAAGGGG GTTACTTGAG GCATGGAAGG AACCATTGGT GGAAGAAGGA CAGGGCGAAA ATTCATCAAA GCTATCCGAC TTTCTCCCCA AGTTTTACTC CCTTGTACTT CAAACCCTTT CAGCCGAGCT ATCCTATATA CCCCTCGTCT TCCTTCCAGA ATCATCACCC TCGATTTTGG CGTCATTTTT CCAGTCCACA CTCGACTCCC TCGATCCGAC GTTCTCAAAC CGCCTCGCCG CCGTTGCGGA CTATCACGGC CCTGGTGCCC TTCCTGAACT CGTCAAGGCA TGGGAAGCGA CTGTTGATTT GGGAGCGGGG GTACAAGGAT TGATTGACAA GATCATATTC AACACTCAAG GAGGCTTGCT CAGCGGTGGT GCCGGCGAAA TCGATGTTGA ATCACCCGCC ACCATCTTAA CCTCCCCTGG CATCTCATCA TCTTCACCCA ACCATCCCAT TCCTCGCACA AACTCCCACT CCCATTCTAA ACGGCATCAG TCCATCTCTC GCAGATTCTC CCGCGCACCT AACGCTACCA CTACCTCTCT TTCCCCTTCC CCCGGGAACG TCGATGACGC TTGGGAGACG ACCCTGTATG AACCATTCTT GGACTGGCAG TCGTCCTACT CTTCTTTGGA GAAGAGGTGT TTGGAGAAGG AGGTGGCGGA CTTGAAGACA TCATGGGAGA AGGCAAACAT GAAGCAAGGT GGGAAGGATG TAATGAGCGG GATGATATCA CGTACGGCCG AACTCAAGGG TAGGCTTGAG GAAGCAGTAG CGCGCTGCAG GATATTCACT TTTGGCTTTG GGGCAGTCCA TCTTATCCGT GCTGTGGATA CTTGCATCTC CAGATTTTTC GACGATGAGC GGACCTCCAT CCTCAACAAC GCCAAGTCTA AGAGGGATAA TAACAAGCAG AAGGATAAGG CGGATGAGCT CGATTTGGAT GAGTTGGATG ACGATGGTGG GGATTGGAGT GGGTGGCAAG TAGGTCTGCA CATCCTTGAT TCACTCCAAA AGGTGGCAGA GAAGCTTGTG GCTATGGAAG ATGGGTTGAA AGCCGAGTTG AGTGAGTACG CCAAGATGCT GAAAGCCCAA AAGGGGGAGA AATGGGATGG CCAATGGGAC GGAAGAAAAG CAACGTTTGG CACTGTGTCT CTTTTGCAGC AGTCCACGCT CAACACCGCC GATTTACATG CTCTCATTGC CTCCGCGCCA TCACCCATCT TGCCTCAATC TAAATCCTCT CTCCTCACAT TCATCCGTGA ATCCCAAATC CATCTCCAAC AGACTATCCT TTCTCCCCTT CTCACCCAGC TCGACACGTA CCCTTCTCTC GCGGTCTGGA TCAAGGCAGA TAAACAGACA AAAATCAGAA AGGGAGAACT GTACGTACCG CAGTTTAGTT TGAGTCCGAC GGATGTGATC ACGAGGACGT CGGAAGGGCT GTTGGATCTG TTGAGGGTGT TTGAAGTGTA TGGCGGGGAA AAGGCGTTGG GGTGGAGTTT GGGGAGTTTG CCGTTCGTCG AAGGGATGGG ACATGCTGTC GCTCTTGATT GCCTTTCCTC TTCCAGAAAG GATACGAATA AGGAACCTTC AACCTCGATA TCCGACCCAA CGACAACAGC AACATCGACC TCAGTATCAC TCGCACCCAT CCCTTCCTCT ACCCCCGCGC CTACACCAGA AACGATCCAA ACCACCTGGA TATCATCTCT CACCCTCTCC CTCCTATCAC ACTTTACCTC GTACACCCTA CCTTCCATCC AGCACCTTTC TCAGGAAGGA CAAGCACAAC TGAAAGAGGA TTTGGGATAT TTGGAGAATG CGGTGAGGGC ACTAGACGTG GAGTGGAGTG AGCTGGGTGA GTGGGCGAGG GCGGTGGAGA TGGAGGAAGA GGAATGGAGG GAGAATGTGA AGCGGGAAGG AAGGGAAGGG GCTTTGGCCG CTGTGGGGAG GATGAGGGGA TGGAAGTTCT GAAGGGCTTA TTTTCTGAGG TGGGCAAATT TCCCGTTCGC TACACCATAA AAGAGTATTT TGCTAATGGA CCCGTTTTCA ATGCTTAGTC CATCAGCGTC TTTAATTC
|
Protein sequence | MSTPEAGDQP NSSLSALTSS IDNTSSPVDF LNSLLAPLLP PSLPPPNKPQ PPSLQPIDTA LNDLLTQLSL LSQDTASAIE QGMSDVSRTV PRLGYDLQFM RESANGLSVS LGMVQGRVAR QADYEMPNNK FPGGEESEAV KAFRALEKIT HLDKLKTRLE SARDTLREAE SWSTLESEIT TLISEKEYAK AGQRLAEASR SMVVFKNEPA EWEERKRLLV SLGDELERVA GEALRESLKK DDGVDEVRAF WEVFMDMERE EEFKGWYFKE RGRGLLEAWK EPLVEEGQGE NSSKLSDFLP KFYSLVLQTL SAELSYIPLV FLPESSPSIL ASFFQSTLDS LDPTFSNRLA AVADYHGPGA LPELVKAWEA TVDLGAGVQG LIDKIIFNTQ GGLLSGGAGE IDVESPATIL TSPGISSSSP NHPIPRTNSH SHSKRHQSIS RRFSRAPNAT TTSLSPSPGN VDDAWETTLY EPFLDWQSSY SSLEKRCLEK EVADLKTSWE KANMKQGGKD VMSGMISRTA ELKGRLEEAV ARCRIFTFGF GAVHLIRAVD TCISRFFDDE RTSILNNAKS KRDNNKQKDK ADELDLDELD DDGGDWSGWQ VGLHILDSLQ KVAEKLVAME DGLKAELSEY AKMLKAQKGE KWDGQWDGRK ATFGTVSLLQ QSTLNTADLH ALIASAPSPI LPQSKSSLLT FIRESQIHLQ QTILSPLLTQ LDTYPSLAVW IKADKQTKIR KGELYVPQFS LSPTDVITRT SEGLLDLLRV FEVYGGEKAL GWSLGSLPFV EGMGHAVALD CLSSSRKDTN KEPSTSISDP TTTATSTSVS LAPIPSSTPA PTPETIQTTW ISSLTLSLLS HFTSYTLPSI QHLSQEGQAQ LKEDLGYLEN AVRALDVEWS ELGEWARAVE MEEEEWRENV KREGREGALA AVGRMRGWKF
|
| |