Gene Information |
Locus tag | CNC03330 |
Symbol | |
ID | 3256643 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 1054441 |
End bp | 1059301 |
Gene Length | 4861 bp |
Protein Length | 1469 aa |
Translation table | |
GC content | 48% |
IMG OID | 638255556 |
Product | conserved hypothetical protein |
Protein accession | XP_569992 |
Protein GI | 58265672 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
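The coordinate fields above agree with each other if Start bp and End bp are read as 1-based, inclusive positions on the replicon: 1059301 - 1054441 + 1 = 4861 bp. The sketch below checks that arithmetic and shows how a minus-strand gene could be pulled out of a replicon sequence by reverse-complementing the region. The function names `gene_length` and `extract_gene` are hypothetical, the 1-based inclusive convention is an assumption inferred from the numbers in this record, and the short replicon string is a toy example, not data from NC_006685.

```python
# Assumption: coordinates are 1-based and inclusive, as the record's
# Start/End/Gene Length values suggest. Names here are illustrative only.
_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def extract_gene(replicon_seq: str, start_bp: int, end_bp: int, strand: str) -> str:
    """Return the feature sequence in its own 5'->3' orientation."""
    # Convert 1-based inclusive coordinates to a Python slice.
    region = replicon_seq[start_bp - 1:end_bp]
    if strand == "-":
        # Minus strand: complement each base, then reverse.
        return region.translate(_COMPLEMENT)[::-1]
    return region


# Values taken from the record above.
print(gene_length(1054441, 1059301))

# Toy replicon, not real data: positions 3..6 are "ACCG".
toy_replicon = "AAACCGTT"
print(extract_gene(toy_replicon, 3, 6, "+"))
print(extract_gene(toy_replicon, 3, 6, "-"))
```

With a real replicon sequence loaded as a string, the same call with the record's coordinates and strand `-` would be expected to yield a 4861-base sequence, provided the coordinate convention assumed here holds.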
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00201155 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AACGACGAAT TCTCTAGTTA CATTCACCAT TCACCTGGCA TTGCACATCT TATATATAGG TCTATCCATT TAATCTCAGT ATAACCCTTC CGTATCTTCA CAACAAACGG CAACGTATGA CTCAAAAACA ATCCTTTGCC GATTTCCTCG CGTCCGGCAG TCCGTACGGT GGCAAAAATC GCGGCCGTGG AAGTAGTAGC AGGGGTCGGG GTGGTGGCGG TCGAGGAGGA TTTTATGGCA ACGCCAGCAA GAAGGCTTAC AATGCCGACT ACTCCAATGT TCCTTTCGAC TACGATAGAA TCAATAGCCA GCACTATAAG AAACTTGAGC GTGAGTATCA AGTATGGCGT TACCGTATGA GGTACGTTAC TAATATGAGA CCTTGTTCTT TCCTTAATCA GCCGCTTCTG TTGCACCGTA TGGTAACCGA GACATAAATC GCTTTCACAA TGGTCCCGAT AAGAACGTCT CATCGTCATC CTACCATGTT CCTCGTCTTG CGCATACACC GGGGACGGCA ACTCCCAATA AAATACATGG ACTCGGTTTC CAGACCTCCA GTCAGCTATC TCAATCTGAT TCGGTTCATC CATCGGTTGA AATGACAAAG AACAAGACGG CTAGTGAGAG CAGTCGTGGA CTCAGTGGTT CCCGCTGGGG TGGAGGGCTG GCGCCTTTGT TCATCAAAGC GGGGGAATTG TTTAAAGACG GAGAGGTCGA TGTCATTACC CAAGAAGAAG GTGAGATTTT CTCGTGCCTG TCACCTTTGA GTCACTGACG ATTCAACTTT TTTGACCTCA GATAAAGGCA TTGATGTTGG ACATTACTAT AAACCGGAAG CTTCTGAAAC TCGAATGGCA GATACCGAAA GCCAAGCAAA CACCCAACTT TTACAAGCGG AAGATGGTGA CAACACGAGC GACGCTGTCA TCGATGGTAA AAGCGAGCAG TTAAAGGTTG ATAGCCAATC TGGTTCTTCT CTTCCACCTA CTGCTGGTGA CGACTCCAAC ATCTTTCAAG TGGTTCAGTA CACCGGAGAA CTTTCTATCG GCCCCAGCAT AGGCAACTCT CATCAAGATC CTGTTGAAGG GGGTTACTCG CGGAACGGAG GGATCATGGG GATTCCAGTC CCTAATGAAC ACCAGTTTAC GGGATCAAAC ACCTTTACGA AAACATGTCC AGACATTATG TCTTCCTTCG ATCGTCATGA ACATGTTGAT GGATTGGTAT CAGAAGCCGA CTTGCAACCG CATCAGGAAG ATACGTTATT CTTCATCGAT ACAAATCCTG ATCCTGATAG TACGCCGTCT CAGACCCCTC AGTACAATAC TGTTGCAGCT CCTCCTATTG GTTCATCTTC CCCTGATTCC GACGAAGAGG ATATTGTTTT TGTCCCTCGA GCTTACAATC AGCAGAAAGC CATTGGACCG CACGTTTCCT CACTCCATCA ACGCCACGGA ATGAGGTGTT TGGGTCAGGG TTCTTCGAAA TACCAGTCCT ATATCGAGAT TGTCGATGAT CTCAATGCGC AACCCCTGTT GGACTCAACT CGCCAAATTC AAAACTTTGT TGAAGATGCC AATCCTAATC CTCAACCGCC CTCAGTGCCG GCGTCTTTAC GTACTGAGGA GCTTCCCGAT TCGATTTCCG GATATATCAA TGCTCCTGCG GCCTCGTACA CAGACGAAAA AATGGTAAAA CATCAAAAGA AACGCGCGAA CAAGAACGCC AAGCGTCTGG CCCGTCGAGC GGCCGCGCGT GGAGTTCCGA GAGACGACTC GGACATTGAA TGGGGGTCTG 
ACGGCCCACC TAATCTCGCC GGGGGCGAGC AAGATCAAAT GAGCGTTGAT GATTTGATTG GTGGCGGAAA GGGAAAAGAA AGTGAAGCTA TATTAAGAGA CTATTTGGAG GGCGTAAAAC TGTCTCAGAA GGCGGATGAA GACGAGGATA AATACGAGGA TGATGATGAT ATGGCAGCCC TTTCCAAATG GGCCAATCGG ATCAATGTTC TCGGGAAAGA AGTAGATTCG GATGAGGATT CCGACGAGGA ATCCAGTCCT CCTTCGGTTC AGGCCAGCGC GAGCAAAATA TACCACGGGC CTGAAGATAG GAGAGACGAG CTCAGACAAG ACGAGAAGGA TGATGATGAG CTTAGAGAGC TTGAAACGTT GCACAATTTT GCCAATCGAA TCAATGCCCT TGATGAAGGT CAAAACCTTA GTGAAGGTGA GGAAATCACG TTAGTCCAGC CTCAGGCTCC AAAAATCTCC CTTAAGCTTC AAGAGAAAAG ACGCGAGCAA CACGAAAAGC TTGAACACGG TGATGGCGAT GTTATGGACG AGGAGTTGAT GGAGTTGGAA GCCCTTCAAG AATTTGCGGA CCGTATTAAT GGTCAAGGTA TTGAAAATGA GTCCAGTGGC GATGATATGG GCGAAATTCA AATGCAGAAA ATCCAATCCC CCCCGAAAGC ACGCGTCAAG CAAGTCAAGT TTTCTAGCTA CAATCAAGAA ATTACGCATG CTCAGTGGGA AGCCGACGAA GAAAAATTGG CTCTTGACAA GTGGGCACTC AGCATCAATA CTTTTGGTGG AGATGACTAT GGCGACGACG ATGACGACGA GAATATGGGG GTGTTATTGC CGACACAGCA GGACAACAAG CTACAACCAC GGTTCATGGA ATGTTTTGAA CCTGATTCAG AACAGGACGA GGAAGACGCT GAGCTGGATG CGCTGGACGC TTGGATCAGT CGGGTGAATT CACATAGTCA AGGGCAAGAT TTGAGCGACG ATGAGCTTGA CACAGCGTAT ACTTCCAGCA AGCCAAAGCT GAAGAAGCCT GCACTCCCCA AGATGAGACA AAGACAAAGT TCTCCGTCAT CTCAGCCCAG CCTAAATCAG CAGTCTAAGC ATATGTCCCC TGAAGAGCTT TCCCAGCCGA CGGAGCTTGG GGATAATGAA GGTTGGCCTA TTTCGGAAGC GGAGGCAACT GAGGACAATC CTTGGGCATT CGATCCCAAG GAAGCTGAAA AAGATAATCC TTGGGCAGCA AAAGACCCAA AGGAGGAGCA TAATCTGCGG TCAGGCGAGG TAGTTTATGA GGAGCAAAAT TTTACTCAGT CGACTTCTCT CCATGCGGAC ATATCGTCAG ATTCACCTCC AGAACAGCCT GATGTTGTTG TTTTTAGTAC AAGTGAAGAA CGTTTAGCCG AACAAAGGCG CGAGTCAGAA GGCGGAGACT GGTCGAATGG AAAGATCGCA AGCAAGGTGT TCAATGAAGT ATACAGCGAG GAAGAAGATC TAGAGGCAGA AGGTGACGAC TCATCTAGCT CATCGACAGG CAAATGGAAA AAATATGAAG CCAAGCAAGA GGAAGGCATG TTCAACGGTG TCAATCATTG GAGTGACGAT GATGAGAGCG AGGATGATAG TGATGAAGGA AGAGATGAGA TCAGCGGTGA AAATGACGGC GACAGTGACA GTCATTACGA AAGTGAAGAA GACGAGGAGG ATGACATGGA AGTGGACGAT TACGATCAAA CGGAATGGTT CATCAACGCC ATGGAAGTAA GTGATATCCA TGATGTCTTG TTATCTGTAT GCTCACGAGC 
CTCCGCTAGG ATGCCCTTGG CGGTAAAGAA ATTAATTTCA ATGATCCCAG AGCCAAGATG TTCAATGCCA TCAAAGAGGA TTCTTATGAA TTCAGTTCGG GTAAGTGGTT TTCTTTCATA ATTCGGCCAA TTTCCTAATT TGTTTCCTTT TCAGCCCCAG CAAAAAAGGG CAAGAAGAAC AAGCAACTCA AGGGCATTCC AATGGAATTG CAAGTTCAGT GGGAAAAGGA TCGTCAAACC AAGGCCGAGA AGAAGCGTCA ACGTGAGCTT GCGCGCCATG CGGAAGAGTT TGACTCCACC ATCATCTCTG GGTCATCTCG AAATGGCAAA AACAAGCGTT ACAGCAAGAA AGGCGGCCAA AGCAAGGATT CCAAGCAAAT TTATCAAGCT TCTGTTGCTC ATCTAATTTC CGGTTCAGCT GCTGATGTCG CCGACATGTT CTCTGAAGAA GAGGGATTAT CAGACAGTGA ACATGAGGAA TACTTTGGGC ACAGCGATCG TTATGAGACG ATTGGCAATG ATCCTTTCAA CCTCTCTCTT AAACCTAAAA ATGCCCGTTT CTCGATGGGC TTCAAGAAGA ACAAGAAGCG AGGTTTGGAA AAGAAAACGG TAGCCGGTCC CGAATGGAAG ACGCTCGATT GGGTAGATGA CCTCATCCAA GCTTTCCTCA AAGATAAGAA AAGCGAGTCT ATGAGCCTAC CTGCCATGGA TAAGGAAGGG AGAAAGAAGA TCCACATGCT TGCGGAGTGT TATGGTGTCG GCTCTACATC ACGCGGTTCT GGAAAAAAGA AATCCATGTG AGTCCGCAAT GTTTGCAGCA TGGTGTTTGT CACTGACGAG AATCAGCTCC TTGTACAAGA CCAAACGCTC CGGGGTTGAT GTGAAAGAAG AAAAGCGTGA ACGTCTTCTT ACGGCCGCTC CTTTCTCTGG ATCCATGTTC CACAAAACTC TCTATACTAA AAGTAGCCAT GGAGGCGTCA AAAGCAAGGG CAGGGATTGG GCAAGCGGTG CGACTGCCAA ACCTAGGGAA GGAGAACTTG TAGGGTACGG GGCAGACAAG ATTGGGATAG ATAATGTAGG CCACAAATTG TTGAGTAAGA TGGGCTGGGC AGAAGGAAAC AAGATTGGGA TAGGGAGTGG CAGTGGCATA GATGCTCCGT GAGTCATCAA TTCTCTCTTC ACATTCACCG CTGACCGTTC GACAGAATCG TCGCTGTAGT CAAGAACACT AAGAGTGGGT TGGGAGCCTA G
|
Protein sequence | MTQKQSFADF LASGSPYGGK NRGRGSSSRG RGGGGRGGFY GNASKKAYNA DYSNVPFDYD RINSQHYKKL EPASVAPYGN RDINRFHNGP DKNVSSSSYH VPRLAHTPGT ATPNKIHGLG FQTSSQLSQS DSVHPSVEMT KNKTASESSR GLSGSRWGGG LAPLFIKAGE LFKDGEVDVI TQEEDKGIDV GHYYKPEASE TRMADTESQA NTQLLQAEDG DNTSDAVIDG KSEQLKVDSQ SGSSLPPTAG DDSNIFQVVQ YTGELSIGPS IGNSHQDPVE GGYSRNGGIM GIPVPNEHQF TGSNTFTKTC PDIMSSFDRH EHVDGLVSEA DLQPHQEDTL FFIDTNPDPD STPSQTPQYN TVAAPPIGSS SPDSDEEDIV FVPRAYNQQK AIGPHVSSLH QRHGMRCLGQ GSSKYQSYIE IVDDLNAQPL LDSTRQIQNF VEDANPNPQP PSVPASLRTE ELPDSISGYI NAPAASYTDE KMVKHQKKRA NKNAKRLARR AAARGVPRDD SDIEWGSDGP PNLAGGEQDQ MSVDDLIGGG KGKESEAILR DYLEGVKLSQ KADEDEDKYE DDDDMAALSK WANRINVLGK EVDSDEDSDE ESSPPSVQAS ASKIYHGPED RRDELRQDEK DDDELRELET LHNFANRINA LDEGQNLSEG EEITLVQPQA PKISLKLQEK RREQHEKLEH GDGDVMDEEL MELEALQEFA DRINGQGIEN ESSGDDMGEI QMQKIQSPPK ARVKQVKFSS YNQEITHAQW EADEEKLALD KWALSINTFG GDDYGDDDDD ENMGVLLPTQ QDNKLQPRFM ECFEPDSEQD EEDAELDALD AWISRVNSHS QGQDLSDDEL DTAYTSSKPK LKKPALPKMR QRQSSPSSQP SLNQQSKHMS PEELSQPTEL GDNEGWPISE AEATEDNPWA FDPKEAEKDN PWAAKDPKEE HNLRSGEVVY EEQNFTQSTS LHADISSDSP PEQPDVVVFS TSEERLAEQR RESEGGDWSN GKIASKVFNE VYSEEEDLEA EGDDSSSSST GKWKKYEAKQ EEGMFNGVNH WSDDDESEDD SDEGRDEISG ENDGDSDSHY ESEEDEEDDM EVDDYDQTEW FINAMEDALG GKEINFNDPR AKMFNAIKED SYEFSSAPAK KGKKNKQLKG IPMELQVQWE KDRQTKAEKK RQRELARHAE EFDSTIISGS SRNGKNKRYS KKGGQSKDSK QIYQASVAHL ISGSAADVAD MFSEEEGLSD SEHEEYFGHS DRYETIGNDP FNLSLKPKNA RFSMGFKKNK KRGLEKKTVA GPEWKTLDWV DDLIQAFLKD KKSESMSLPA MDKEGRKKIH MLAECYGVGS TSRGSGKKKS ISLYKTKRSG VDVKEEKRER LLTAAPFSGS MFHKTLYTKS SHGGVKSKGR DWASGATAKP REGELVGYGA DKIGIDNVGH KLLSKMGWAE GNKIGIGSGS GIDAPIVAVV KNTKSGLGA
|
| |
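Both sequence fields are displayed in space-separated blocks of ten characters, so the grouping whitespace has to be removed before the sequences can be measured or compared with the Gene Length, Protein Length, and GC content fields. The following is a minimal sketch of that cleanup; `strip_grouping` and `gc_fraction` are hypothetical helper names, and only the first two blocks of each sequence are used here for illustration.

```python
def strip_grouping(seq_field: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(seq_field.split())


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string (case-insensitive)."""
    dna = dna.upper()
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two blocks of the gene and protein sequences from the record.
gene_prefix = strip_grouping("AACGACGAAT TCTCTAGTTA")
protein_prefix = strip_grouping("MTQKQSFADF LASGSPYGGK")

print(len(gene_prefix), gc_fraction(gene_prefix))
print(len(protein_prefix))
```

Applied to the complete Gene sequence and Protein sequence fields, the same helpers could be used to cross-check the stated 4861 bp, 1469 aa, and 48% GC values.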