Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF00300 |
Symbol | |
ID | 3258382 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 99304 |
End bp | 104321 |
Gene Length | 5018 bp |
Protein Length | 1437 aa |
Translation table | |
GC content | 51% |
IMG OID | 638257151 |
Product | hypothetical protein |
Protein accession | XP_571661 |
Protein GI | 58269010 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.133228 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCTCCC TACCCTCCCT ACCTTTTGCT GCTCTTGGTG CAGGCCACCA ACTCGGCAAA GACCCAGCAA CAGCTTCAGA GTTGATGAAG AATCAGAGGC CAGAGAGCGA AGCGTTCAGC CGAGGAGCAG GGGCAGAAGG AGGGATGAAA AGGCCGTATT TCTTTGGAAA CACTGGTGTC GATACACCTG GCGGAATGAG TGGGACAAGT CCAGGGATGG GAACGTTTGG GTTGGAGGGC AAGGAGGTGG GGTCGAAGAT GCCTACCGTA ACGTTCTTCG AATGATATAC TGGATGTTAC TGAAAACAGC TATAGGCTCT CGGTTCTTCT AATAAAACTC CCATCCCGAC TCAGGCACCC AATCCAGTAG GCCTCCCTGA ATCTCCTAGG CAAGGTGCTC CGCAACCCAA AGTTGACTCT GACCAAGAAG TCCAATATCA CATCCACAAA CCCCATCATA AATTTCCGTT CTCACATGCC GAAGCGGAAG CTAAGCAACG AGAAGGGATG GAGGATGCTC GTGCCGCGGC CCTCGCACAA CTTGAAGCAG CCCAGCATAT CGTCGACGCA AAGCTGGAGA GCGAGATGAT GGAGATACTT GGGATGATGA GTGGTGCTCG ACTTGATGGG GTCAATGCAG CCGGCGGGCC CGGTGGGAAC GCCCCCAAAG GACCTCGACA GACTTTCGAT ACTAATACCA GCGGACCTTC CTTCGGTCAC ATCCTTCACC AAGACGGTGT CCCAATCTCT GCCTGCCCAT CGATGCGAGT TGCTCCTGAA GGCTTTGCCT CCGGCCGTCC ACTAACCGAA CTCAGCGAAG TCTTCCGAAT TCCTTTTGGC GTACCAATAG GGATGTCTAT ACCGAGACCG TTGATGGTGA TGAGGGACAG TGCCGGGGCA CCTGGTGACT TAGGTCTTGG AATGCCGGCA TTGTTGAATA TCATTCAAGG TCATGGGGAA GGATCTCCGC TGCGGAGGAA ACAAGAAGAT AGGAGTGCGT ATGTCGAGAC TGTCGAGGAT GTAAGTCTCT CAAAAGTTGA CTGTGAATTG ACTGTAGCGG CTAACCCAAG TATGCAGGAG GAAGAAGTTT TATCGCCGAT ACGTACAGAT GTTCGTACAG AGGTTCCATC CAAAACATCT TCTCCTGCCG AAGGGGCTGC CGCTGGCTTC GGATCAACAG CCCAGTCTGT CCCACAAGGT GATGTTTTCC CTGCCAAAAC TGAATCCAAC GTTTTGGGTG ATCCAAGTAC GGCGAAATCG AATGCAGGGA GAATTGGGTT TACCGTGCGT GTCATCAGTT TATCAGTATC CTAGTATTAT TTTATCTTTA TCTTGGGGGA AAGGGAGAAA AGAAGCTCTC CTTTGCGCCT TCTGCCTTAC TCCTTTACTT TGGGTGATAC CCTTTGTGCT TTTATTGGCA TGTTGGTGGA TGGGGACGCG CATATGAAAT GGAACGAATG GATAAAAGAA GGATAGCTGA CTTCCTTTGA TGCTCTTTTC CTGGAACAGG CTCATTCACC CAGCCAGCCT ACACAGCCGC CTACTGAACA ACTCATGGCC AGTACTGCAC ATCCTCTCAA AACTTCTACG CATCACGCCC ACCACTCGGC AGCAGCTTCT TCTAAGGTCG GCAGTAAAAC CCCCGGTGCC AAGTCGAAGG TACCATCCAA AGCTGCGTCT AAGGCGGCCA CCACAGCTGC GAGTAAATTA CCTACAGCTT TAAAAGCTCC GAAGGCAAGA GTGCCTAGTG TGGGCTCACA AGCTACGACC GTGCGACTCC CACCGGCAAC GTACACTAAA GGCGAAAGAG CAATGTCTCA TGCCCATGTT TATCCACAGT CTCAAGCCCG AAGTCAGGTA CCTTTTCCCA CGCAATCACA GTGCGAGTAC ACTGCTGTGC CCGAGGCTCC CAGTCCTGGG TCTCAAGCTC AATCTCCACC TCAGCTCGAC CAGAAGACCA TTTACTCGCA GGTCCGAAAA AAGAATACCC CTACCATCAT CCCCGGCCAG TTCCGTGCTC AAGAATACTT TCCTTCTACC CAGTCCGCTA ACGATGCGGT TGAGCACCAT CATGAGCTCG AGAAGATGCA AGGCGCAACA GGAGAAGGTT CCATCGTAGC CTCTCAGCAT GGTTCTGTCG CGCCTACAAC TATGTCCAAC GTCAAGCGCA TCGTTGATGC CTCGAGATAC CACGATGAAA CTCTCTGTCA GCTCCTTGAC GCTGCTCGGC TGAACCTCAT TGGTGAAGAA GCGAAGAAGG CGCTTCAGAG AGCCGCAAGG GCGCGGGTGA TTGAGTTGAG AGAGTTGAGG GCAGCAGGAG AGCTTGAGGT GGGGTTGTTG AGCCCCGCGG TGGTTCTGCC ATCTCATGCA GAAGAAACGC CAAAAAGGGC GAAGAGAAGC AAGAGTAAAG AAAAGGGACG GACTGTGAGT GGGAAGAGCG CAAATAGTGC GAAGAGTGAG AAGGGTGCAG CGAAGCAGGA GCCGCCCATT TGGGCTCAAG ATGTGAGTGC TTGCTTCTCA ACTGCTTGGC TTGGAGGTAC AATCACTGAC TGTTATGGAG TAGATTATGA ACCGTCTCTC CATGTTCGAT CATCGCTTTG CGGCTCTCGA GACTCAGAAA GACACTCGTG GCCCTCATAA GTCTCAAGTG TCAAGGGACA TGCGTAGCAA ACTAATTGAG GAGCTCATCT TCAACGACTT ACATAGCTCA ATTTTTACGA ATGGTATCAA CACGCCTCAT GGATCGACTA ACTCAGTTCC TTTTGGCCAG TCTGTGGCGG CTCCTATGCC CGCTGCTCCA CTTACGGCTT CTGAAGATAT ATCTTTCTCC TTGGGGCAAA AACAAAGGGC AGCAAGCGAT ATGCCGAGTC ATTACAGCCA TAAACCATTG AGCCAACATG TGCCGGGGAT TGCTACTTGC ACTCAGTACG GTGGAACTCA AACGCCTGTT CAGTATGCCC CTAGTCAGGT ACCAGCTACC CAATATGTTC CCACTCAACC TCCCCCGACA CAGTACTCCC AACAAGCTGG TGCTGCAAGC CAAGTCCGAA GCGTTGGTCC TCTCACCGAA CGACGTCCAA GTGCCCAAGG CGATCTCATT TGGGGATCGG AAGTTGAGCT TCCTCATCCT AACGAAAGTA TCTCTTTCCC GCATATCGCA GGTCCTACTA TCAACGTCCT TCCTCCTACA GAGTCTAATA TGGGCAACAA CAAAGCCAGT ACTCGAGCTA GCCCTCGATC ACGAACTTTC TCTGCTTCGA AGTCCCATCC TTCTACCACA GTTGGCGGCA GAATGGCTCC GGATGACGGT CGAATTGATG TAATCACCGA GCAACCTCTA TCCGTTGGCG CAATGCCGGA CCCAAGGGAG AAGGAACTGC CTCCCCAACC TTCTGAGAGT GTCAAGAATC AGCCTCCTGA TCCAACCTTG TTTGATAACA TGACCGCCGC TCAGAGGACA CAACCTGGAA CAGTGACAGG ATCAGCGATA CCAAAGAATA TGGCGCAGAG TGCGTACAGC TATGGCCCTC CAGATCAGTA TCCTCCTTCG GCTCCTAGGC TACGTACAGC GAATACACAG GTTCAATCAG CTGCTGATGT TGGCGTTAAT GCTGGTATAG TCACTGGATC CATTTATGGG ACAGCTATAG AGCCTCCTAC GACGTCATTC TTTCCCGGGC AGACAATGCG TCATATTGCT GTACCTGGTA CAACCGTCCC CCCTGCTCCT GTCTCTCCTA TGCACTCAGA TCCTGTACCA CCAACTAGCC AACCCTTTAG TGAACCTCTC AGCACGATTC ACATGAGGCT GCACGAAACA AAAAGTTCTC CATTTACAGC CACCGCCCCT GGTAAATCTA CTTCTGCACC CCGAGATTGT ATCCACGTGG TGGACCCCCT ATTTTCTCCT ATGGCAGGAT GGAAACCATG GGATATGTTA ACCCAAAGGT TGTACTCGTG GGCATTAATT ATGGAGGAGA AGAGCTTTGT GAGGGCTTTG GAAGATATCA GTTTGGGAAG GCAGGTAGAG GCATTTCCGT TGAGTGTTTT TCTGATGCTG GCTTACAAGA GGTAAGTCAT CATGTCCTGC TCATCCGTGA CTGGGAGCTA ATGGGTTTGC AGATGGGTGA GAAGGAACTT GAGTGAAACG ACCGCTGCCC CATGTGATAA GTTGTTTGTG CCACCCAACC TAGCGGTCGC AATTAATATT GCTGTTCATA GTGTGAGTTT AATTCCCTTT GCCTTACTCC TATGCTGATC CAGGCCATAG CGACGATATC ATGAAGCCAA GGAGATCTTG TTGGAACTTT GGGATTGTCT CGGGATGAAA GAACACCCGA GGATTATCGT TGCGCTTGCG CCACTAGGCG ACGAAGTGAG TTCTCCAATG TTCATGCGAG CTGTCTCGGG TGCTGACGGA CCAGAAGACG GATCAATGGG CCGCTCATCG ATATGATCTT TCCTCAAAAC ATCTCACGAC TTACCGAGTC TCACACCTTG CTGAGATTCG GACTGATGGT CGCTCTTTCT GGTGTATGTT TCTTTCATTC TACCCATGTA ACAAACATGT TGACCTCAAA TCACAGGGTG GGAGGCGATC AGGCAGGCTT GGCCTGAACT TCAAATACCC GAGATGCAAG AGCTCGAAAA ACGAGGCGGT CAACGAATCA TTAACGAACA TCGACCACCA GAGTACAAAC ATGATAACTC TTTGTACGCT GCCAACATCA GCAGAAACCT TCTTTTGTGA GTCTAATTCC CCATCTCATA GTGATACATA CTGACTGTAA TTACAGGGGC TACCGACCCG AGAGACAACA TGATTTGGTG AAACAACGAG AGATCATTTG GGCTGAAGTC AAGCGTCTGC TCCACAAGAA GCGTACAGGT CGCCTCGTTG TGGAACCCGA TTCCCCAGAG CACTTGTATG ATACCTAAAT CTGGTTTCGT ATATCTTTCT AGACTCATTA TTGTGGCAAA AAGCGAATGA TCTTGTGAGC AATTCTGTAT ATAACGTA
|
Protein sequence | MPSLPSLPFA ALGAGHQLGK DPATASELMK NQRPESEAFS RGAGAEGGMK RPYFFGNTGV DTPGGMSGTS PGMGTFGLEG KEVGSKMPTA LGSSNKTPIP TQAPNPVGLP ESPRQGAPQP KVDSDQEVQY HIHKPHHKFP FSHAEAEAKQ REGMEDARAA ALAQLEAAQH IVDAKLESEM MEILGMMSGA RLDGVNAAGG PGGNAPKGPR QTFDTNTSGP SFGHILHQDG VPISACPSMR VAPEGFASGR PLTELSEVFR IPFGVPIGMS IPRPLMVMRD SAGAPGDLGL GMPALLNIIQ GHGEGSPLRR KQEDRSAYVE TVEDVSLSKV DCELTVAANP SMQEEEVLSP IRTDVRTEVP SKTSSPAEGA AAGFGSTAQS VPQGDVFPAK TESNVLGDPS TAKSNAGRIG FTAHSPSQPT QPPTEQLMAS TAHPLKTSTH HAHHSAAASS KVGSKTPGAK SKVPSKAASK AATTAASKLP TALKAPKARV PSVGSQATTV RLPPATYTKG ERAMSHAHVY PQSQARSQVP FPTQSQCEYT AVPEAPSPGS QAQSPPQLDQ KTIYSQVRKK NTPTIIPGQF RAQEYFPSTQ SANDAVEHHH ELEKMQGATG EGSIVASQHG SVAPTTMSNV KRIVDASRYH DETLCQLLDA ARLNLIGEEA KKALQRAARA RVIELRELRA AGELEVGLLS PAVVLPSHAE ETPKRAKRSK SKEKGRTVSG KSANSAKSEK GAAKQEPPIW AQDIMNRLSM FDHRFAALET QKDTRGPHKS QVSRDMRSKL IEELIFNDLH SSIFTNGINT PHGSTNSVPF GQSVAAPMPA APLTASEDIS FSLGQKQRAA SDMPSHYSHK PLSQHVPGIA TCTQYGGTQT PVQYAPSQVP ATQYVPTQPP PTQYSQQAGA ASQVRSVGPL TERRPSAQGD LIWGSEVELP HPNESISFPH IAGPTINVLP PTESNMGNNK ASTRASPRSR TFSASKSHPS TTVGGRMAPD DGRIDVITEQ PLSVGAMPDP REKELPPQPS ESVKNQPPDP TLFDNMTAAQ RTQPGTVTGS AIPKNMAQSA YSYGPPDQYP PSAPRLRTAN TQVQSAADVG VNAGIVTGSI YGTAIEPPTT SFFPGQTMRH IAVPGTTVPP APVSPMHSDP VPPTSQPFSE PLSTIHMRLH ETKSSPFTAT APGKSTSAPR DCIHVVDPLF SPMAGWKPWD MLTQRLYSWA LIMEEKSFVR ALEDISLGRQ VEAFPLSVFL MLAYKRWVRR NLSETTAAPC DKLFVPPNLA VAINIAVHSR RYHEAKEILL ELWDCLGMKE HPRIIVALAP LGDEVSSPMF MRASHTLLRF GLMVALSGWE AIRQAWPELQ IPEMQELEKR GGQRIINEHR PPEYKHDNSL YAANISRNLL LGYRPERQHD LVKQREIIWA EVKRLLHKKR TGRLVVEPDS PEHLYDT
|
| |