Gene Information |
Locus tag | CNC03930 |
Symbol | |
ID | 3256310 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | - |
Start bp | 1218789 |
End bp | 1222036 |
Gene Length | 3248 bp |
Protein Length | 738 aa |
Translation table | |
GC content | 45% |
IMG OID | 638255614 |
Product | hypothetical protein |
Protein accession | XP_570008 |
Protein GI | 58265704 |
COG category | |
COG ID | |
TIGRFAM ID | |
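The Gene Length value above is consistent with reading Start bp and End bp as 1-based, inclusive coordinates on the replicon. That reading is an assumption, as is the idea that the coding region includes a stop codon. Under those assumptions, a minimal sketch of the arithmetic follows. The function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def coding_length(protein_aa: int) -> int:
    """Coding bases for a protein, assuming one stop codon is included."""
    return 3 * (protein_aa + 1)


# Values taken from the record above.
length = gene_length(1218789, 1222036)   # Start bp, End bp
coding = coding_length(738)              # Protein Length in aa
non_coding = length - coding             # introns and any other non-coding bases

print(length, coding, non_coding)
```

Because the record marks the gene as spliced, a positive difference between the gene length and the coding length is expected. The sketch does not say how that difference divides between introns and other non-coding sequence.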
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CATTCCTGCT CCTAGCCATG GACTTCTATC CTCCTCAGCT GGCTGGACAA TCCAGTACAA CCACCTCGCT AGAACACCTA GGATCCTCCA CCCAGGCCAA TCAACAGCCT GTGACTCCTG ACACCGTGAC CACCGAAAAT AAAAGAAAGA AGCGCACAGG AAGTAGCGGA AAGGATGGGA CCAAGATAAA AAAAACAAGG CAGAGCCGTG AGTAATTATA GCTTGCGCTG CTGTTGCGGT CCCATAATAC TGATGTTGCA CAATAGAGTC ATGTGATGGT GAGTATGGGA CGAAAACCCT CAATACTTCT CGTCTCTCTC CAAATTTGAT GAAACTGATG AAGATCTTGG CTATTAATGA TTCTTTAGCT TGCCGCGCCC GAAAGTATGC GCCTTCCCTG CCAACTATGT CATTGTTGAT TGCTTACTTC TGTATCGTCC AGAGTCAAAT GCGACAGACC GCCTCCGGGA AAAGAGACTG ACGGTCCAAT CAAGAACATG TGCAGCCATT GCGCTCAGCT GAATCTACCA TGTGCGTGTT TGGAAGTTAT CTACTTGAAA GTACTTTGTT TATCTTTTGT TCACGCAGGT ACTTTCGATT ATGTGCAACG GAAAAGGGGT CCCCCCAATA TGTAAGGAAT TCGCTCTGTC CATAGGCTCG AATAAGCATC ACTTACTTCA TCTTCGACAG GTATTTGAAG CGTATACAGG AAGATCAGCA GGTTGAATGC AACGAAAACG GAAAATCCAT CTCCTCAACT GTTGGTTCTT CAAAATTCAG CTCTGAAGTA CCCGCAACTA TACCACGTGC CCCTCCACCG CTACAGGCTG CGTTGCGGGG AACACCACCT GCGACGTCGG ATAACACGTC TTCTGAATGG CAGCCGACCT TGGGCATGAC TATGTCTACT ACACGCGCGG CTCATGCTGG ACATTACCCT ATCATGTCCA TTTCTGCCAG TTCGTCTCCC AGTGTCCAGG CCGCTACAGT ACCTCTCGGA CTTTCACCTC TACGTTCAAG GAGTTATCTC CCTGTCTCTC TTGACCCCTC CCATTCACTG GCTACAAGTC AAACTCCGAG TGCACAATCT TCACCGCTTC ACTTACCTCA ACACCTTGCT TATATCAACC ACACATATGA CCCGCGAAAT CCTCTTGATT CTGTCCTTCC ACGTCGGCTT CTTTACCATA TCATTGATCT ATACTTTGAT TATATTTATT GCTTAATTCC TTGCTTGCAC AGACCATCAT TTATCCACGA TCTGAACACA AAACGGGAAG AAAACCCGGA TCAGGAGGAA TGGGTGATCT TGGTTCTGGC TGTTGTAGCA AGTACGTTAG TGCAGCTTCC GAGAAGCTTT GTCGATCTGC CCAGAAATGA AGTCAAGGAC CTCGTTTTAA GATGTCACAA TAGAATCAAA GACTATTTAG CACGAGATTT CGATACTATT ACCGTGACAA GAAGTGAGTC TTCTCCCGAA AACGTGGTGA TCATACTAAG AGCAGATGCT ATAGCCATCA TTATCTATCT CAGCTTGTAG GTGACAGTTT ATTGACCATT CTTGTTGTGG GTGTACGGGC TAACAGCTCA ACGATGGTCA ACTAGATACG TTTATGGAAT CACTGGGCAC ATCGTAGTCA GCCATGGATT ATTCGGTCAG AATTACGTTT TCATGCTTGC TCTACGAGCC CATGAGGAAG GTGTAAGCTT TCTTCCTTTG TTGCGCCAAC GGCAGACTGA CTCATTGGCC CATCAAGACA TACGCTACAC TGGGAAACAT TGAGCGGGTA CTTTTGAGAC 
GGATGTTTTG GCTCATGTAT GGTGGGGATA AGACACTTGC TTTCACAGGA GCTTTCCCTG TATTATTCCA TGAAGACGAT TGTGCCAGTG TGGCGCTTCC CGACGACATG TAGGTAAAAA GTATCGTGTA GTGATCGAAA CTGATCTGTT TCAATCAGTG ATGATGAATA CTTGACGGAA GAAGGATATA CAAAACAACC CGAATCCTAC ACTTCAGTGT TGAGCGGCTT CCGCTATATC AGCCATCTTT TTCGCGGTAA GTGAATCACC AGGTGTATAT GTTGATATGA CGTCTAACTT CATGATATAG TTTCAGGAGA AGTGTTGGAT AAGCGTCGAC GAGATAAAAT AAGGTCGCCG TCTGGTTTAA TGCTCCAAAT GAGAATCAAT GAGATTAATG AGCTTTACAA CCGTACGATG TCCATCATGG ATTTTTGCCC CGCACCTTTA AAGTTAGATT ACAGATCCGC AAGTGCATCT GTCATGTGAG TCGTCTCCTC GAGCTGCCTA TTACAACCAA GGCGATGGGC CAATTTCAAG CATAGGTCTA TGTCTCCAGA TTGGGACGAA AGAATCAAGA GTGACATTCA TACCATCTTT TCTGACCCCA ACGAACATGA CATGGATCTC GTCAAGGACT TCTACTTAGT TCAGCAGGCC AATATCTACG TCACACAGGT GAGCAGAGCT TGTGTTGTTA TTTGTATACC AATGCCGATT ACTGACGAAC AAGAACAGCA ACTAGTTCGG TTCATAATTA TCCAATATCG AGAAGAGCTC CTCGAAATTC AGCAAGACGA AGCACACCAT GGATTAGACC TCGCCCAAAG GGAAGCGATG AAACGCTCAA TCAGAGAACA GACACAAGAT GAAAAGGATG AGGTAGTCGT TGACATGCTA TCTATCTTGC AAAAAATTCC AATTCAAGTT CTCGGTAGGT GCTGTTTTTG TGTGAATATC TCATTCGTGG CTTCAATTTA CACAACATAC AGCGGTGAAC AGTTTCACTA TTATCGAGAA GGTTCGATCT GTTGCTTCGA GTCTGCTTGA TTTTTTGGAC CATGGTGAAG ACATGGGGTT GCTGCCGCTT TCGTCTCATG AAACAAGAGC GCAGAAAGCC CAGAGAAATC TTTGTGCGTA AACTGCCCAT TGCATCTTTG CAAGATTGTC TGACTCTGAA CTAGGGAAGT TCCTAAATTA CTTGTCCGAA ATCGAGAGTA TGTACTCCTG GCATGACGAG AAAGGGACGG GTAAAGGATT TTAATTACCA GCGAGCCGGG CAGAAAAAGG TGGGGGGTTA GGGACAGCTG AGGAGAAGAC CCCGTTGGTT GTAAAGCGTT GTATCGCCAA ACGACTCATA TTGTTGCTGT TGCTATGGTG ACAGATGTTG TATCTGCAGG GCATTGCTTT CACAGTTGCA TTGCATTCAC AGTCTTAACT TCACATTA
|
Protein sequence | MDFYPPQLAG QSSTTTSLEH LGSSTQANQQ PVTPDTVTTE NKRKKRTGSS GKDGTKIKKT RQSQSCDACR ARKVKCDRPP PGKETDGPIK NMCSHCAQLN LPCTFDYVQR KRGPPNMYLK RIQEDQQVEC NENGKSISST VGSSKFSSEV PATIPRAPPP LQAALRGTPP ATSDNTSSEW QPTLGMTMST TRAAHAGHYP IMSISASSSP SVQAATVPLG LSPLRSRSYL PVSLDPSHSL ATSQTPSAQS SPLHLPQHLA YINHTYDPRN PLDSVLPRRL LYHIIDLYFD YIYCLIPCLH RPSFIHDLNT KREENPDQEE WVILVLAVVA STLVQLPRSF VDLPRNEVKD LVLRCHNRIK DYLARDFDTI TVTRTIIIYL SLYVYGITGH IVVSHGLFGQ NYVFMLALRA HEEGTYATLG NIERVLLRRM FWLMYGGDKT LAFTGAFPVL FHEDDCASVA LPDDIDDEYL TEEGYTKQPE SYTSVLSGFR YISHLFRVSG EVLDKRRRDK IRSPSGLMLQ MRINEINELY NRTMSIMDFC PAPLKLDYRS ASASVMSMSP DWDERIKSDI HTIFSDPNEH DMDLVKDFYL VQQANIYVTQ QLVRFIIIQY REELLEIQQD EAHHGLDLAQ REAMKRSIRE QTQDEKDEVV VDMLSILQKI PIQVLAVNSF TIIEKVRSVA SSLLDFLDHG EDMGLLPLSS HETRAQKAQR NLWKFLNYLS EIESMYSWHD EKGTGKGF