Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI00270 |
Symbol | |
ID | 3259614 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | - |
Start bp | 65484 |
End bp | 68782 |
Gene Length | 3299 bp |
Protein Length | 1053 aa |
Translation table | |
GC content | 51% |
IMG OID | 638258511 |
Product | hypothetical protein |
Protein accession | XP_572967 |
Protein GI | 58271622 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.198438 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACCGCC ATACACCGAC ACCCTCATCC CATCTCACGT CCTCTCCCCC TCTCTCGTAC ACCTGGTCTG ACACATCCGG TGAGAACGAC ATGCGACCCC GCGAAGGCAA CCGCGCTCCC AGTACAAGCA CATCAGTGTG GTCCGACGAA GGTACTGCTG TCCCTCGACA TGGCGATGGG GATGATGATT ACGGGTATGC CGATGAGGAT GGTCATGAAC TGAAAGGAAA CGGGGTGAAG GAAGGGAAAC TGATTGACTA TGACGACGAG CCGCAACCGC ATCAACGAGA GATGGCGCCG GGCGATAAAC TGAGAGATTT GCTACGGCAG ATGGATGAGG AAGTGCGTCG GGCAGCCCCA AGCGAATCAA AGGAGACGGT TGTGGCTCGT CAATGGATGG GATATACAAG GCCACGCCAA TTTCAACCGG ACGCTCGTGA CGAAGAAATA GAGGAGGAAG AGTTAGAGGA GGACCAAGAA GAGATGAGAG AAGGGGCATC TCCAGCGAGA TCGTCTTCTG TCACGGAACA AGAAAGCCCG CCAACTCCTC CGCCGAGATT TGGAAATCCT TATTCTAGAA GGTCAGAACG AAGAGGATCA CCTGCTTTTG CCCAGGGCAG ATTACCCTCC AGAGCGGCTG TGTTGCTTCA AAGTGTGTCA TTTTTCTGTC TGTAGACCAA GGGCCAAACT GAACTTCGTA GGCACATCTC ATATGGACGA AGGCTTGCGA GCAGTAATAG AACCCGAACC AGAAGCCAAA CACCAACAAA AATCACCACC GTCTCAGCTT CACGCATATA TTGCCTCCCA TCCTGAGCTT CGCTCATCAT CATCTTCACG AACACATCAA CCCGTTCCGC AGTCATCCCG CTCAAGGAGC AAGAGGAAAG AAAAAGAGAT AGAACAGGAT TACGAATCCG AACACGAACC AAGCTCGCCT TCTCGGCATC ACTTTGAGAA GGAACTATCC CAATCTGCAT CTAGACGCCA TTATCGTACA GCCTATTCCC CGTCCCGTGC CTCCCCACCA TCAAATGCAC TACCATCCAC CGCTTCATCA CACCGTTACC CAACCTCACC TCGGCCCCTT GAACCCAAAT ATCAACACCG CCCTATACCA CCAACTCCGC GCCGATCAAG TATAGATCCC CTACCATCAA CCCACGATGA ATTTGCGATG CATCTCAAAG GACTGGAGGT GAGAGCAGAG GTGGAACTAG ACTTGGATGA AGGCGTCTCT GCTGTGGGAT GGGAAGAGAG TACTGTTGAG GGCGATGAGG AGGAGATCAT GTATGAAGAT GGTGTTGATA GGAGTGGGCA CAGGGACAGG AGCGGGACTG CCAGACCTGA TTCCTCTAGA GGGGAAAACC TATACACACA TTCTCAATCA CGCGAAAGCT TGTATCAAGC TGCTTCTAGA TCCCACTCAT CCCCATTATC AAGGCGCCTC CCAACCAGCT CCCCTTCTCC ACCAGCCCTG CCCGACCTAC CTTCTCATTC TATAAGCGGG AGTGAGGAAG AGCAAGAAAT TGATACATAC TCGTTAAGAC GGGCGAGGAT GTTTAGTGGG GAGAGAAAGG GAAAGATGCC AATGGTTGGC AATGGGAGTG ATGCCTCGGC ATCGGCATCG GCATCAAGAT CTAGGTCATA TTCTCGGTCG CTGTCTGTAC GTTTGAATGA AGACGAAGAC AAGGAGGCAA CAAAAGAATA CCAGATGGAG GATGAAGCGG TGAAATACAA TGCCCCAGGA GAGAAATTCT TAGAAGAGGT GTGGGAAGGC GGGGGTGTTA CAATAGAGAG GGAACAGTCA AATTATCATT CTTCTGATCG GAGCTCCCGT ACTCCTCTGC CCACGAACCA TCGCCATACT CCTTCCCCTG TCCGCCTGCC CGTTCGTCAC AGTCGTGCAC GTACGCCTCC GACCACTTTT CAGACGGGTA CCACACCCAG TTCGACAACC ACTTCGCCTG GGCAACGGCA ACCTCAAACA TTATTATCAT CACGAGCGTT GTCGCAATCA CAATCCCCGA CCCAAACTCC CAAATTCCCT ACCCCGCTTC CCTCTTCGCA ATCGAATGCC AATCCAGTAC AAAAGCAACT TGCGACCCCT TCTCATTCTC ATTCACAAAC TCTTGCCCTC GGTCCGACGC CCCGACCCCC AGGGGCATGG AACATTACTC CCTTTCCGCA ACGTCCACAG CGCGAACCTG AGCCACCACG CCATCAAGAT CAGATAGCAA GTGAGAGTGG GAGTAAAACA GGGAGAGTGA GGTTTTCCCC GTTGAGATAT GAAGCTACGA TACATAGAGA GGATTATGAA AGAGAGGATG AAATGGAGAA AAAAGGGAAA AGGGAAGAAG AGAGAGGTGA TGTGTCGATT CTGACTTTGA GTTTATCTCC TCGGCACAAA AACCCTAAAA GTCCCAAAAG TCCAAAGAAT TCAAAGAGCC GAACGAGTCC TGAGAGGCAG ATAAGAGAGG GAAAAATGGA CGGAGACATT GGAGATATCA GTTGGACCGC CAAGTTGGCT AGAAGTGTTA CTTCGTAAGT CTGCATCATA TCTTTTCATA TCGGCGAATT ACGTGTCTTA TGCACGTATC AAGATGACAA AAGCTGACAG AGCTGTGAAC AGGCGAATAT CCATCCCTAT GCGGTCGACG ACATACAATT CCCATCTTCC GCAGCTCACT TCTGCCCACA CCCACACTCA AACAGCTTCT TCCAAACTCC AAACCGCCCA ACAATCATGG CTTGCTGCTC TCTCCTCCAT TTCCACTTCT CACCCACATC TTCACCAACA CAATCAACAC GCCCCTTCAA ATTCCCCAAC CAGTCTGGCG TTAAGGACAT CACAGACTTT AGGCAAAGGG ATGAACAAAG CTGTCGGCTG GAGTCGCTGG GTATGGTGGG TCCTGATGGA AATGGTACTC CTCTGGGCAG TGTTTAGGGT GACCCTGGAT TATGCCGGGT CCGGTGTTTA TCTCGGTAGA GATCCGTTCC ATCCCTTATC ACGCCCTCTT GGTCTAGGCC CTCCTCCGTC CCTTTCATCA CCTCCATCAT TTAGTGTTGG TGGAACGCCA GACATGATGG AGAGAGACAG GGAGACGAAC GCACAGCGTC AATGGGCCAA AGCGTCACGC TATGTGAATC TTGAAATTCC TATACCAGGT AGTCTGAAGA CTTTGGTAGG TAAACAGGGA AGCGCGAATT TCTTTGATCT AGTGGAGAGC TGGGGTTGGG GCACAGCCTT TTTTGGGGAA GGAGAACCGG CTGGAACCTG GGCTGGAGTG CCGACATAG
|
Protein sequence | MNRHTPTPSS HLTSSPPLSY TWSDTSGEND MRPREGNRAP STSTSVWSDE GTAVPRHGDG DDDYGYADED GHELKGNGVK EGKLIDYDDE PQPHQREMAP GDKLRDLLRQ MDEEVRRAAP SESKETVVAR QWMGYTRPRQ FQPDARDEEI EEEELEEDQE EMREGASPAR SSSVTEQESP PTPPPRFGNP YSRRSERRGS PAFAQGRLPS RAAVLLQSTS HMDEGLRAVI EPEPEAKHQQ KSPPSQLHAY IASHPELRSS SSSRTHQPVP QSSRSRSKRK EKEIEQDYES EHEPSSPSRH HFEKELSQSA SRRHYRTAYS PSRASPPSNA LPSTASSHRY PTSPRPLEPK YQHRPIPPTP RRSSIDPLPS THDEFAMHLK GLEVRAEVEL DLDEGVSAVG WEESTVEGDE EEIMYEDGVD RSGHRDRSGT ARPDSSRGEN LYTHSQSRES LYQAASRSHS SPLSRRLPTS SPSPPALPDL PSHSISGSEE EQEIDTYSLR RARMFSGERK GKMPMVGNGS DASASASASR SRSYSRSLSV RLNEDEDKEA TKEYQMEDEA VKYNAPGEKF LEEVWEGGGV TIEREQSNYH SSDRSSRTPL PTNHRHTPSP VRLPVRHSRA RTPPTTFQTG TTPSSTTTSP GQRQPQTLLS SRALSQSQSP TQTPKFPTPL PSSQSNANPV QKQLATPSHS HSQTLALGPT PRPPGAWNIT PFPQRPQREP EPPRHQDQIA SESGSKTGRV RFSPLRYEAT IHREDYERED EMEKKGKREE ERGDVSILTL SLSPRHKNPK SPKSPKNSKS RTSPERQIRE GKMDGDIGDI SWTAKLARSV TSRISIPMRS TTYNSHLPQL TSAHTHTQTA SSKLQTAQQS WLAALSSIST SHPHLHQHNQ HAPSNSPTSL ALRTSQTLGK GMNKAVGWSR WVWWVLMEMV LLWAVFRVTL DYAGSGVYLG RDPFHPLSRP LGLGPPPSLS SPPSFSVGGT PDMMERDRET NAQRQWAKAS RYVNLEIPIP GSLKTLVGKQ GSANFFDLVE SWGWGTAFFG EGEPAGTWAG VPT
|
| |