Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC02140 |
Symbol | |
ID | 3256606 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 603695 |
End bp | 607033 |
Gene Length | 3339 bp |
Protein Length | 1065 aa |
Translation table | |
GC content | 54% |
IMG OID | 638255435 |
Product | hypothetical protein |
Protein accession | XP_569483 |
Protein GI | 58264654 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.271772 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCCAGCAGCG TGGACTAGGA TGGCCTCGGT AAAGGCTCTG CTGCGGACAC CCGCCTACAA CCCCCTCCCG CTTCCGACCC TCCCGTCCCC CGAGCCGCAC GGCCGAGACG CCGAGCAGCT CCCACCGCTC GCAGAAAGGT ACGCGCTCGT CGCCCACGCT GACAACCGTG GCTTACCTTC TCTTGCCAGC CTTGTTCTCC CATACTCCTT GATACAGACA CGCCACCACC ACCTCACATC CCTCCTCCCC CTCGCATTCT CCCACCCGCC CATACCGTCT CATAATAAGC CTAGGGCAAA GCCCTTGCCA AAAGGCGTCT TTGTGTTCCC GACCCCGCCG CCTCCCGTTC ATCCCCTCCC AACTAGAGAA TATCACGGGC CCAGAGATCT GCTCAATGAG GCCTCTCCTT CTGCAACGCC TCCTGCCAAC CCAGCATCGA CACCTGTCCC ACCGCAACCC AAGGGGCGCA AGCGTTCAGG TGAACCTGAA CCTCCAGCCC ATGAGATCGA GTGCATCGCG CGTGTGACAT TGGCTCTGGG CCCAATGTCT TTCCACGGCA CAGAGCTATG GGTCGGGCGC TTTGTGGAGC CCAGAGCTAA TAAGCCCAAG AAGGAAAGAG CCAAACCAGG CGAGCGAAGG GAGAGGGAAA AAAAGAGGCG AGAAGGGATA GAAAAGGAAC GACAGAGAAC ACATTCAGGT CCGAGTGCTA CACCTAAACC CGCCGCTTCG GTGGCAAGGC CAACAGTCGC CACTGCTGGG CCGTCCGCTC CTCGCATGCG GCCCCCAGCA CCGGGACCGG CCGTGACAAA CCGCACAGCG GCTTCGCCCC AACTCATCCA ACTCGTTAAT CAGGCTGCCT CTCGTCACCC ATGGCTTTCA TCTCTTATCT ACAAAGCGGC GGGTAGCACA GCCAACCAAG ACGAGTTGGA AAGATTAGGG AGAGCAGTGG CAAGGCTCAG CAAAGGAGAA GCAATCGACG ATCTAGCACC GCAGCGTGTG AACATTGCTT CTAGTGAGGT TTTGAAAGGG AAAGGAAAGG AGACTGCTGC AAGTACTTCT CACGCTTCAA AGACGTCTGT TCCCGCAGTT CCAGGGCCAT CAAGTCTATC AGAAAAGCCC ACATCTGCTC CTGCTCCCTT GCTTTCTCAA AATACTTCCT CGTCAACTCT TACACCCGTC GTACCGGCTG AAAAAGACAA GGAAGACAAG AAGGATGATG CAGAATCTGA TTGGGACAGT GAAGTTGAGA TGAAAGGTCC TAAACAGGTC GGAGGAGGCC CCATCGGCCC TTCCACTCTC GATTCTGCGG CCGCAGCATC CACCATACCG TTGACATCTA CCGCTACTCA ACCTTCCTAT GCACCCAGCT CCGTGCCCGC TTCCCAGGCG GCCAATCCTT CAGTCCCTTC TCCTGTCCCT GGGATTGTCC CCTCTCCTAT CCCTGTGTCA GCATCAACAT CTTCTGCAGT GCCATTGTCC CCGCATGCGC CGCCTCCTAA ACCTAACCTC CCTAATCCAC CGCCATTCTT GTTGATTGCA TTCAAAGAGC ATCCCACAGA CAAATTCTTA ATCCCCTTGG GATCAAGGAG TTTTGTCAGC CGTGTTGGCG GGGATTGGGT TACTAGCAAA CCTCCCCATC TTGCTCCCGA CACTACGCTG CCTTTGGGCC AATCCCTGGA AACCAGTGGC AATACAAATC CAACAGTGGC TCCACAGCCA GCAGCCCAGG CGGCCATCTC CAGAGAACTG CAAGCTTTAC AGTCTTCTGC AAAATCGTCT TCTTTTGAGC TAGAGGCTCA GTCACACTCA AAACGCAGAG GACGTACCCA TGTTCGCCCA ACCAATGCCA ATTCCCCCTC TCCGGCACCA CCCAAAGTCA AGACCAAGCC CATCGAAGAA CCCAGCTTGA CAACAACAAC AACAACACCA GAGTTTCCGC CTCTTCCTCA ACTTCCCGGT CAAAATCCTC CCCCTGGAAC AGTCCTCATC TCTACTCTTG TGCCGGCTAA TAAATGGAAC AAGGTTGACT GGGCGTCATT GGGCAAAAAA GTACCTTGGT CTGAGGACTG GAACAGTAAA GTCAAAGCTG GGACGAATGA AAATGTCAGA GAGGAGGAGC CTTTGCCATT ACCTTTGTCA GATGCCTCAT CCCAGAACCA CGTCCAGTTA CTTAATCTTG CTGCTGAAGA CTTCCTCCCT GAAAATGGAC CTCTGAAAGC AATCACGATC AGGTTGGGCC AAGTGGACGA TCAGATCTGG GGGAGAATGA AAGACGTGAT GACCTTGGTC GACCGGGCGG AGATCATGGC CTTGTCGGCG ATGGGCGTCT TGCCGCCTGC TCCAGACTCT ATAACCCACC CTGCCGACGA CCCAGAGATC AGAGAAGCAT ACCTCTTGCA CAAAACGTCC CTCTTTTCAT CTCTCATGGG CCGCACTCGA CAACCCCGTC GTTTCTTACA CACCCGTCCG TCTTCTCCGC CTGCTGCTCT TGTAGATGCA ACAGTAGATA AAATGGCTCC GCGTCCCTAT CCCATATCTA CCAAACCGCT GTATCATGTC GAAGAGGGTG ACAACGAAAT GGAGGGGCGA CGGGATAGCG TGAGGCAATG GTCGCCAGAT GTAGAATTCG ATGACGGTCT CGGGAGAAGG AAGAAGAAGA AGACAGCTCA AGAAACGGTA GGATTCGAAA TGCCCGTCTC GTTGGAAGCG CTTGATGAGC GGGTGGAGGC CAGTGCTCAA AAAGCACTTT TGGGAAAGCG CGGAAGAGCC GGTGGAGGAG AAGGAGCAAG GAAAGAAAAG CAAAGGCGGG GAATTGAAAA AGGAATATGC GAAGGCTGCG CAAGAGAGGG AATTAAGATT TGGAGAAGAG GACCGAGTGG AAAAGGAACA TGTACGTCTG GTGTCATTGG AGTTCATGTT TGTTAAAAAA TGGATGAGCC TTACTGACGT TTACGGTTCC AGTGTGTAAT TCATGTGGCG ATCTTTTCAC TGAGGGGAAA CTGCAATACA GTGACTTGAA GGCCCCTGGA GCAATGAAAA CTCTGTTGGC TGCCAACCAG GACGTCAGCG GAGCGGACGC CATGCATAAT CAGGTGGAAG AAAAGAATGA CAGTGTGCCT GTCAAGGCCG AGCAAGGTCG TACGACCGAA GAGGCTCTCG GGGACGGCAC AGCCATCAAT TCGGAAGAAC AGAAAGAAAC ATCTACCCAG GTCGTGCAAG CTGAGAGACC ACCAGAACAT TCGCCACAGC ATCATGCTGT CGAGAGTCAA CCTGCAACGC AAAGCTTGCC AGAAACGATA GAACCAGGGG TGGACATGCA GAATCAAAAG AGTTTGTAA
|
Protein sequence | MASVKALLRT PAYNPLPLPT LPSPEPHGRD AEQLPPLAES LVLPYSLIQT RHHHLTSLLP LAFSHPPIPS HNKPRAKPLP KGVFVFPTPP PPVHPLPTRE YHGPRDLLNE ASPSATPPAN PASTPVPPQP KGRKRSGEPE PPAHEIECIA RVTLALGPMS FHGTELWVGR FVEPRANKPK KERAKPGERR EREKKRREGI EKERQRTHSG PSATPKPAAS VARPTVATAG PSAPRMRPPA PGPAVTNRTA ASPQLIQLVN QAASRHPWLS SLIYKAAGST ANQDELERLG RAVARLSKGE AIDDLAPQRV NIASSEVLKG KGKETAASTS HASKTSVPAV PGPSSLSEKP TSAPAPLLSQ NTSSSTLTPV VPAEKDKEDK KDDAESDWDS EVEMKGPKQV GGGPIGPSTL DSAAAASTIP LTSTATQPSY APSSVPASQA ANPSVPSPVP GIVPSPIPVS ASTSSAVPLS PHAPPPKPNL PNPPPFLLIA FKEHPTDKFL IPLGSRSFVS RVGGDWVTSK PPHLAPDTTL PLGQSLETSG NTNPTVAPQP AAQAAISREL QALQSSAKSS SFELEAQSHS KRRGRTHVRP TNANSPSPAP PKVKTKPIEE PSLTTTTTTP EFPPLPQLPG QNPPPGTVLI STLVPANKWN KVDWASLGKK VPWSEDWNSK VKAGTNENVR EEEPLPLPLS DASSQNHVQL LNLAAEDFLP ENGPLKAITI RLGQVDDQIW GRMKDVMTLV DRAEIMALSA MGVLPPAPDS ITHPADDPEI REAYLLHKTS LFSSLMGRTR QPRRFLHTRP SSPPAALVDA TVDKMAPRPY PISTKPLYHV EEGDNEMEGR RDSVRQWSPD VEFDDGLGRR KKKKTAQETV GFEMPVSLEA LDERVEASAQ KALLGKRGRA GGGEGARKEK QRRGIEKGIC EGCAREGIKI WRRGPSGKGT LCNSCGDLFT EGKLQYSDLK APGAMKTLLA ANQDVSGADA MHNQVEEKND SVPVKAEQGR TTEEALGDGT AINSEEQKET STQVVQAERP PEHSPQHHAV ESQPATQSLP ETIEPGVDMQ NQKSL
|
| |