Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG03800 |
Symbol | |
ID | 3258663 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | + |
Start bp | 1065815 |
End bp | 1069669 |
Gene Length | 3855 bp |
Protein Length | 1026 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258003 |
Product | conserved hypothetical protein |
Protein accession | XP_572084 |
Protein GI | 58269856 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATATTGGCAT ACCTACCGGT AGAAGTGCAA ACTGTTGTAT TCAGGACGAT AAAGACGGCG GATCGACATC ACGACTTACA AGGCGTCTTT TGTGCTTCTA CCAACAAGAT CGTTTTCTCA GTTGTTATTT GCAAGGCAAG ATCAACGGTT AGTTGCGAGC CAAACAACAT AACGACGCTT AGGCCACCGA AGACAAGAGA AGGCACTCAA TCTATACATA TGAGATGACA AACCCCCCAT CACCAACACT CTCTTGTCGT CTCACTCCGT GTATACCGTC TTCGTCAGGT TTAAGCACGC CGACTTTTGG CGCTGGCTCA CAATCTGGAT ATGACGGAGA AACACCTCAT GTTTTGCAAA GGGCAATAAT GCACGGCGAA GATGTCCCGG AGTCGCCAAT TATAGATCGA GATGCTGGAA CGGACTTTCC AGATGTGAGT GGTGATCAAG GAGGGTTTGC AGGGTAAGTT TATTTCGACA AAGAAGAGAA CGTAAGCTGA CTATTCCAAG TCCAAGTAGA TCTTTTAGGA AAGCACCTAC ACCCCAATTT CTTAACGATC CTTCTCCTCA AACCATATTG CCATTATTAG AAGAAGAAGA GGTAGCTCAA GAGCTTAGTG AAAATGTTGA CGAAGATAGA GAGCTTCGGA CAACTCTGAT CCTCCGACAT GCATTACAAT GGGGTATTGA GCGGGGAGAT GTTGAATTGG TCAACTGGCT GGTTTCTCTC AATGGCCGAT GGGTAAGTTT GATATTAAAG TTGATAAACG TGTGTCCGAG GTTAATGTTA TCATTGTGGT TCTAGCGAAG TATACTAGAT CATGAGATAC GAGATCTAGA AGATGAAGAA GGCTGGGGAA TAGTCGGTAT GGCAATACAT CACAGCTGTG GTAGACAGGA CAGAGAGGAG ATTGTTAGGG CCGCAGTGAG TAGATGGGGG GTTAGCGCAG GGCCCCGAGG TGGCCGTGAT CGTCGTAGGT ACATGATCTT ATGACGCCCA AGCTTAAGCT GATCAAGGTG TTAGGTGGCT GGACACCCTT GCATCTGGCT GTACTGGTCT CAACGCCTCC GTTAATCTCC TTTCTTCTAA GCCATGGCGC TTCACCACAC CTATTGACAA ATCGTGGGCT CACCCCTCTC GATCTCGTTG CAGACATACC AGGCCGCGAA GCCATCACCC TATTCCTCGA ACACGCCATC TCCGTCCCCC ACCCTTCCAG CCCTCGGACA TCGACCATCT TCATTCCTAA ACTTCCGCCT GCAAGGCAAA AGATGTTGAA AAAACGGAGA AGACGGGCCA GAGGTCATCT GGAGAAGATG GAAGAGCAAG ACAGGAAAGA GAAAATACGG ATAGAGAGGG CGAAATGGAT TGTGGACCAG ATAAGAGCGG TAGAAGTTCC GCCGGAGCTG GTTTTTGGGA AGGAAGAAAA CTCAAAACGG AAAAGAGAGG ACGTAGGGGG ATCGGGGTGG TCGGGACAGG ATATCGAGGA AGATGATGTG GACAACGAGG ATGATGACGT TGAAGAAGAT GAGGTGGGTA TCTGCACTGA TTATCAAATA CCATAAATTG ACCAAATCAG ATGAATCCCG CGTCTATCGC CAACATGCTA GTCTTCTCTC TCGTTAACCT CCCCGAAATC TTTGACATCC TTATCAGCAA CTATCAACCG ATCTCTCAGC CTTTATCAAA GAGAACACTA CCAGCGAATA TGTTGTGTTA TTATGCGAGG TTTGCTTATC ATATGTGCGA TGAGACATGG TTAGAGGCGT TGATAGAAGG AGCGATTGAA AGGATCGAAG AGGGTGTATA TGTGAGTTCC ATAGTTTTCC TATCTCTGAG CGAGTCGTCT GACAACCTCA ACTAGGAAAA CGTGGAGAAT CTGGCATATC TTGCATTTTG GGCCTATAAC TCTTGTGTCT TGCTACATCA TCTGCAGGCG GATGAGGCGC TTCAAGAAGC GTGTGATGAG CTGGGCCTGT TGGCTATCTT GGAAGAGCTC ATCAATGCTA TACATGGTAC GTCAAGTATC ATTTCACATC ATTAGGGCGA TAGCTGACAT TCCATCCTTA GTGTTTGTGA TACGTATTGC CGAACGCCGA ATTGATACCG TACTTGACGC TGCTATCCTC GATTACGAAA CTCTCGAAGA TTTTAACGAC ATCCGTTTCG AAGGCGAATG GTCCTTGTTC CGATCGTTTG TATCCAAGAA GAAGCGCGAT ACACCCAAGG TAAACAATAT ATTCGCCGGG GTTAATGGTA GTCCGGGAGG AACGAAAGAT GCGGCCGCGT CGTCGATAGG ATCGACAGGA CTTAAAGTCC CCAACCGACC ACAATCGATG GGTGATTTAA AAACGATGCC GAATGGGCGA TCAACCATAC ACGAATTTAA TGCCAACTCT GATAATAATT CTGGACCAGC ACGCATAACA GAGATCTTAT CCGGAGTGCT AATTGTCCTT CAGCTCTACG AGGTTAACCC CGCGCTCATT GTTCAGGCTT TTTTCCAGAT CTATTTTTGG ATTGCCTGCG AATTGTTTAA CCGCATCCTC ACGAGAAGGA AATACCTATG TCGGTCGAAA GCTCTGCAAA TCCGGATGAA CCTCACTTTC CTCGATGACT GGGTGCGCGC CAATGGGTTG CCTGCTCAAA CCTCTACGAA GCATTTCGCC CCCCTCTCTC AGCTCCTGCA ATGGCTTCAG TGTCTATCCC GAATCACAGA GTTTGACACT CTCATCGGTA CGATGCAGAA CATGAAGGCG ATTAACCCGT TACAGATGCG TCGGGCTGTC CGTGAATACC GGTACGAGGT CAATGAAGGA AGAATGTCAG ATGAGTGTGC ACAGTATCTT GTGCAATTGC AAAAAGACTG GGAGAAACGA AGAGTACAGC TTAGTATGCA AGAAGCCGAG CGGATGAGGT CGGCTAGTGA ATGGTCGGAG GGGAGTTATG ATGGGACTGG GATGTCCGGA GGAAGTGGAT ATGAGGAAAG TGCGACCTTG ATTGACGCGC TTTTTGATGG GTCAACGATG TTGGCGGACT TTGTACCTCA TTCGGCTCCC GAGTCACTCG ACGAGTTGCT CGATAGCCGT TACATGCTTC CTTTCCTCCT ACCGGAAGAT AATGCCTACC TCATTGCCAC CCCTCCTACC GACGCGGCGT ACGCCAACCT CCATATCCCT TCAAGTCCCT TTGTCACTGA CACTCCCACT AAACGACCAC TTTCTCACAC TGGCTATTCG TCTTCTAGAT CGATGGGGTA CAAGATACCA AATATGGGTA GATTACGCGA CTTGCCGCCC GATTTTTTCA AGTGGATGAA AGAAAAGGAG ATAGAGTTGA AGCTTGGCCG TGAAGCATTG CGTGTCGAGA AGAAGACCGT ACCCGCATTG TCTCACCCGT TAGGACCTAG TCAAAAAGCG AATGTCACTA TCAACACGCC TGTTCGACCG CCTCCGGAAG ATAAGACTAA CCTCACTCCT ACCCGCACAC CCACTAGCAA TCGCCGAAAA AGTGGCAATT TGGCTTCCCC CCTCCCTATC GAAGGTGCAG AGCTTTCAAC TACCCTTAGC ATGTTGTACC CGTCCACTAC ACCAACCAAG CTGGGCGCTG GCTTACCCAG CCCGGGATTA AGGTCTAGCG CGTCAATAAA TGAGTTGAGG GAGAAGAAAT TGGCAGAAAG ACCGTTTGAA GCCATTCAAG AGGAAATGCC GAGTCACATC AGGTCAGAGA GCTACGAGAT GCGGATGCGA ACGACGGAAA TGCTGAGACA AAATAGTGCC GAATCGGTCT CTAGCTCGGG CTCTAGGTTG AGTTGCGGTT CAGGAGCTAA TGATGACAGC GGGAAGAAAA GGTGA
|
Protein sequence | MTNPPSPTLS CRLTPCIPSS SGLSTPTFGA GSQSGYDGET PHVLQRAIMH GEDVPESPII DRDAGTDFPD VSGDQGGFAG PSRSFRKAPT PQFLNDPSPQ TILPLLEEEE VAQELSENVD EDRELRTTLI LRHALQWGIE RGDVELVNWL VSLNGRWRSI LDHEIRDLED EEGWGIVGMA IHHSCGRQDR EEIVRAAVSR WGVSAGPRGG RDRRGWTPLH LAVLVSTPPL ISFLLSHGAS PHLLTNRGLT PLDLVADIPG REAITLFLEH AISVPHPSSP RTSTIFIPKL PPARQKMLKK RRRRARGHLE KMEEQDRKEK IRIERAKWIV DQIRAVEVPP ELVFGKEENS KRKREDVGGS GWSGQDIEED DVDNEDDDVE EDEMNPASIA NMLVFSLVNL PEIFDILISN YQPISQPLSK RTLPANMLCY YARFAYHMCD ETWLEALIEG AIERIEEGVY ENVENLAYLA FWAYNSCVLL HHLQADEALQ EACDELGLLA ILEELINAIH VFVIRIAERR IDTVLDAAIL DYETLEDFND IRFEGEWSLF RSFVSKKKRD TPKLYEVNPA LIVQAFFQIY FWIACELFNR ILTRRKYLCR SKALQIRMNL TFLDDWVRAN GLPAQTSTKH FAPLSQLLQW LQCLSRITEF DTLIGTMQNM KAINPLQMRR AVREYRYEVN EGRMSDECAQ YLVQLQKDWE KRRVQLSMQE AERMRSASEW SEGSYDGTGM SGGSGYEESA TLIDALFDGS TMLADFVPHS APESLDELLD SRYMLPFLLP EDNAYLIATP PTDAAYANLH IPSSPFVTDT PTKRPLSHTG YSSSRSMGYK IPNMGRLRDL PPDFFKWMKE KEIELKLGRE ALRVEKKTVP ALSHPLGPSQ KANVTINTPV RPPPEDKTNL TPTRTPTSNR RKSGNLASPL PIEGAELSTT LSMLYPSTTP TKLGAGLPSP GLRSSASINE LREKKLAERP FEAIQEEMPS HIRSESYEMR MRTTEMLRQN SAESVSSSGS RLSCGSGAND DSGKKR
|
| |