Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG02160 |
Symbol | |
ID | 3258952 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 608654 |
End bp | 611604 |
Gene Length | 2951 bp |
Protein Length | 746 aa |
Translation table | |
GC content | 51% |
IMG OID | 638257834 |
Product | conserved hypothetical protein |
Protein accession | XP_571924 |
Protein GI | 58269536 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCAATA CCAACCTCCA CGGTTACCCT TTCCAACCCG TCCGTCGCTT TCTGAAATCA ACAATGCCGA CATCCTCAAC CTTGATGATG ACGATAATCA TCCTCTTCTA TGCCCTGCCT CTCCTATCCC ATCTGACATA TCCAATACAT ACTCATACCA ATCGGGTGCC TTTGGCGATT CCAATTCACA GCTTGTCAAC GACCACAACC ATGTTGGAGC CGGCCACAGT GTCCAGGCCC GACCTTATGT CTGCACTTTT GGTACTTGTG ACAAGGCGTT TGCCAGAAAA AGTGATCTTG CAAGGCACTT TAAGATTCAT ACCAATGACA GGTGAGTCAG CTCATCCGTT GTGGTAAACA AAGCTAACGC TACCCCAGAG CCTTTGTCTG CACCTATCGC GGTTGTGGCA AATCTTTCAT CCAGCGATCA GCGTTGACAG TTCATTACCG TGTCCAGTGA GCTTGATCAG CTCTGTTGTC AAGGTCGATG CTGATGTTTT ATACAGTACC GGGGAACGAC CTCACCATTG CGAGACCTGT AACAAAGCAT TTGCGGACTC GAGCTCGCTC GCGAGACATA GGCGTATTCA GTAAGTCAAT GGTTTGTTTC AGCGATGAAA TAGTATTGAC AAAGGCTAGC ACTGGGAAAC GTCCTTACAC CTGCCACGCT GCTGGTTGCG GAAAACCTTT CGCTCGACGC AACACCTTGC TCAAACATTT CAAGCGGCAG CATCCAGGAC TACCTCCGCC TTCTACTGCT GCCCAGCGTA CCTCTATTCG AATGCCTATT CAGTCTCTTG GTCCAAGATC CTCCACTGCG TCTATGTTGT CCCAGCAATC ATTCAACTCC AACGCTGACC GCTACACCGC TAGTCCTTCC AATGCTGGTA CCCCCCACGG CTTCGCCGCT CCCCACCCGC CGGAGGGAGC GGCGTACGCC TTCCATGGCG GGTTCCCCGA ACAGGTACTC GGAGGACATC CAAGTGCCCA ACAGCCCATC ATCTTTCAAG GATCTGGAGG TATCCGACCT CACTTGCAAC AATCCCCGGT CCCTCCCGGA TCCGTCAGCC TCACCCCAGT CTCTACTTCT GGACCCCATT TTGCAGGTGG GTATGGTACC CAGACGCCAA CGAGCCCAGC GCACCATGAC CGAGACAAGC ACGGTATCTC CCCAGTATCT TCAATCGCAA CGAATAGCTT TGGTTCCGGA CAGTACGCTA GCCCGCTCTC TGCCTACCCG CATGCAAGCA GCTATCCGCT TACGCGCATC ACAAGCGATT CAGGAATAAT CTGGCATAGG TCACTCTCAG CTCCAGAAGA ACCTCGTTAT GAGGCTTCTC AGTGGAGCGG CGGCAATGTC GGGGCTGGTT TCCATGCATC ACAATTATCC ATGCCCCAGA CACCAACCCA TCTCTCGCCA TCTTACCCCT ACCATCTACC TATGCAACAA AGATCGGCCA CCAACCCTCT TCCCAAGACT AGGCAGCTGT ATTCTCCTTC TTGTCATGGT TCAGATGATG AGCGGGATGA ACCTCTCGTA TCTCTACCGG ACGCCCACCC TACCTTTGCC ATCCATCCTC CCCAGGGCAT TGTCAGCGTT CCTATGTCCA GCATTGAAGG TTCCAACATC AGCCACATAT CGAATCATGG GGGCCAGATT CTGTTCGCTC CATCTCAGAA CGGTCCACTC CACAGCGCTC CACCCGCTAT TCAGCGTTTC AATTCTATGC CAGCGGTGCC AACTATGAGC TCCTGGGGCC AAATTCCTCA ATACCAGACT CAAAGTGTCG GCAGTGCTAA AAGCGCAGAT GAGGAGTGGG AGGAATTGCA GAAGGAGATG CTCAGTCGGG AGGCATCAGT CGGGACTGAC AAGGAATTGA GTCCTGCTAC CGAAGAAGGG AAAACCCCCA AGGATGGTGA ATCCATCAAC CACTGGGGTG AAGCCATTGC ATATCCAGTG CACCCCCATC TGCAACATGC CAAGGATCCG TTCCATTCAA CCTCTTCCAC CTTTATTGAT CACATGCACA GCAGCGATCC TCTCCCTCCT ATCCACGTTT TCTCCAATCA ACCTCACCAC ATGGTTCTCA CTCCTATCAA TCCCAATGGC ATGTACCCCA CCCCCATTAC GCCTGGTGAA GAGTGGGCTC AACGTCACAT CAAGCCTACT GCAATGCTTG GCCGTGGATA TCCTCGGCAA CACATTGACG TAGAGAACAA AGAAAATGGC AATGAAATCG CCGAGCATAT CACCCTTACT ACGCCTCCCA AATATCAGGG ACATCGCAAG GACAGCCGAT CCGTGACTGC CGTTGGCTTA GGGATTGCCA ATGTTCATTT TACCGAGCCA CAAGCGATTG AAGCGGGCGA TGCGTCAGAA GTGAAAATGG AGGATCTTGA GAGCGAAGAA AGTGATGTGA CTCCAGAGGA CGATAGCGAC GATGAATTTG TGCTTGGGAG GAAACCGAGA AGGAGTGCAA GGAAGGGTGG TGTGAGGAAG AGGGGATCTA GAACCGCCAC AAAGAGAAGA CGTTCGTAAC GTTAATCCAA AGTATTGACT GGTCCTTTGC CATATACCCA TCTCATGATT CGCCATACGG TTTTATATCT CACAATTGTT CTCCTTTACG AGCTGTCAGA TCACGACTTC GGTGTCAGGA AGGAAAAAGA AAAAAGAGAA GTAGTAGTAT ATTCGCCTGT TACTCTAATT CGGCCTGAGC TTCATACGTT TTTAATAGCT TGTTAGCTGC GCTTGGCCTA GTCTTCAAGT CTATCTTTCG CATCACCACA TTTTTCCGCA TTTGCCACAT GTTAACGCTT GCATCGGCCG TTCCCTTCGT TAGTTGAAGA TTTTTATGTG CCTATTTATA CGATTTTTGA CGACTCATTT TTACGCTGTG TTATATGATT ATTGTTTGAT ATACGATAAC TGAGAGACGT TCCAAAAGAC A
|
Protein sequence | MINTNLHGYP FQPLVNDHNH VGAGHSVQAR PYVCTFGTCD KAFARKSDLA RHFKIHTNDR AFVCTYRGCG KSFIQRSALT VHYRVHTGER PHHCETCNKA FADSSSLARH RRIHTGKRPY TCHAAGCGKP FARRNTLLKH FKRQHPGLPP PSTAAQRTSI RMPIQSLGPR SSTASMLSQQ SFNSNADRYT ASPSNAGTPH GFAAPHPPEG AAYAFHGGFP EQVLGGHPSA QQPIIFQGSG GIRPHLQQSP VPPGSVSLTP VSTSGPHFAG GYGTQTPTSP AHHDRDKHGI SPVSSIATNS FGSGQYASPL SAYPHASSYP LTRITSDSGI IWHRSLSAPE EPRYEASQWS GGNVGAGFHA SQLSMPQTPT HLSPSYPYHL PMQQRSATNP LPKTRQLYSP SCHGSDDERD EPLVSLPDAH PTFAIHPPQG IVSVPMSSIE GSNISHISNH GGQILFAPSQ NGPLHSAPPA IQRFNSMPAV PTMSSWGQIP QYQTQSVGSA KSADEEWEEL QKEMLSREAS VGTDKELSPA TEEGKTPKDG ESINHWGEAI AYPVHPHLQH AKDPFHSTSS TFIDHMHSSD PLPPIHVFSN QPHHMVLTPI NPNGMYPTPI TPGEEWAQRH IKPTAMLGRG YPRQHIDVEN KENGNEIAEH ITLTTPPKYQ GHRKDSRSVT AVGLGIANVH FTEPQAIEAG DASEVKMEDL ESEESDVTPE DDSDDEFVLG RKPRRSARKG GVRKRGSRTA TKRRRS
|
| |