Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH00460 |
Symbol | |
ID | 3259154 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 1055861 |
End bp | 1056975 |
Gene Length | 1115 bp |
Protein Length | 281 aa |
Translation table | |
GC content | 52% |
IMG OID | 638258440 |
Product | conserved hypothetical protein |
Protein accession | XP_572238 |
Protein GI | 58270164 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAGCTCG AAGGAAAGGG TTCGTATAGC TCGTACCAAT TCCCAATCAA TGCTCATGGT CAATTACTCT AGCTTTCGTT GTCACTGGTG GCTGTGGTTC TATCGGTGGC ACTGCCGCTA AGCATATCAT TGCCAGAGGC GGTATTGCTC TGGTAAGTGG CTTTCATATT TCAATAATAT TCCAAGCCAA CTTTTTGGCA GATCTTCGAC GTTCTCCCCG AGGAAGCTGG TCAGGCCAAG GTTAAGGAGT ACCATGCCGA GCGTGCTTTC TACTTCAAGG CCGACATCAC CAACGTCGAG GTCTTCTCCG CCTGCATTGA TGCCGCCTTG AAGGTTATCC CCAAGGGCTC TCTCTTCGGT GGTGTCCATT GTGCTGCTAT TGCGCCCGGC CGACCTTGGG ACCAAAAGTT GAAGAATTCT ATCGCTGTAA GCTTGATTCT TCGACAGTAT GCAGTCTTCG GTTTTACTAA TAATCTCACT TCCAGCACTT CCAAAAGGTC ATGCATGTCA ACTCTTACGG TACCTTCCTT GTCGACGCCT GTATTGCCGA CGCTATTAAC TCCCAGTACC CGGACGAGGG ACCTTTCGGC CCCCGAGTTA AGGAGGAGCG AGGCTGCATC GTGAACATTG CGTCTGTTGT CGCCAAGCCT GTCCCTGCCC GATGCTTGAC CTACGGTGTC AGCAAGGGTG AGTTCTGATC CTTTATTAGC AATATCAACA TCAAACTAAT GTGCCATGCA GCTACAGTCT TGGGTATCAG CAGCGGTATT GCCGACTTCC TCGGCCCCTA CGGTATCCGA GTCAACTCTG TTAGCCCTGC TGTCGTTGCT TCCTCTCTTA TGGGCCCCGA CCGAATCGTG AGTTTTTCCG TATCGCACGT ATATGTTCGC CATAGACCGA CATGGTTATA GCCCTACTTC GAGTCTGAGC TCGAGGCCGC CGCCATCTAC CCTCGCCGAC TCTCCCAGCC CGACGAAGTC GCTCAGGGTA TTGTCTACCT TCTGGAGAAC TCTATGATGA ACGACTTTGA GCTCAGGGTC GACGGTGGCT GGAGAGGTAG TAGCAACTGG GGCGGCCCCC ACGACCCCCG ATCCAACGCT CCTTCTCTTG AATAA
|
Protein sequence | MKLEGKAFVV TGGCGSIGGT AAKHIIARGG IALIFDVLPE EAGQAKVKEY HAERAFYFKA DITNVEVFSA CIDAALKVIP KGSLFGGVHC AAIAPGRPWD QKLKNSIAHF QKVMHVNSYG TFLVDACIAD AINSQYPDEG PFGPRVKEER GCIVNIASVV AKPVPARCLT YGVSKATVLG ISSGIADFLG PYGIRVNSVS PAVVASSLMG PDRIPYFESE LEAAAIYPRR LSQPDEVAQG IVYLLENSMM NDFELRVDGG WRGSSNWGGP HDPRSNAPSL E
|
| |