Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNE00130 |
Symbol | |
ID | 3257955 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006687 |
Strand | + |
Start bp | 25005 |
End bp | 26115 |
Gene Length | 1111 bp |
Protein Length | 302 aa |
Translation table | |
GC content | 52% |
IMG OID | 638256595 |
Product | conserved hypothetical protein |
Protein accession | XP_571079 |
Protein GI | 58267846 |
COG category | [I] Lipid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism [R] General function prediction only |
COG ID | [COG1028] Dehydrogenases with different specificities (related to short-chain alcohol dehydrogenases) |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 31 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGCAG AATCTTTCAA TGCTGCTCAA AAATTCGAAA ACGGAGAGGA AGTCGTCGAT CTGGCTGTGG CCGGTCAACC TCAACCGGAG GACGGTGAGC ATGGAACGAG TCTCTTGGAC GTCGAGTCTG TAAGCTGATT GAAATCCAGC CACCCCTTTG GTCTCTTTGA AGACCCCCCA GTTGTTCTCC TTAGCCAAAC AAGGAGTTAT CGTTACTGGT GCTGCCCGAG GTCTCGGTTT GGCCATGGCC ATCTCCCTCC TTGAGGCTTC CGCCGCTCAC GTATATTGTG TCGACGTTCT CCCCTCTCCA GTCACGGACG AGTGGGATCT CGCACAGCGC ACCGCAAAGG GATTCGGAAG CACAATCGAG TACCGTCGTC TCGACATCAC CGACGAAGAG GCCGTTAAAT CTTCATTCGC AGACATCTAC TCGACATGTT CTTATCCGGT CAAAGGACTC TTTGCAGCAG CGGGAATCCA ACAAATGATT CCAGCTTTCG ATTATCCGGC CAAGGACTTC AGACGAATCA TGGAAGTAAA TGTTACCGGT ACATCCATTT TTATGCATCG GGATGAAGCT GACCAAGTGC AGGAACCTTC TTGACGGTGC AGGCAGCCGC TAAGGAAATG AAAGCACGTA AAATTGCGGG CAGCATCGTC ATTACAGCCT CTATGTCTGG GTCTATTGCT AACAAGGGTA AGTTAAGGGC CCTTTGAGGG GAACAAAGGC TGAAACGACC CAGGTTTGAC TTGTTCGGCT TACAACACCT CAAAATCGGC TCTCTTGCAA CTCTGTCGCA GCGTCGCAGC CGAGTGGGGT CAACATGGAA TCCGAGTTAA CGTACGTCTT ATTCTGTGTC CTCTGCGTCT CGGGTTCCCA AGCTCACGGT CGCTATAGAC TCTCTCTCCA GGATATATCA GGACAGCCAT GACGGACGGA CTTCTCTCTG AGCTACCCGA CCTTGAGCAG GAGTGGCTAC GAGGAAGTAT GCTCGATCGT CTATCCACTC CTGACGAATA TCGAGGACCA CTTTTGTTCT TGCTCAGTAA CGCTTCCAGT TTCGTCACCG GGGCGGATCT TCTCGTGGAT GGTGGTCACA CCGCATTCTA G
|
Protein sequence | MTAESFNAAQ KFENGEEVVD LAVAGQPQPE DATPLVSLKT PQLFSLAKQG VIVTGAARGL GLAMAISLLE ASAAHVYCVD VLPSPVTDEW DLAQRTAKGF GSTIEYRRLD ITDEEAVKSS FADIYSTCSY PVKGLFAAAG IQQMIPAFDY PAKDFRRIME VNVTGTFLTV QAAAKEMKAR KIAGSIVITA SMSGSIANKG LTCSAYNTSK SALLQLCRSV AAEWGQHGIR VNTLSPGYIR TAMTDGLLSE LPDLEQEWLR GSMLDRLSTP DEYRGPLLFL LSNASSFVTG ADLLVDGGHT AF
|
| |