Gene Information |
Locus tag | CNH03120 |
Symbol | |
ID | 3259056 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 229742 |
End bp | 231698 |
Gene Length | 1957 bp |
Protein Length | 366 aa |
Translation table | |
GC content | 49% |
IMG OID | 638258173 |
Product | conserved hypothetical protein |
Protein accession | XP_572481 |
Protein GI | 58270650 |
COG category | [R] General function prediction only |
COG ID | [COG0673] Predicted dehydrogenases and related proteins |
TIGRFAM ID | |
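The coordinates and lengths listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record: the helper names `span_length` and `coding_length` are hypothetical, and it assumes 1-based inclusive coordinates and that the coding portion is three nucleotides per residue plus one stop codon. Under those assumptions, the difference between the gene span and the coding length gives the number of non-coding positions (introns and any untranslated flanks), which is consistent with the gene being marked as spliced.

```python
# Assumption: Start bp / End bp are 1-based, inclusive coordinates.
def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


# Assumption: coding length is 3 nt per amino acid, plus one stop codon.
def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Number of nucleotides needed to encode a protein of the given length."""
    codons = protein_aa + (1 if include_stop else 0)
    return 3 * codons


# Values copied from the record above.
gene_len = span_length(229742, 231698)   # Start bp, End bp
cds_len = coding_length(366)             # Protein Length in aa
noncoding = gene_len - cds_len           # introns and any untranslated flanks

print(gene_len, cds_len, noncoding)
```

The first value reproduces the listed Gene Length of 1957 bp. The remaining two are derived under the stated assumptions rather than taken from the record.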
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGAACGGAAT TATAAGTGAA TTGCTTAACC GCTAGGAAGC TCGGAATGCT TCCGTACGAT TTGGGCCTAT GAAGTTCGGT GAGTTCGGTC GATTCCTGAA AACAAAGTCA TTCTGATTAC ATTTCGGGAG CGGTCTTCGG ATCAAACTAC TATCGGTCTT TCCGACGCTC CCTGAGACCT CTAAGACCTC TCCACTTTAC TTCACTCTAC TCTCTGGCCC GCACCCAAGT CTCTCTTCGG TCTATTTTCC AGCTTCTTGC AATTCCAGAT CAAACAGATG GTTACATTGT ATGGGATACA TTTGAGTTTA GCAGGACGTA CATTTAGCAG AAATTTGCAA GAGCATGGAC TTAGGTATAT ATATAACTGA CCACGATAAA TCGTCGTCGG CTCTGCACAA TACCACCCAC TTCGCATAAC TATTACATCC AAAACAAGCA CAACACAATG GCTATCTCTC AAGGCAATTC TAATGGCAAG CTTCGCATCG GTGTCCTCGG TGTTGGCCGG ATGGGTCGAC GACACGCGTC AAACGTGAGT TTTGTCGCTG TGAAATTGGT ACGAACACAG TTGCTCACAC ATTGTTTGGT AGGTTGCTTA CTCGGCACCA CGGGCAGAGC TGGTGGCAGT AGCGGACCCC AGTCAGGAGG CACTGGCGTG GGCCAAAGCG AACCTGCCGT CCACAGTGCA GTGAGTGCTT TCTCCGTCCG GTTACAGAGA ATGAGAAACA TCGCTGACAG CATGTGAAAA GGTACTACTC CGATTCCGCG GACGTCATTC AATCGCCCAA CGTCGATGCT GTCTTGATTT CTACTGAGAC TTCGGAACAT GCAAGGCTCG CTTTGGAGGC TGCCGCCGCG GGAAAGGTAT GTAGACAGCA GGCGAGCCTT GGGAGTAATA CAACTGACCC TTCCGACAGC ACGTCCTTCT TGAGAAGCCC ATCTCCGTTG ATGTTGACCT TTCGCGGCCT GTCGTGGAAG CAGCCGGTAA GCACCCCGAA CTGAAGATAA TGGTTGGATT CTCTCGGCGA TGTAGGTCTT CATTGACTAT TTACATACAC GCGGCGTCGA TGCTGATAGA AGGCCTACAG TCGATGCTTC CTACAGGGAG GCAAAGAGGA GGATTGATGA AGGTACCGTG GGCAAACCGT ACTTGATCAA ATCGTGCACC AACGATCAAT ACGACTCCAC CGGCTTCTTC ATTGCCTACT CTAAGGCATC TGGAGGCATC TTCATTGATT GCGGTATCCA CGACATTGAT ATCTCACGAT GGCTATTGGA CGTTGAAAAC CCAGCCAACC TCAAGTATCC TAAGAAGCAA GTGACCTCTG TTTGGGCGAC CGGCCTTAAC GCTCAACACC CAGAGCTCGC GTCGTACGGA GACTGTGACA ATGCTATCTG CGTGGTCGAG TACGAGAATG GAACCAAGTG TACCTTCCAC CTCTCGAGGA CAGCCATCCA CGGTCATGAC TGCTTCTGCG AGGTCTTTGG AACCGACAGC AAACTGATCA TCAACGGCGT AAGTCATTGC CCGCTCTGAC AATTCGTATG TGAGCCAAAA TTGATCCCTT TTTTTTTTTT TCAGAATCCA AACATGAACC GTGTAGAAAT TAGAGACATC CACGGAGTTC GTATGGAGTC TACGTAAGTG TTTCAAACCA CCTTTTTGAT AAAGAACGTC TTACTCATTG ACTCGTCTAC AGGCCTACTT ACTACGAGCG ATTCAGGGAT GCATTCATCA GCGAGGTGCA AACTTTCTGT GACGTAGTTT TGGACGACAA ACGTAAGCTG TTTTGCTTCA 
ACATGTGTTG CCTCGTGCTG ATCATGCCCT TTCAGCCGTC CCCACCACAC CTCAATCCGC CCTGGAAGCT GCTAAGATAG CCATGGCCCT CACTCATTCT TTCCGAGCCG GCAAGCCAGT TTATTTTGAC AATGAAGGCG AGGCAATTAT TAACTAG
|
Protein sequence | MAISQGNSNG KLRIGVLGVG RMGRRHASNV AYSAPRAELV AVADPSQEAL AWAKANLPST VQYYSDSADV IQSPNVDAVL ISTETSEHAR LALEAAAAGK HVLLEKPISV DVDLSRPVVE AAGKHPELKI MVGFSRRFDA SYREAKRRID EGTVGKPYLI KSCTNDQYDS TGFFIAYSKA SGGIFIDCGI HDIDISRWLL DVENPANLKY PKKQVTSVWA TGLNAQHPEL ASYGDCDNAI CVVEYENGTK CTFHLSRTAI HGHDCFCEVF GTDSKLIING NPNMNRVEIR DIHGVRMEST PTYYERFRDA FISEVQTFCD VVLDDKPVPT TPQSALEAAK IAMALTHSFR AGKPVYFDNE GEAIIN