Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CND04940 |
Symbol | |
ID | 3257398 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006686 |
Strand | + |
Start bp | 1354082 |
End bp | 1355122 |
Gene Length | 1041 bp |
Protein Length | 308 aa |
Translation table | |
GC content | 52% |
IMG OID | 638256430 |
Product | conserved hypothetical protein |
Protein accession | XP_570419 |
Protein GI | 58266526 |
COG category | [J] Translation, ribosomal structure and biogenesis |
COG ID | [COG0689] RNase PH |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 0.189992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAGTT ATAGAACTTG CACAACCATG GCCACCGCAG GCCCCAGTAG ACGCCCCGAT GGCAGAACGC CTGCCCAGCT TAGGCCTTTG CATCTGTCCA TTGGTGAGCT CGATCGTGCA GACGGCTCGG CTCGCTTCGC TTTCGGTATG TCTCTTCTTG TATACCTCCT CCGCAGACTG CTGAACACAT GCCTCGTAGG GTCAAATGCC GTCCTTGCAA GCTGCTCTGG TCCTATAGAG GTCCGCCTCC GTGAAGAACT CCCAGACAAA GCCACTTTTG AAGTAAATCA TCGCCCTCTC GAGGGCGTTG GTGCAACTCC TTCCCGAGCT CTTGTCACCA CCCTTGAAAC TATTTTCCCT CCCATCTTAT CATTAGAAAA GCACCCGAGA TCCCTTGTTC AGCTTGTAGT GCAGAGCTTA GTGCCATCTA CAGGTAGGGT TGTGTACGGG TCTGTCTTTG GGGCGGAAGG GGTGGGAGCA GAGCAGAACA CATGGCCGGC GACGGATAAG GACGATTACG CCTATATCCC AGAAAGTAGA AAAGATGCAG CTAGGATATC TCCTGCAGCG GGGTATACTT TTACTGCTCG AGCCGCCTCT ATCAACGCTT CGACATTAGC ACTCCTCTCC GCGGGTACAA TATCGATCTT AGCACTTCCC GTCGCTGTAG CCCTCGTGGT GACTACCAAA GGGAGAGTGA TGTTGGATCC AGAAGCCGAT GAGGAGAAGC AGGCAAAGGC GAGACTCGGG TTCGGCTGGG CCTGGGGTGC AGTATTTGGG ACGGCCAATG AAGAGAACAA TATGGGAGTT GCTGGGCAGA ACGACGGTGG GGCAGAACTT GTTTGGATCG AAAGTGAAGG TAGCTTCACT AGGCAGGAAG TGAGTATTTC ATATTTTTTT TTTCAAGCAA GACTTGAGCA CTGATGATGA GCACCTCAGT GGTCGGAAGC GCTGCAAATG TCCAAAACGG CCTCAAAGGC AATCCTTGAA TTCATTCGAA TCCAACTTGA CGCTCATCTT AGTTCACATC AACTCTCATA G
|
Protein sequence | MSSYRTCTTM ATAGPSRRPD GRTPAQLRPL HLSIGELDRA DGSARFAFGS NAVLASCSGP IEVRLREELP DKATFEVNHR PLEGVGATPS RALVTTLETI FPPILSLEKH PRSLVQLVVQ SLVPSTGRVV YGSVFGAEGV GAEQNTWPAT DKDDYAYIPE SRKDAARISP AAGYTFTARA ASINASTLAL LSAGTISILA LPVAVALVVT TKGRVMLDPE ADEEKQAKAR LGFGWAWGAV FGTANEENNM GVAGQNDGGA ELVWIESEGS FTRQEWSEAL QMSKTASKAI LEFIRIQLDA HLSSHQLS
|
| |