Gene Information |
Locus tag | CNH02900 |
Symbol | |
ID | 3259218 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 301568 |
End bp | 303528 |
Gene Length | 1961 bp |
Protein Length | 498 aa |
Translation table | |
GC content | 51% |
IMG OID | 638258195 |
Product | cytoplasm protein, putative |
Protein accession | XP_572461 |
Protein GI | 58270610 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
| |
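The coordinate fields above can be cross-checked against the reported gene length. The following is a minimal sketch, assuming that Start bp and End bp are 1-based and inclusive (an assumption, though it is consistent with the 1961 bp given in the record); `gene_length` is a hypothetical helper name, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


# Values taken from the record: Start bp 301568, End bp 303528.
length = gene_length(301568, 303528)
print(length)  # 1961, matching the "Gene Length" field

# A 498 aa protein needs (498 + 1) * 3 = 1497 coding nucleotides
# including the stop codon, which is shorter than the gene span.
coding_nt = (498 + 1) * 3
print(length - coding_nt)  # 464 bp not accounted for by coding sequence
```

The difference between the gene span and the coding length fits the record's "Is gene spliced: Yes" entry, although this arithmetic alone does not say how the remaining bases divide between introns and untranslated regions.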
Plasmid Coverage information |
Num covering plasmid clones | 32 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCCGGTCTG TTCCTTGAAT TTTGTGCACA GGCCCCTATC AATACCAACC AGAAATCTCA CATTAACACT CGCCCACAGC CAAAAGCACA ACCTGTTTAT CCACCCCAAT CCACAAGCAT TCAGCAATGG GTGTACTCTC AAAGTTCAAA GATCAATTCA CCCAATCCTC CCCTTCTTAC GACTTTCCCC CTCCACCTCC TTCAATCCCT CTCGGCCCTA ACGCAGTCTT CCGTTACCGC AAGCAACGAG GTGTCAATTT GGGATCATGG TTTTCTCTGG AGCAGTGGAT CTGCCCTCAC GTCTTCAAAG GCGCCAAACC CCCTGGCCAA AGTGACTATG ATGTGGCTAG TGGGAAGGAC GCAAAGAGGA TTTTAGAGGA GCACTGGGAT ACTTGGATCA CCGAGGATGA TATGAAATGG ATTGCATCTA GGGGGTTTAA CAGTGTTCGA CTACCCGTGA GCCTACTACA GATATGGATA ATATTTACTT GCTGATGGTA TCTTGGTTGG GCTCAGATTG CATACTATCA CCTCTGCGGT CCTCTCCCCG AAGTTTTGAA AGGCACTGAT TTTGAATCAT TTCGCCACGT TTTTGAAGGC GCTTGGGGAC GAATCGAGAG AGCGGTCGAG ATGGCAGGAG CTTATGGTTT AGGAGTACTT ATAGGTTAGT CATTGAGTTG ACACTTGCCA TCGGATCACT TCTCAAAACA TTAGGCGCTG ACGTCTTTTC ACCGTACTGC AATTAGACCT GCACGGTGCG GCCGGTGCTC AGAATCCGGA TGGTGAGTTT TTACAGCAAG GTAGCCATTT CAGTTCAGCA TCATTAACCA TAACTGCCTT TATACAGCCC ATGCCGGCCT TTCTCGCGGC AAAGTATCAT TCTGGGATAC CCATGCCAAC CAAGCCTCCA CCTCCCTTGC ACTTCGCTTT CTCGCATCCA AGTTCGCTTC CGTCCCCCAC ATCGTCGGCC TAGAACTCCT CAACGAGCCT CAAAACAACC GCAAACTACA GTCATGGTAC AGCAAGACTA TCGACGAAGT CCGTAAAGTC GCTCCGCCGG ATTTTCCCAT CTACTGCTCA GACGCATGGG ATACGGATCA TTATGCTGGT TGGGTCGGAT CGAGAGGCGA TTTCGTCGTA CTAGACCATC ACCTTTACAG ATGTTTCACC GATGAGGACA AGTGTCAAAC CGGAACAGAT CATGCAAACA ATCTCAGATC TGGTTTCAGA GGTAGGTTTG CCCAGCAATG CGAAGCCGCC AAAGGATCTC TCGTCGTGGG CGAATGGTCT GCCTCCCTGG ACCCTCGGTC CTTCCCTCAA GGAATGCCTG ACGGGGAGAA AGACGCTCAG AGACGGGCAT TCGTCCACGC CCAGCTCGAG ATATTCGAGT CCCACGCGGC GGGGTATTGG TTTTGGACTT ACAAGAAAGG TGAAGGATGG GACGCTGGCT GGTCAGCAAC CAATGCGAGT CAGGCAGAGA TTCTGCCTGG GTGGGTTGGA AGCAGACAGT TCAAAGGGAC ACCACCCAGT CATATCAAGG ATCAAGAGTT GCAGAACGGA CATAGTAGGT TTAGCAAATC TCTCCCGCTA ATTGTCAAAA TGCGAAAAGC ACTGATGGGA CATTCTTTTG AATAATCTAG AATCCCACGC TGATTATTGG GCCGCAAACG GCGGTTCTCC CAATCCAGAT ATGTACGCTC CCGGATTCTC TCAAGGATGG GACGATGCTC TCATCTTCTT GAGCACCCAA GGATCACCTA GCGAGATGGG TTTTGTGCAC CAGTGGGCAA TAAGGCGTCA 
AGCAGAGTTT GAAAGCCAGG GGCATAAGCT CGGCCGCGCT GCTTGGGAAT GGGAGCACGG CTTCAAGCAA GGGGTTGAAG CTTGTGCTAG GTGCTGTCTG GCTTGATACA CGTTTCGGCA AGGAGTTCGC TGTATCATTT AGGCAGCCCG TTGTAGATAG T
|
Protein sequence | MGVLSKFKDQ FTQSSPSYDF PPPPPSIPLG PNAVFRYRKQ RGVNLGSWFS LEQWICPHVF KGAKPPGQSD YDVASGKDAK RILEEHWDTW ITEDDMKWIA SRGFNSVRLP IAYYHLCGPL PEVLKGTDFE SFRHVFEGAW GRIERAVEMA GAYGLGVLID LHGAAGAQNP DAHAGLSRGK VSFWDTHANQ ASTSLALRFL ASKFASVPHI VGLELLNEPQ NNRKLQSWYS KTIDEVRKVA PPDFPIYCSD AWDTDHYAGW VGSRGDFVVL DHHLYRCFTD EDKCQTGTDH ANNLRSGFRG RFAQQCEAAK GSLVVGEWSA SLDPRSFPQG MPDGEKDAQR RAFVHAQLEI FESHAAGYWF WTYKKGEGWD AGWSATNASQ AEILPGWVGS RQFKGTPPSH IKDQELQNGH KSHADYWAAN GGSPNPDMYA PGFSQGWDDA LIFLSTQGSP SEMGFVHQWA IRRQAEFESQ GHKLGRAAWE WEHGFKQGVE ACARCCLA
|
| |
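The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before counting residues or bases. The following is a minimal sketch; `sequence_length` is a hypothetical helper name, and the example input is just the first two blocks of the protein sequence from the record.

```python
def sequence_length(grouped: str) -> int:
    """Count sequence characters, ignoring spaces and line breaks."""
    return len("".join(grouped.split()))


# First two 10-residue blocks of the protein sequence shown above.
first_blocks = "MGVLSKFKDQ FTQSSPSYDF"
print(sequence_length(first_blocks))  # 20
```

Applied to the full protein sequence, the same function can be compared with the "Protein Length" field; for the gene sequence, note that the display wraps across a line break, which `str.split()` also treats as whitespace.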