Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF01760 |
Symbol | |
ID | 3258066 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 511494 |
End bp | 514943 |
Gene Length | 3450 bp |
Protein Length | 725 aa |
Translation table | |
GC content | 49% |
IMG OID | 638257301 |
Product | hypothetical protein |
Protein accession | XP_571515 |
Protein GI | 58268718 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TAGACTCCGG CCTCTGTCCC ATATACATCC CTTCCATCAC TATACCACCA GCGGGGAGAC TCCGCCATAG AGACAGATGC CTCGAGACAC TTTCTCCAGC AACCCCTTCA TATCCTCTCC GGTCTACCAC GACAGGTCTG CCGACATCTC CTCTTTCTCC TACGATGCAA GGCGGTACGC GCCCTCGCCT TCATCCGAGG CGCTGCCCCA GGAGCTACCT GCAGGAGCGA TGCCTGCTGT TGCCGACACT TTTTCTGTGC GGAACTTGTC TTCGGTGGGT CACTATGTCT AACATTGGTT TTTGGCTGCA AGATTCGCTA TGATGTGCCT GAAAGGAAAA TGCGATGTTA CGGATCACAA ACAACCGCGC GCTTTTTCAT CGTAGTTCTG CGTTTTCCCA CTTTCTTTGG CGTCCATCCT TCTCCCCGCT CATTTGCTGA CTAGATCCTA GTATTCCCAG CGGCCCAGTA TGTCGACCAC CCCTAGGATG AACTCTTCAT TCAGCGGTAG GGATAACCAG CTGGAAAAAG ACAACCTGTT TGGCAGTAGT CCAACGCACT TCAACTCTAG CTCGCGTCTG GCCTACGAAG CAGGGCCACA TAGCAATTAT ATGGCCCCCG AGCCTTTAGG CTATGGTCGA ACCCGGACCA AGAAGCCGAC ACGCACTTGC ATACCCACGA ACCCAACCAA ACGTCGACTT TTGTTTTTCG GCGTTCCTAT TCTCTTGGTC GTTGTGGCAG CTGCTATCAT TGGAGGTGTG GTAGGATCTC AGAAGCGCCA TAGTGCGAGC AATGACAGCA GTAATGGGGA TTCTTCTTCC GGTACTTCTG GTGGCGGCGG GTCAAATTCT ACCAGTGATA CAGATGCGAA CACTTGGAAT TCGTTTATTC AACCTGGCTC GGGCGGTGAT GGGTCTATTG TCACCACAGA TCTCGGTGTA AACTTTACCT ACTCGAATGC TTTTGGTGGA ACCTGGGCTC AAGATCCTTA TGACCCATAT TCTGTGGGTT TGCTTCACCA TATCGGGCCC GTTGAAGCTG ATAACAATTT AGGTATCTGG TCAAGCTCAG AGCTGGAGCC CGAGCTTGTT GGAAGATTGG GTATGGGGAG AACATATCGT TCGAGGGTAA GTGATCATCT AGTCTTCGCT CTTGTCATGT CAGCACTGAC GACCATTCCT TCCATAGCGT CAATATTGGT GGATGGCTGG TGACTGAACG TAAGTTTCTT CCCTCCATAA CCATGTTAGA GTCTAACCAG ACATAGCTTT TATCGTCCCT AGTCTGTATG AGAAATATCA AACTTCCACA CCAAAAGCCA TCGATGAGTA TACGTTGTCG CAAGCGTATG TCTTTTATTT TCTGGTTTTC TGTGCTGTCT AATCCCAGTA TTAGAATGGG AGACAACCTC GCTACTGAGA TGGAAGAGCA TTACAAAACA TTCATCGTAA ACCATTTTAT GTTACTGATC GTACAAGAAG AATGAGCTAA TTTTTTTCTT CCTTCTAGAC TGAAGAAGAC TTTGCTTTGA TCGCTGGGGC GGGTCTCAAC TATGTTCGGT ACATTCGTTT TGAGTATCAA ATTATCAAGC AATGCTGAAC CTTGAACTTG TCTTAGTATT GCATTAGGCT ATTGGGCTGT TGAGACAATC GACGGCGAAC CATACCTCGC AAAAGTTTCT TGGAAGTCAG TCATTCCTGG AATTTTTGAA AGACAGTCGT CGCTTACCAC TAAGATCAGC TACTTCCTCA AGGCCATTGA TTGGGCTCGA AAGTATGGTC TCCGTGTTTT AGTCGACTTC CATTCGTTAC CAGGCAAGTC AGATTCATCT CTTGAACCCT CTACTTAACT AACTCCTGTA TGATAGGATC CCAAAACGGC TGGAATCACT CTGGCAAATC CGGGTCTGTC AATTGGATGT ACGGTGTCAT GGGTATCGCC AATGCGCAAC GCTCTCTTGA GACGCTCCGA TCGATCGTCG AGTACATTTC CCAAGATGGT GTCAAACAAG TCGTGCCTAT GATTGGGCTT GTTAATGAAG TTCAAGCAGA GACCGTCGGC GGAGATGTGT TGGCTGCCTT GTAAGTGTCA ATCTCTTGAT TGAAGGTTTT ATATTTGTTA ACTTTATTAT GTTTTTAGCT ATTACCAAGC CTACGAGATG ATCCGAGAAA TAACCGGCTA TGGTGCAGGC AATGGTCCCG TCATCTTACT GCACGAAGGT TTCTACGGGA TCGCGGCTTG GAATGGATTT TTGGCAGGTG CTGACAGAAT GTAAGGGTGT ACACATCTAT CATTGTACTC TGGCTAAACT TCTGCCATCC AGTGGCCTCG ATCAACACCC GTACTTGGCG TTTCCCACCA CCCAGATCGC TGACAACCAC ACTGTCCAAG CCCATACTGC CTGTGGCTGG GGTGGCGGTA CCAACGACAC TTCCACTGGC TACGGCATCG TCATCGGTGG TGAATGGTCC AATGCAATCA ACGATTGTGG CTACTGGCTC GACGGTGTTG ACTCAACTCC TCAATTCGAG GTCACGGGAA CCGGAAGCTG TGCCGCGCTC GATGAATGGT TCAACTATTC CGATGAAACC AAGCAAGGCA TCATGGGCTA TACTTTGGCC AACATGGATG CTCTCCAAAA TTACTTTTTC TGGACTTGGA AAATTGGGAA CAGTACGGTG AAGGGATATC CGACCAGTCC GATGTGGCAT TACAAATTGG GATTGGAACA AGGATGGATG CCAAAAGACC CTAGAGTTGC AGGTGGTTAC TGCCAAAGCA TCGGTGTAGG AGGGAACCAA GTAAGCTCTC TTCACATTCC TGCATTAGAA ACCCGAGTGC TAACGCTCTC TTCAAGTTTG CCGGGACATA CCCTGCTTCA GCTATCGGTT CCTTCCCCAC CGAAGTTGCT ACACCAACTA TCGATCCGAC GCAAGTTGCT TCTCATTCCG TTTGGCCTCC CACCGCCCTT GGGCCGTCTC CATCCTACTC TGCCGCGCAA ATAACTCTCT TCCCCACACT TACCCAGACC GGCACTAGAA ATGTTCTTGC AACGCCTACA CACCCCAGTA ATGTGACACT GGAAGGTGGT TGGGCAAATG CCGCCGATAC CACCGGAGCT TGGGTGAGAG TGGCTGGATG TGATTACCCA GAGTAAGCAC TTAGTTGGTT CCCTTCAAGG TGGCCATGCT GACGATTACG GCCTGTAGTG AATATGATGC GAATACGGTG GCTGTTCCTA CCGCCCAGTG TACCGGCTCT GGCTCTACAG ACAGTATGCG GAAAAGGAGT TTGCGAAGAT GATAAAGAGT GGCGGAGCGG CCTTTATAAC AACTTTGGTT TGAATCAACG GACTTCTTGT GGCATATTAG ATGAGCTTTA AATCGTTTCG CATTATCTTT ACATGATTTT TATATTGAAT ATATCTCCAT AACCTATACG TAACATCGGG
|
Protein sequence | MSTTPRMNSS FSGRDNQLEK DNLFGSSPTH FNSSSRLAYE AGPHSNYMAP EPLGYGRTRT KKPTRTCIPT NPTKRRLLFF GVPILLVVVA AAIIGGVVGS QKRHSASNDS SNGDSSSGTS GGGGSNSTSD TDANTWNSFI QPGSGGDGSI VTTDLGVNFT YSNAFGGTWA QDPYDPYSVS GQAQSWSPSL LEDWVWGEHI VRGVNIGGWL VTEPFIVPSL YEKYQTSTPK AIDEYTLSQA MGDNLATEME EHYKTFITEE DFALIAGAGL NYVRIALGYW AVETIDGEPY LAKVSWNYFL KAIDWARKYG LRVLVDFHSL PGSQNGWNHS GKSGSVNWMY GVMGIANAQR SLETLRSIVE YISQDGVKQV VPMIGLVNEV QAETVGGDVL AAFYYQAYEM IREITGYGAG NGPVILLHEG FYGIAAWNGF LAGADRIGLD QHPYLAFPTT QIADNHTVQA HTACGWGGGT NDTSTGYGIV IGGEWSNAIN DCGYWLDGVD STPQFEVTGT GSCAALDEWF NYSDETKQGI MGYTLANMDA LQNYFFWTWK IGNSTVKGYP TSPMWHYKLG LEQGWMPKDP RVAGGYCQSI GVGGNQFAGT YPASAIGSFP TEVATPTIDP TQVASHSVWP PTALGPSPSY SAAQITLFPT LTQTGTRNVL ATPTHPSNVT LEGGWANAAD TTGAWVRVAG CDYPDEYDAN TVAVPTAQCT GSGSTDSMRK RSLRR
|
| |