Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG00310 |
Symbol | |
ID | 3258690 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 73174 |
End bp | 76265 |
Gene Length | 3092 bp |
Protein Length | 858 aa |
Translation table | |
GC content | 50% |
IMG OID | 638257645 |
Product | alfa-L-rhamnosidase, putative |
Protein accession | XP_571766 |
Protein GI | 58269220 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR01352] TonB family C-terminal domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGGTGA CCATCGTCAG CTTACAAGCT GAACGTCACG AATCCGGCTT CGGTATCGCC CATCCTACTC CTCGCCTGAC ATGGCGATTC GGTTCGACAA CGCTCAAAGA CTGGAAACAA GCTTCTTACG AGCTTATCAT CACCCATCCT GGGAATCATC AGGCCGAGCA CTACATTGTC AAGTCTGAGC AGTCCGTCCT CGTTTCTTGG CCGTCGAAAT CTATTCAATC GAGAGAGATC GTGGAGGTCA AAGTTCGTTC AACAGGTACC GATGGGTCGA CAACAAACTG GGCTGGGATC ACACTCGAAG CTGCTCTGCT TGACCGAGGA GAGTGGAAAG CTAAATTTAT CTCCGGTCCT CCTCAAGAGG TTGACGCCCC CAAACCGCCT TTCCGTTTAC GCAAGACCTT TGTTCTCAAG TCTGCTCCTA TTCAAGCTCG ACTCTATGCG TCTGCTCTCG GAGTGTACGA ATGCGAGATC AACGGGAAGA GGGTAGGCGA CCAGATTCTG GCCCCCGGGT GGACATCGTA CAAGTACCAT CTCCGTTATC AGATATACGA TATCACCTCC CTCCTCCAGC AAGGCGAGAA TACGATCACC GCGTACGTCG GAGAAGGATG GTACGCTACC CGTCTTGGTA GACCTGGGAA ACGCAATAAT TGGGGCAGTC GCTTGGGATT CTTGGGACAG CTCGAAACAG ATGGTGAGGC TGAAGTGGTG ACGGATGAGA CGTGGGAATG CGTCGATGGG CCTATCAAGA ACTCCGAGAT TTATAACGGC GAGGTGTACG ATTTGACATA CGACGAGTCC AAGGCGAAGA TCTCCCCTGT CGAAGTCCTC TCCTTCCCGG AAGCCCAACT CATCGCTTCC GATGCTCCAC CAATTAGGCG AGTCAAGGAA GTCAAAGCCG TGGAACTTAT CACGACACCT TCCGGCAAAT CCATTCTCGA TTTCGGACAG AACCTTGTAG GATTCTTGAG GATTGAGACG GATCTGAAAG GGAAGGAGCT ATTATTGAGG CATGCAGAGG TATTGGAGGA TGGGGAGCTT GGAACAAGGC CGTTGAGAAC GGCGGAGCCG AATGATAAGA TTATTCTGGG TGGGAAGACG AAAGGATGGG AACCCAAGTT CACTTTCCAC GGCTTCAGGT GGGTGCATAA TGTGTTTGGA TTTCCGCGCT CTAACAATAT GTTAGGTACG TTGAGATAGA GGGCATCAGA CCAACCCTCG AGGACTTTAC CGCCATTGTC ATTTTCTCCG ATATGCGTCG TACAGGGACA TTCACATCGA GTCATGACAT GGTCAATAGA TTGCACGAGA ATGTTGTATG GGGAATGATG TCCAACTTCG TCTCTGGTAA CTCGGATTGT CATTCGTTGA GATTTGATCG CTGATGGATT TGCAGTCCCG ACTGATTGTC CGCAGAGGGA CGAACGATTA GGATGGACGG GGGATATTCA GGTATTCGCA CCGACTGCAA ATTACCTCTT TGACACTTCA GGTAGGTTTC ATTTTTGCCT TATCTCTTAT CCCACATTTA TGAAAATAAC TAGATCCCAC TAGGGTTCCT TGAAGGTTGG CTCCAAGACG TGGCTGCCGA ACAGATTGAA TGGAAAGGCG TGCCGCCTAC CGTCGTACCC TATGTTCCTC CCAACAAATT CAACGACCAA TACCCCAAAC CCCAATCCAT CTGGGCTGAT GTGGTAGCTA TCGCCCCTTG GGATTTGTAC AACACCTTTG GTGATGAAAG GATTATGGAG AAGCAATGGG GTAGCATGCG CATGTGGCTG GATGAGGGTG TGCCGAGAGG CAAGGATGGG CTTTGGTCAG AGATAGCCCC TCAGTATGGT GACTGGTTGG ACCCGAATGC TCCTCGCAAG TGCTACTTCT TTGGGGATAT GATCAACCAA GCTGACTTTT GTGATCAGCT CAATATCCTG CGCATGGGCG TACAGATACA CACTTTGTGG CCAATGCCTA CCTTGTCCAC GTCACGTCCC TCGTTGCGAA AATCGGTAAA CTGCTGAAGA AGGATCCCGA GGTAGTGAAG AAGTACGAAG ATGATGCCAC CCGATTGCAT AAGCTTTTCC TTGAAGAATA TACAACATCC ACGGGGCGAG TCGTTTCGGA TACCCAGACA GCTCTTGCTC TTGTTCTCAA GTTCAATTTG CTCAAAGCAG AACAGATTCC GCGAGCCCGG GAGAGGCTTG AGTTTCTGAC AAGGTGGGCT TACTTCAAGG TATCAACGGG CTTTGCGGGG ACGCCCATTT TGTTACCTGT CTTAGCCGAT AATGGGCTAG AGCATATTGC GTACAGAATG TTGCAAGAAA AAGATAATCC TTCGTGGCTG TACTCTGTGG GTATGGGTGC AACTACTATT GTAAGCATCT TTTTTGTCGC AACACAGTCA AATTGCTAAT GCGTCAGTAG TGGGAGAGAT GGGATTCGAT GCTCCCCAAC GGTCGAATCA ATCCTGGTCA AATGACTTCG TTCAACCACT ACGCCCTTGG CGCTGTCGCT AAATTCATGC ATACCTACAT TGGTGGTCTC TCCCCTTCTT CTCCAGGTTG GAAGTCTGCC CTCATCAAGC CCTTGCCCGG CGGCACGATC ACCTCTGCTC AAACATCCTT CGACTCGCCT TATGGACCTT ACGTGTGTAA GTGGAAGATT GAGGGGGATA CAATGTTGGT TGATACGGAA GTACCGCCCA ACGGAAGCGC GAGGGTTGTT TTGAACGGGA TTGATGAGGT TATTGGGAGC GGGAAAAAGA GGTTCAAGGT GCCGTATGAA AAAGACAAGA GATGGCCACC CAAGGGTATC CGAGGGCCGC AAAGTGTGTT CATGCCTGAT GAGTTTGTGC CCTAGACGGA ATTTTCAGTT TGCAATTGTG TTCTGGACAA TGAAAGGTTG TTAGTGATTA CAGTTCTTAT ACACTATACA GATAGAATTC TAATCGTCGT CTTTCTTACA CAGCCAGATA CTTGCCAGAC TTGTTTACGT ACGCGTAGCC TCTGTTTCGT CGACAATAAT GATGAAACCG TAGCCAAACT AATCATCCAC CGACGTATCA GGTTCCTGCG AGCCAATTAA TA
|
Protein sequence | MSVTIVSLQA ERHESGFGIA HPTPRLTWRF GSTTLKDWKQ ASYELIITHP GNHQAEHYIV KSEQSVLVSW PSKSIQSREI VEVKVRSTGT DGSTTNWAGI TLEAALLDRG EWKAKFISGP PQEVDAPKPP FRLRKTFVLK SAPIQARLYA SALGVYECEI NGKRVGDQIL APGWTSYKYH LRYQIYDITS LLQQGENTIT AYVGEGWYAT RLGRPGKRNN WGSRLGFLGQ LETDGEAEVV TDETWECVDG PIKNSEIYNG EVYDLTYDES KAKISPVEVL SFPEAQLIAS DAPPIRRVKE VKAVELITTP SGKSILDFGQ NLVGFLRIET DLKGKELLLR HAEVLEDGEL GTRPLRTAEP NDKIILGGKT KGWEPKFTFH GFRYVEIEGI RPTLEDFTAI VIFSDMRRTG TFTSSHDMVN RLHENVVWGM MSNFVSVPTD CPQRDERLGW TGDIQVFAPT ANYLFDTSGF LEGWLQDVAA EQIEWKGVPP TVVPYVPPNK FNDQYPKPQS IWADVVAIAP WDLYNTFGDE RIMEKQWGSM RMWLDEGVPR GKDGLWSEIA PQYAQYPAHG RTDTHFVANA YLVHVTSLVA KIGKLLKKDP EVVKKYEDDA TRLHKLFLEE YTTSTGRVVS DTQTALALVL KFNLLKAEQI PRARERLEFL TRWAYFKVST GFAGTPILLP VLADNGLEHI AYRMLQEKDN PSWLYSVGMG ATTIWERWDS MLPNGRINPG QMTSFNHYAL GAVAKFMHTY IGGLSPSSPG WKSALIKPLP GGTITSAQTS FDSPYGPYVC KWKIEGDTML VDTEVPPNGS ARVVLNGIDE VIGSGKKRFK VPYEKDKRWP PKGIRGPQSV FMPDEFVP
|
| |