Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF02120 |
Symbol | |
ID | 3258197 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | - |
Start bp | 619804 |
End bp | 622280 |
Gene Length | 2477 bp |
Protein Length | 470 aa |
Translation table | |
GC content | 44% |
IMG OID | 638257338 |
Product | expressed protein |
Protein accession | XP_571560 |
Protein GI | 58268808 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2730] Endoglucanase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCAAGTGCGT GCGAATACTA CGATGGGTCT AACACAGTCG ACAATGAGCT CCAACCAGCC AGGCAGAGGC TTCTTGAAAG TTAGCGGAAA GGATATAACC TTAGATGGAA AACCGATTAC ATTGAGAGGT ACGAGCAAAA GTCAGGCTTG GACTTCATAC ACTGACTAGT CATTGTAATG TAGGGACTGC AATTGGCGGC TGGTGTGGGT TGCAATGCTA TTTTTTCTGT GCTGACCCGT GTGTTTACTC ATAATATTTG TATTGATTGG GTATCAGTGA ATATGGAGAA CTTCATCACC GGCTATGCTG GACATGAACA TCAAGTTCGG CATGCTCTAA AGCAGGTATT AGGGACAGAG AAATACAATT ACTTCTTTGA AAAGGTCAGT TGAAAGGAAT GAACAAACAT CTGACAACTT CTACTGACTG CTTGACAGTT CCTTGAGTAT TTCTTCGCCG AAGATGATGC AAAATTCTTT GCATCACTAG GATTGAACTG TATTCGCATT CCTGTGAGTT TTCAGTATCG GCCATAAGCA GGTTACTTAC AATTGCTTGT TTATTGCAGG TAAATTATCA TCACTTTGAG GATGACATGA ACCCACGAGT GTTCAAGAAA GACGGCTTGA AACATCTCGA TCGCGTGATT CAAATTGTAT GTCGATCCGT GCAGGTTACT AAACCACTTT TGTTCACATT CATGCAGTGT GCCAAGTACG GTATCTACAC TGTCATCGAT CTGCATGCAG CTCCCGGAGG TATGCACCCT CATGCGAAAC AGATTGACAG GCTTGCTGAC TGTTTAAAGG ACAAAATTTC GACTGGCATT CAGACAATCC AACTCACAAG GCGTTGTGTA AGCCCAAAAT CGCGACACGC GGTAATATTC GTTTGACGTT TGTTTGGTAG TCTATGAGCA CAAGGATTTC CAAGATCGAA CAGTCTTCAT TTGGGAAAAC CTAGCGCGTG TGAGTACGTC CAGCTGCGTC GTGTAACCTC TCGTTCTAAT TTTCGCCCAC GACCAGCATT CTAAGGACAA TACTTGGGTT GCAGGTTATA ATCCCTTGAA TGAACCTTCT GATGAGCAAC ACGTTCGCCT TGTGGCATTC TACAACAGGG TAGAAAAAGC AATCAGATCT ATTGATAGCA ATCATATGCT CTTTTTAGAG TAAGTGCAAA GGATATGCGT GCTGCTTGTC TGCCGGTTCC CTCTTAACCT TTTCAATCAT AGCGGAAAGT GAGGCTATTA CGATACATCG CCACTTCAGA CGCATTGCTA ACCGATATGC TGGGAAAAGC ACTTTTGCAG CGGACTTTAG CCGGTTTGGG AAGCCTCTCC ACAATTGCGT TTATGCTTGT CATGACTATT CCATGTGAGC TCATATAATT AAATAAAGAT GCCTGTACTC ATTCTTTAGC TATGGGTTCC CAAATCCACC CTCTCTATAT GAGGTCAGTC GACGGAAATA TTATCAGGAT CTCTATCATT GACAGTGCTT TAGGGCTCAA AGGAACAAAT CCAATTCCAC ATTGATTCAT TCAATGGTAA AACCGAGTAT ATGCGCAAGC ATGGGGTGAG TAGGTCGACT TGTTGGACAT GGTTGTATTG ACTAATTGTT GTTCGGCGTC CAGAGTCCAG TATGGGGTAA GCAATAGTAA GGAATTTAAA AGGGTATTGA CTAACCTCTG TGTCAGTTGG GGAATTCGGC CCTGTTTATC AAACATCTAA GGACGGATAT CCTGATTGGA AACACATCAA TGACACCCGA TTTGATGTCC TTCAGCTTCA GCTTGATATC TACGCCAAAG CTCGGGCTAG TTGGTCCATC TGGCTCTATA AAGATATTGG TTTCCAGGGT ATGATTTACG CGGGTGAAGA TACTGCATAT GTAAAACTTC TCAAGGAATT CTTACACAAG AAAAAGGCAC GTTCGACTAA TCCCCATCCA CTGCCCCGGC TGATACACCC CTTAGGTTGT TGCCGCTGAT AAGTGGGGAG CGGATGATCG TGCAGTGCGA CCGTTGTTTA CACCCGTTGA GTCATGGCTT CTCAAGACCG TACCATCAAT CTCGGACCGA TACCCACAAG ATTGGAGTGT AGGCGAGCAC CTTTCTAGGC TAGTCAGAAA TATGCTCCTC AGTGAAGAGC TAGTCAAAGA GTACGCAGAG CATTTTAGAG GGAAGAGTCT TGAAGAGTTG GATGAGCTAG CAAAGAGTTT TAAATTCTGT AAGCCTTAAT TTGTGGGTAT TTTATGGATT GAAAGCTGAT GGATGCTTGG CCGCGAAGCT AATTGTACTC AGAGGAAGAG GTTGAATGAT GTGCTCAAGT CAGATTCAGA GCGTGGCACT GATGAGAAGA AGTCGTTGTG GCAAGCTGGT GAGAAGGTAT GACAGAAGAT CAAGACTTTT GATTGCGAGC ATGTTATACA GTGAGAAAGA ATTGTGTCGA AACCAATAAA TCAATGCAGA AGTGTTA
|
Protein sequence | MGLTQSTMSS NQPGRGFLKV SGKDITLDGK PITLRGTAIG GWLNMENFIT GYAGHEHQVR HALKQVLGTE KYNYFFEKFL EYFFAEDDAK FFASLGLNCI RIPVNYHHFE DDMNPRVFKK DGLKHLDRVI QICAKYGIYT VIDLHAAPGG QNFDWHSDNP THKALFYEHK DFQDRTVFIW ENLARHSKDN TWVAGYNPLN EPSDEQHVRL VAFYNRVEKA IRSIDSNHML FLDGNTFAAD FSRFGKPLHN CVYACHDYSI YGFPNPPSLY ESPVWVGEFG PVYQTSKDGY PDWKHINDTR FDVLQLQLDI YAKARASWSI WLYKDIGFQG MIYAGEDTAY VKLLKEFLHK KKVVAADKWG ADDRAVRPLF TPVESWLLKT VPSISDRYPQ DWSVGEHLSR LVRNMLLSEE LVKEYAEHFR GKSLEELDEL AKSFKFSNCT QRKRLNDVLK SDSERGTDEK KSLWQAGEKV
|
| |