Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI03850 |
Symbol | |
ID | 3259788 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | - |
Start bp | 1035296 |
End bp | 1038816 |
Gene Length | 3521 bp |
Protein Length | 869 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258880 |
Product | expressed protein |
Protein accession | XP_572957 |
Protein GI | 58271602 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GATCCATATA GGCTTCCCAG AATACTTCTC AACGTCTCAA CACCTCATCT TCGTAACCAT AATCCGCGCA CACCGCCAAA TATGACGGCC ACTCCTCCGG GCAGAATAGC CCTTCCGCCC AAACCAACAA TTACTGACAT CCTCGGCGGG TCTTCGGCTC TCGCGTCCGA CACACCATCT CCTCAAGCAC ATCAAAGCTT CAACTCTCCA CACTCTTCCG CCGTCAATGG GCATTCAAGA GGGACTATTC CGCGCTCCAA AGCTCGTACA AAGTCGGCTC GACTTAGTTT TTCTAGTTAT GGTAGAGGCA GAGCAGCCAG TTCTCGAGGT CCAGCGGGCG GAATCGTCGA CGATCTGCTA GATGATGAGG ATTTCGGACC AAAGTTGCCC ACTTTGGTGC CCGGGATCCG AGCGGCTTAT GCCACTCCTC TCCCTGCACT TCCTATGATT GTAATGTGTA TAGTGAGTGG CAACGGATTC ATAGATCTGC AATGACATGT AGCTAATCTG CTGCTGCGCA GGCAATGCTG TCAGAGCTAT TAGCGGCGAA CTTGTGCACT CCATTCATCC TCAAGCAGGT CGAAGGTGTG TCTACAGGTG TTTATATGTC TAGTAATGGC TTGCTGACTT AATCCCGCAT GGAAGGTTTC TTCCTTAACT CCGGTCGAGA AAAGAATAGT GAGACGGAAG CCGCAGTTGG TCTTTGGACA GGCAATCTTG TCTCTGTCTT CTTCATCACA CAATTTTTCA CCTCTCTCCT TTGGAGTAGC ATAGCAGATC GTCATGGTCG TCGTGCTGTG TTAGTAGCCA GTCTTGCCGG AAGCGCTATT GGTGAGTCAG TAATTGGGAT TCCGCTTGTG TTCTGTGACT TACGCCATGT AGCTCTTGTA ATTTTTGGTA CTTCTGAATC TGTGAGACTT GGGTCAATGT GTAAATCCAG CATGTAAGCT GACGCTATAA GTTAGCTTCC CGAGGCTATC TGTGTCAGAC TGATTCAAGG CATTTTCGGA GGTGCTGTCG GAGTAGTGAG TTTCCAGCTA ACTATTAGGA ATAGCTCTAA CAGCAGGTTG TCCAGTTCCG AGGTTCGATC CGAGACTTGA CTGACGACAC TAACGCCGCT CGTGCTTATG CTATGCTCGG TTTCTCATGG GGTTTCGGTG GTGTCATTGG TCCTATCATT GGCGGTGTCT TTGAAAGCCC AAAGGAGAAC TTCCCCGGTA CTGCTTTGGC TCAAATTCGT AAGATGACTG GTATCCTTAG TATGTCAAGA TGATGATCGC TAATTTTCAA TGTCTGCCAG CCCTATTCCA AAATTTCCCA TATATCCTTC CGACTATCAT TGGCGGTGGC GTTCTTGCTG TCGGCGCCAT CCTCTCTTGT CTTCTTTCTT GGGATGGAGG TGTTCGAGGC GGCAGGCGTA TCACCCTCGC AGTCGAAAAG AACGAACCCC TTGCCGGTAT TTCTCCTGCT GCTGGTCGTC ACGCTTCCCC TGCGCCTTCC AGCCGAACCG CAATCAGGGT CCCATCAGTC AGTTTGAGGC GAGTCTTGTC ACCCAGCCAA GAAGAAGAGG ATGCCGCTCA CGGTGCTGGA TATCCGGAAC TCTCTCGTAG TGCGGGCGGC CGAAGGGATA GTCGAGCGAG CTTAGGCACG GCTTATGGAT ACGGAGGTAT TCGTTCCAAG CACCCTACTT TGGCAGCTAG AGCAGCATTG GAGGCTGCTA GGAGGGCTTC AGCTGCAGTA GGCAGACCTG ACGAGAGTGA TGAAGAGGAC GGCGAGATGG GTAACAGGGC TCTGAAGGTT GCACAGAGGC TGTTGCTCGC GAATGAGGAG AACACATTTA ACATCAATGA CCTTTGGGTG TCTGCCGCGG TCGCCCAAGA CACTGCTGTC TTTGATGATG AGGAGGAGAC AGAGGATGAC CAAGAAGAGG AGGTTAACGA AGCCGTACCC GACACTTCTT TTGCCTCCCC TTCATTACAT GCACTTTCAC CGTCAACCTA CGATGGCCAT CAAGGTGACC TCTCCTTCCG ACGGTCTAGC AGAGCACGCC TCACAAGTGT GGGCAATATT TCTCTTCATC GTAACCTTCC CGGCCACCGA TTATCCGTAT CACATGGCGG CAGGCGATTC AGCACCACTT CAGGACACAT GCCTGCCATC TTCTCCAACA CTGGTGTCAA GACTCCGCCA GCTGTAGCAG CCGCATATGA AGCAGAATCG CCCAGACACG AAGCTGACTC ATTCTTCCGC GCAGCATCAC CTAGTCCCGA CCATGGCAGG GGATCCACCG GTGGCTTAAG CGCTATCGCT GAAGGGCCTG GAAATGCTGT GGACTCTGCG ACTGCCGGCG TTGCCGCTCA GATCTCTGAG AAAGAGGCCT CTTCGTTCTC ACTTTTGCCT GTTCAAGTAA TCATCCAGGT GTGTTCAAAT AATGACTTTG TTCGTCACAT AGAACTTATC AGTCCATTCC GATAGTATGG TCTCCTTGCT TTGCACAACA CTATCCATGA TCAAATATTC TTGTCATTCT TGGTAACGTA CGAGGCTTAC CTTTTGAAGT TTTGAGTAAT TGCTGATATG ATCATAGTCC CTACCGCTCT GGTGGTCTTG GGCTTAACCC GGCCCATTTC TCCCTGGTAG TCGCTCTCAT GTGTCTCTGC CAGCTCGTGT ATCAATTCTA TATTTACCCT CGACTTGGAC CCCCTCTCGG TCGTTTCACA CATCTTCAAA TGTTCAGGAT TGGATGTGCC CTCTACCTTC CTGCCTACTT CTCTCTGCCA ATCTTACACA AAATTGCTTC TCCTGACTCT GAAGGCAGTT TCTTTTTGAT GTTCTGTGGG TACTCCAAGT CACGACGAAG ACATTGGATC TGATTGATTC ATTGTAGGCT TGGTTACTAT CACTGCTGTG AGGTACTGTG CAGGTACATT CTCGTATACC TCCGTCATGG TACTTATTGT GAGTCGAGAC CCCCTACATC AAGTACGCCT GTTAACCCAT CCCTACAGAA TGCCATGTCT CCTCCCCACG TTGTTGGCCT TGCCAATGGA CTCGCCCAAA GCACCGTATC ATTCTCGCGT TTCTTTGGCC CTGTAATCGG TGGTGCTGTG AGTAATACCT TCCAGAGCAT TTCGTGGCAA CATACTGATG CCTTGTTTGT AGGTTTGGAG CGCCAGCATC AATGGTAATC CAAGCGGTTA TCCTTATGGC TTTTACTTCT GCACTGTAGC ATGCTTTATT CAGTGGTCAT TGTCATTTTT TATCCGTTAA TCAGCCATAC CCGGTGCTTG AGGAAAAGGT AGTTTATATT ATGGGCGCCA AGCGCGTAGA TCGTATAGTC GGGGGACCTT AATTGGTGAA CGACAAGAAT TTGTGAATCA TGGCAGACAT CATTAATCAC TTTTATGACC TTTTTTTTTG ACAGTAACGA ATATATGTGG ACTCTGCTCG CTAGAATGCG ATGGCGAGCA TATGTAGCAC ACTGATTTTT CTGATGATTT GAATCCGACG ACCAAAATAT A
|
Protein sequence | MTATPPGRIA LPPKPTITDI LGGSSALASD TPSPQAHQSF NSPHSSAVNG HSRGTIPRSK ARTKSARLSF SSYGRGRAAS SRGPAGGIVD DLLDDEDFGP KLPTLVPGIR AAYATPLPAL PMIVMCIAML SELLAANLCT PFILKQVEGF FLNSGREKNS ETEAAVGLWT GNLVSVFFIT QFFTSLLWSS IADRHGRRAV LVASLAGSAI ALVIFGTSES LPEAICVRLI QGIFGGAVGV FRGSIRDLTD DTNAARAYAM LGFSWGFGGV IGPIIGGVFE SPKENFPGTA LAQIRKMTGI LTLFQNFPYI LPTIIGGGVL AVGAILSCLL SWDGGVRGGR RITLAVEKNE PLAGISPAAG RHASPAPSSR TAIRVPSVSL RRVLSPSQEE EDAAHGAGYP ELSRSAGGRR DSRASLGTAY GYGGIRSKHP TLAARAALEA ARRASAAVGR PDESDEEDGE MGNRALKVAQ RLLLANEENT FNINDLWVSA AVAQDTAVFD DEEETEDDQE EEVNEAVPDT SFASPSLHAL SPSTYDGHQG DLSFRRSSRA RLTSVGNISL HRNLPGHRLS VSHGGRRFST TSGHMPAIFS NTGVKTPPAV AAAYEAESPR HEADSFFRAA SPSPDHGRGS TGGLSAIAEG PGNAVDSATA GVAAQISEKE ASSFSLLPVQ VIIQYGLLAL HNTIHDQIFL SFLVTPYRSG GLGLNPAHFS LVVALMCLCQ LVYQFYIYPR LGPPLGRFTH LQMFRIGCAL YLPAYFSLPI LHKIASPDSE GSFFLMFCLV TITAVRYCAG TFSYTSVMVL INAMSPPHVV GLANGLAQST VSFSRFFGPV IGGAVWSASI NGNPSGYPYG FYFCTVACFI QWSLSFFIR
|
| |