Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNF01220 |
Symbol | |
ID | 3258227 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006691 |
Strand | + |
Start bp | 366843 |
End bp | 369745 |
Gene Length | 2903 bp |
Protein Length | 529 aa |
Translation table | |
GC content | 46% |
IMG OID | 638257245 |
Product | hypothetical protein |
Protein accession | XP_571265 |
Protein GI | 58268218 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.904942 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TCTGTCTGGA CTGTTTTCCA ATACCAAACA GGGTTTAACT GCCTTACAAA TTGTATCAGC CGATAGGCAA GTCTGGAAGG GTATAAGTGT CTTCGGTATC CACCATTCTG GTGAATCCCT ATGTTGCTAA AGACTAGCCT GACAGCTAAT TACAGGCGTA CTCCTTAGTC ACCATCGCAC CTCGAAACTC ACGATAACCA TATACACATC ACGAAGCTAC GATTTAACAT GTCTGCCGAG ATTAAAGGAG AACATTTGAC ACATCACTAC TTGACCAATG CCCAGCTGGA CGACTACCCA GATGAGAAGC AGGTTGTCGG GAATGGAGAT TTGGTGACGG TAGAGGAAGT CAAAGTCGCT GAAGACGCGT GAGTGATGAC TGTAGATTAT TGATGCAAAC CTATCATTAA TATGGTTTCC AAGCATCCTG GCGGGGGATG AATACACGGA AGAGCAATAC AAGAAGCTCA AGCGCAAGGT AGACTGGGTC CTTTTGCCTC TGATGTGGTG GTGTTACGGT ATCCAGCAGA CGGACAAAAC TGGGTGCGTT TTTGTTTCGC CCATAATTTG CTATGGGATG ATACTGACAA GAAACTATGA TAGTCTCGGT ACAATGGTAT GTCAGCATAT GCGCCCGTGT GATTCTTCTC TGACGACTTG CAGAACCTGT ACGGCGTGCA GGCTGACACT GGCATGCACG GCAATCAATA CTCCCTGTTA ACCGTGGTGT TCTGTAAGTA AAAAATCTGA TAGACAACGA CCAATCTACT TATGGTGTCA CGCAAAGATA CGGCCTATGC TGTCTGCGAG TTCCCTTCAA ATTTTCTCCT TCAACGTTTT AGTAAGTTTA TAAGTAACCA GTAGCTGTAG TATCGAACTG ACAAAACGTC AGACATGGGC AAATGCTTGA CCATCTACAT GTGAGTGCAT GTTTGCATCC GCTGGACCTA GAGCTGATAT CTACCAGGTT CTGCTGGGGT ATCATTGTCC TTGCACAAGG TTTCGTCAAG TCCTTTGCGC CTTTCCTCGT TTTGCGACTG CTCCAAGGTG CTTTCGAATG CACAATCAGC CCCGGTTTCA ACCTCATCAT CGCCAACTGG TACACATCCC AAGAACACAA TTCTCGCTCC CTCATCTTCC AGAGTGCCAA CGCCGGCTGG GGTATCGTTG TCAGTTTGAC AATGTACGGT ATTGCCCAAG CTGCCAACAA GAACCCTGGT GGTTTCGCGG CATGGCGAGG GATTGCCGTC TTCCTAGGTG GTCAAACTTT GCTTGCTGCT GGTGTAGCCT TCTTCTTGCT CGGTACCCCC AACGAGGTCA GGTGGCTTAA GGCAGAGGAG AAGAAGATCG CCTATGCCAG AGTCATGAAG AACAACGCTG GCACTGACAC GACTGGTAGA AAGACCTGGA AGTGGGGCCA AGTACGCGAA GCGTTTTTAG ACCCTGCATT GTACTTTCAG TTTATCAACG CCTTTTTGGT CTCTGTGGTG GGTACACTCA TAGCCGCGTG ATCGTGTTAC TTGAATATTC ACGCCTTTTA GTGTAATGGT GCTCTTACCA CCTTTGGTGC TGTCATCACT CTATCTTTTG GCTTGTAAGA CCCATTGATC TTTTCCGCCT CGGAGCTGAC GAAATACAAT AATAACAGTT CTGAGAGTCA AGTCATCTTG TACGGTATAC CTCAAAATGT TGTCTCTGTC CTTTGGTTCG CCTTTGTCGG TTTCATGACA CTCAAGTTCA AGGGACTCAG GATGTACTTT ATGATGATCA GTGTCATCTT CCCCTTCATC GGTGTAAGTC ATCGTCCCCA ATTTATTATT CTCCTTTGGA TCGGTAACTG AATATAAACG ATCGGTAGCT TCTTTTCATG GCTTTGCTTC CTGAGGACAC CAGCTACCGA TGGACCAAGT GGGGCATGTA CTTTATGACT GTCACGTAAG CCTGTCCTTG GCGTCATGCT ATCCAAGTAC CTTGCTGACC ATTTTCAACA CAGCTTTATT CTCCCTCTCT TCTCTGGATG GGCTCTCATC TCTTCCAACA CTGCCGGTCG TACCAAGCGA ACTGTAATGA GCTCCACGAC CTTTATCGCC TATTGGTGAG TTTTCTTCCG TCTCTATTCT CTTAGGACAA CGTAGGTATG TAGTGACTAA TAAACACGAT ACTTTTAGCG CTGGCAATAT TGCCGGTTCT GAAGTTATGA AGTCCAAAGA CGCCCCACAC TACATTCCCG GTACCGTACG TTTCCATCTT TCCCTCTCTT CTGGCCTTAC CTGCTAACGC TCTCTTCCGC CACTTTTCTC AACTTATCTT TTAACTAAAT TCACACACAA TGCAGATCGC CATCGCATGC TGTATGGGCG TTGAATTCGC CACGCTCATT ATATGGCGTA TCTATCTCCA ATACTGTAAC AGGAAAAAAA CCAGGGCCAT AGCTGAGATG GGGTTGAGTG AGGAGGAGAT TACGAAGAAG GGACAAGCGT TGGGTGCCGA GGATGCGACG GACATGAAGA ACCCTTTCTT CCTGTGAGTA TATAAACCTG AATCTTCATC GATCTGTTTA AAAAAGGAAC AAGGCTGATG GGTTTGATAG CTATTCCACC TAGTTCTTTT GGGATCTCTC TTGAAGCAGT TGGAGGAGAA GATGGATGTG CGAGAGGGCC ATGGAAGCTT CAGTTGTGTT GAAGGTGACT AACAAAGTTG TACTCTTCTG GATCACTATA GAGTTGTAAA GTCGAAGTTG TGCTGTCTAG TTGTTATCCT TTTTCATTTG TTAATATACC GAAAATTTTG TAGATAGGGA AAATTTTTAA TTGTTCAACG GGAAATGGAA TAGATCAAAT GACTGACAAT GTAAACAGCC ATCTGATATA TAGCTGAATT GGGACCAGTG CAGTGTACAT GCG
|
Protein sequence | MSAEIKGEHL THHYLTNAQL DDYPDEKQVV GNGDLVTVEE VKVAEDAILA GDEYTEEQYK KLKRKVDWVL LPLMWWCYGI QQTDKTGLGT MNLYGVQADT GMHGNQYSLL TVVFYTAYAV CEFPSNFLLQ RFNMGKCLTI YMFCWGIIVL AQGFVKSFAP FLVLRLLQGA FECTISPGFN LIIANWYTSQ EHNSRSLIFQ SANAGWGIVV SLTMYGIAQA ANKNPGGFAA WRGIAVFLGG QTLLAAGVAF FLLGTPNEVR WLKAEEKKIA YARVMKNNAG TDTTGRKTWK WGQVREAFLD PALYFQFINA FLVSVCNGAL TTFGAVITLS FGFSESQVIL YGIPQNVVSV LWFAFVGFMT LKFKGLRMYF MMISVIFPFI GLLFMALLPE DTSYRWTKWG MYFMTVTFIL PLFSGWALIS SNTAGRTKRT VMSSTTFIAY CAGNIAGSEV MKSKDAPHYI PGTIAIACCM GVEFATLIIW RIYLQYCNRK KTRAIAEMGL SEEEITKKGQ ALGAEDATDM KNPFFLYST
|
| |