Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNG04290 |
Symbol | |
ID | 3258740 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 1210995 |
End bp | 1213746 |
Gene Length | 2752 bp |
Protein Length | 730 aa |
Translation table | |
GC content | 47% |
IMG OID | 638258053 |
Product | conserved hypothetical protein |
Protein accession | XP_572101 |
Protein GI | 58269890 |
COG category | [E] Amino acid transport and metabolism [G] Carbohydrate transport and metabolism [R] General function prediction only |
COG ID | [COG0697] Permeases of the drug/metabolite transporter (DMT) superfamily |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.0256394 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | CTGAACTTCT CAAGTCGCGT TCCGATTTGA GAGTACGACT CAGCCTTCTC AGTAGCTCGT ACGTGAACAC GGACCATGTC TTCATCGATT TCAGCTTCCT CGACAACGTC AGCAACAGCG TCCGCGAGCA CGTCGGCAGC AGGCATCGTT GATTCTAGCT CAAACTCAGT TTTCAAGCTC GTCGGCATCT GTCTTGCCGT CGGTAGCGGT CTTCTCATCG GAACAAGTTT TGTGATCAAG AAGAAGGGGC TAATCAATTC CACGGAAAAG TATGGGAACC AAGCAGGAGA GGGCCATGGG TATCTGAAGA GTTGGATATG GTGGGCGGGA ATGTTGACAA TGATTGTTGG AGAGATCTGC AACTTTGTGG CTTAGTGAGT GTAGCTTTGG CCGAAAGAAG AACCATGCTG AGAGACGAGT AGTGCATTCA CAGAAGCCAT TTTGGTGACT CCCATGGGTG CCCTCTCAGT AGTCGTGGCC GCTATATTAT CACATTTCAT GCTGAAAGAA AAGCTCACGT TTTTTGTGAG TCCAAAATGG GACCAAGTTG CGCAAACTCA CCCTCTTACA GGGTTGGATA GGATGTACAC TTTGTATTAT GGGAGCCGTC ATTATCGCTC TCAATGCCCC GGAAGAGCAA TCTGTCACCA CAATCAATGA GTTCAAGAAG ATGTTTTTAT CTGTCGGATT CCTCGTTTGG GCTTCACTTT CTATCGCAGC AAGTCTGGTG GTGGTATTCT TTGTCGCACC AAAATACGGG AAAAAGAACA TGATGCCGTA TATTAGTATT TGTTCTTTGA TTGGAGGTAT CAGTGTCAGC TGTACTCAAG GGTTGGGTGC AAGTATTTTG ACGAGTATCC AAGGCGATAA CCAAGTGAAG AACTGGTTTT TTTGGTTCTT GTTTGTTTTT GTTATCGTCA CACTATTAAC CGAGTGAGTT GACATCATCA TCTGTCATTT ACTGTATGTT AACATAAATT AGGATCAACT ATTTGAACAA AGCTTTGGAG CTCTTCAAGT GAGTGTAACC CTTTATGGCT TAACACGGCT AACAGCTTGT AGCACTTCCA TGGTCGTCCC AGTATACTTT TGTTTCTTCA CTTCCGCAAC CCTCATCACC TCGTTTATCC TCTACAAAGG TCTCAAAGCC TCAGCAGTTA CGCTGATTAC TATGGTCCTT GGCTTCCTCG TCACGTGTCT TGGTATCACT CTCCTCCAGC TCTCCAAAGT TAATCCCAAA GAGCTTGCCA ACAAACTGGA TCGCAAATCC ACTATACTTA TGGAGGCATC AAGACATCAG ACAGAGGATG CAGAGAAAGG TCAGGTGTCA AGCTACGAAG ATCCGGGCAT GGATGCGCTG AGAGGAGGTT TCGGTGCAGT GGGCAGTATC ATCAGAGCAA GGAGTGTGAA TAGACGCATG AGTAATGCTA GCACCCTGAA CGGAGGGAAG TATGGCGCTG CCAACTTGTC TACCCACGGT TTGGAACATT TACCGAGATT CCAATGTAAG TACTATCTCA CTAGGGACCA AGAGTTATGC TCATACGATG TAGTGTCTGA CAATCCCATG CCTTCCGATG CTATGAGCCA AATTTCATTA CACTCGGCCA AGTCTCCAAC CATTTTGAGT GGAAAAAGTT TTAGAAGTCC ACATGACCAG TACCCCTCTC CTCAGCGGTC AAAGAGCATC AAAGTATGTT AATTCTCGAG GGCTCAGTAG CACAGCTGAC AATCTCTTAG TTTGAGGAAG GCGACATTGT CCACCAATAT CATTACAACC AAGGACCCGA CCAAGACGCC ATGCACACCT ATCTTCCTTT CTCGGCTGAC CATCACAACG GTTACCGCCA CAGCTCTGGT CCTATTTACC CACCTGTCAT GGAAGAAGAC GAAGATGCAT ATGAGAAGGA AGTCGAGGCA AACAGGGAAA ACGGATCGAG CCGTACAAAG GTGCACGTTG ATGGTGCCTC AGAAGAGGTG TTGGCTGAAC CTCTGAGCTA CAACGACCCC TACGCTATCT ATCCCGCGCG AGCCAATGTA CCTCAGATTG CACGGCCGTC GGGCACGAAT AAAGGTCTCA GCGGGCTATT TAGCTTTCAG TCTGGCCCGG ATCTCTCCCT TCATATCCCA CATATCAACC GAAACCGGGA TAGCAGCAAC GAGAAATCTC CCAGACGGGA GCACCACGAT TATCCGACTT TGTCGAAGAA GGACAGAAAC AATGCACAAG AGCGGGAAGA GAGGCAAGCG TTGGTAGGAG GAGATGATGA TGAAAGTGGA GGAAGGCATA GGAGTGATAG CAATGTGAGC TCATCACTGA GTGTTAGCGA TGGCGAGACG AGGGAGAGTG AACAGCCGCA GAGTGCTGTG TCAACGCAAG CAACAATAGG AATGTGTACC AGAGTGCAGA TAGGGCAGGG TTCACCTGCA AGACAGGGAG CGTTTACGTC GAATGGTACT AGTGGTGCAG TGCCAGAACG AAGAAATTCA CCAAGAAAAT TACCGGATTT GCCAGGTTTT GCGGCTGGAT CGCCCTACGG ATCTTCGTGA ATTATGGCAC ACGAGAAGAA GGCTCATAAC GTCATGTCTT TTTGTCTTTT TGGGAGGGTT TGCAATGTAG GACAGAAGAG ACTGGTTTGG TTACAAGGGT TTGCCACATT AGTGTACTAT TTATGGCCAT TTTAAACGAG CTTAGTGTAT CAGAGCTAAA TTAAATTAGG CATCTTGGGT TATGGGGATG AACAAGTCAA AA
|
Protein sequence | MSSSISASST TSATASASTS AAGIVDSSSN SVFKLVGICL AVGSGLLIGT SFVIKKKGLI NSTEKYGNQA GEGHGYLKSW IWWAGMLTMI VGEICNFVAY AFTEAILVTP MGALSVVVAA ILSHFMLKEK LTFFGWIGCT LCIMGAVIIA LNAPEEQSVT TINEFKKMFL SVGFLVWASL SIAASLVVVF FVAPKYGKKN MMPYISICSL IGGISVSCTQ GLGASILTSI QGDNQVKNWF FWFLFVFVIV TLLTEINYLN KALELFNTSM VVPVYFCFFT SATLITSFIL YKGLKASAVT LITMVLGFLV TCLGITLLQL SKVNPKELAN KLDRKSTILM EASRHQTEDA EKGQVSSYED PGMDALRGGF GAVGSIIRAR SVNRRMSNAS TLNGGKYGAA NLSTHGLEHL PRFQLSDNPM PSDAMSQISL HSAKSPTILS GKSFRSPHDQ YPSPQRSKSI KFEEGDIVHQ YHYNQGPDQD AMHTYLPFSA DHHNGYRHSS GPIYPPVMEE DEDAYEKEVE ANRENGSSRT KVHVDGASEE VLAEPLSYND PYAIYPARAN VPQIARPSGT NKGLSGLFSF QSGPDLSLHI PHINRNRDSS NEKSPRREHH DYPTLSKKDR NNAQEREERQ ALVGGDDDES GGRHRSDSNV SSSLSVSDGE TRESEQPQSA VSTQATIGMC TRVQIGQGSP ARQGAFTSNG TSGAVPERRN SPRKLPDLPG FAAGSPYGSS
|
| |