Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH01690 |
Symbol | |
ID | 3259214 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 655690 |
End bp | 658662 |
Gene Length | 2973 bp |
Protein Length | 674 aa |
Translation table | |
GC content | 51% |
IMG OID | 638258319 |
Product | conserved hypothetical protein |
Protein accession | XP_572346 |
Protein GI | 58270380 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.887535 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCAGACA CACCGCCACA CAAACACCAC TACACCTCCC TCCGCTCCCT CATGCACAGC CCAGATCCCG GCAACCCCGC CAAGCGTGAC TCCCTCACCC CGCGCCAGCC ACGCCCAAGT CGCAACAGCA GCAGCTCACG CAGCCGCGAA CGCAGGCTGG ATCCTCCCAG AAAGGCAGAC AAGGACAAGG GCCGCTTATC GACATGGAAA CTCATGTGCC TCACCGTGTC CATGGGCGGG AGCCAGATCG CGTGGACAGT GTACGTGCTC AACGATCACC TGCAGCTGCC TGCTGACCTG CCACCAGGGA ACTGGGATAC GGAACGCCCT ATCTCCTCTC GCTCGGCCTC TCGGAACAGC TCACTTCTCT CGTCTGGCTC GCCGGGCCTA TATCCGGCCT CATCGCCCAG CCGCTCATCG GTGCCATTTC AGATTCCTCG CATTCTCGCT ACCGGAGGCG GTACTGGATT GTAACATCGA CCATGCTGCT CGTATTTAGC GGTCTCGGTC TGGCTTTCAC AGAACCTATT GCCAAAGCTT TGGTCGATTT GATAGGCGGC GGCCAAGGCG ACTGGGATCC AAAGACTATC AGATTGGTAT GTCATTCAGT TTTGCCGCCT TGTAAACGAT CAAAACGGGA TTGTGGCTTA CACATGCAAT TAGGTGAAGA ACACGGCTAT CGGTATAGCC GTCTTCTCAT TCTACTGCCT TGACTTCGCC CTTAATGCGT GAGTAATCCA TGCGCCTTAT GCATCAGTCT GATATCCCGG CGCAGTCTCC AAGCTTCCCT CCGCAATCTT GTCCTCGACA TCACGCCAGG TGAACAACTT GCCACCGCCA ATGCTTGGCA TGGACGCTTC AACCATGTGG GTAACATTGT GGGGTTCACC ATGGGTTCGT CCGTCTCCTT GATCCCAGAC CGGCTTCTTT GGTCTGAATA CTCATCTGGA TACTCTGCAG GTTTCTTAAA TCTCAGTCAT GTACCGATTA TCCGTTTGGT CGGAGGCGGT CAGTTTCGTA AAGGTACGTC TGGCACCCCA CACACCTGCG TTTAATAGCT GACATTATAT CCAGTATGTA TCGTGGCGTT GGTACTGTTG GTCATGACCG TGTGGATCAC GTGTTGGACG CAAGAAGAAA AGGAAACGGA TAGTATTTTT GGCGAAAGGC GCTCGTACGT TGTTATATGC TCGCGACTCT GTCATACGCT GACATCAAAC ACCCCTATAG GAAAATACGA GATGTAGTGG GTACAATCTA CGAAGCGGTA CTCCATCTTC CAAAACCCAT TCGCCGAGTT TGCATCGTAA GTCTTTCCCC CATACCTTCT TTCCAAAGAC CACTCAAACC TAAAATTGCA TCGGGCGCGG AAAGGTACAA ATCGCCGCAT TCATGGGATG GTTCCCGTAT CTTTTCTACT CTACCACTTA CGTCGCCGAA GTCATGGCCA AAGAATTACA TCATAAACCT GATATCGATC GAGCCACCCG AGCTGGTAGC CTGGCCCTTT TGATCTATTC TTTCGGTAAG TGCTGCGTCT TTCTATATTA CTATACAAAT TTGCTCAACT GCCAACACCA TTTGATCAAA AAAAAAAAAC AGTCGCCATC ATCGCTGGGA CACTTCTCCC TTACCTCGCC GCGCGAGATC GCCGACTGCT CAAACCCACT TCGGAAAAAC TGCGAGATGG CGAGATTGAG ATTGAAAACG AAGATGAAGA AGATGAAGAG CATGTGGAGA TGGAGAGGAT CAGAGAGATG GTACAGCAGT GGAAAGCCGA GGCTGCTAGA GAGGGAAGAC CTCTGAAATT GCCTACTAGT AAGTTGCAGC TTCCGTTAAA ACACACACAT ACGGCGTGTG CTGATATATC ATCCGGTCAT TTCCAGTGCC GTTCATGTTG AGAAATATTT GGACGGCGGG CTTGGTCATT TTCGGGTGCT TGATGATGTC CACGTTCTTC ATCACAAAGG TCTGGCAAGC AACAGTGATG ATCGCTTTGG TAGGCATCTG TTGGGCTATT GCTTGTTGGG TGTGAGTGGT TTTATTTTGG TTGAAAGGCA AAAAAAGGTT TCAAGAACTG ATGGAGAAAT TGGCGTCTTA TTAGACCGTT TGCGTAAGTG TTACGATTTC ATTTTCTTCC TCAGTGTATA TTTACTCATG AACAAATGAT CGATGGATAT CGCAGAATCA TCATGGAGGT CAGTTTTGGA TTTCCACTTA TTAAACCGCA CAACCCCCCA GACCCAAACC GATTGACTGA CAAACACCCC CAGTTCCTCA AAGAGCTCGA CGACAAGCCT CCTCCCCGAA TATCAGACGG TCGCCCCCGT CCCACCCACG CGCGCACGGC TTCCACCCCC CTCGGCTGGC GATCACATCC CACCAGCCCT GCAGGCCGCG CCTCCCCCGA CGAACGTACA CCACTTGCCA GAAGCTACTC GACCGCCGAT CTTGATGGCG CTAATGAAAT GGAATATACC GGCCAGGGAC CAGTAGCTGG CGGGACTATT ATGGGTATCC ATAACCTCGC CATCGTTTTC CCTCAATTTA TCGCAAGTCT AATTTTACCT CCTTTCTTGC GTTATCAGCT AATAGTGGTT GTCGTGCGAT CATTTTTTTT TTAGATTGCA GTCGTAGCCT CTATCATTTT CAAGCTAGCC GACACTCAGC CCGACATCCA GCCCACCTCG CCCGAAATCG GTGGACCGCA CGGTCAAGAT AAGAATGGAG TCGCCTGGGT CCTCCGGTTC GGCGGGTTGA TGGCGTTTGT GGGGGCTTTG GTATCGAGGA AAGTACCGCC TACCAAGACT GAAAAGGCGA TGAGGAGGAG ATTGGCGGAT ATGAGGGAGG AAAGTGCAGA GTGAGAGATG TAGGCGGCGA GCGAGAGAAA TGTCAGAGTG GAAAGGTCGG ATTGGGACGT GTTATGTGTA TATATGCGGA GGAGAGTTGA CCGAGTTTAG TCTTGTTATT TAC
|
Protein sequence | MPDTPPHKHH YTSLRSLMHS PDPGNPAKRD SLTPRQPRPS RNSSSSRSRE RRLDPPRKAD KDKGRLSTWK LMCLTVSMGG SQIAWTVELG YGTPYLLSLG LSEQLTSLVW LAGPISGLIA QPLIGAISDS SHSRYRRRYW IVTSTMLLVF SGLGLAFTEP IAKALVDLIG GGQGDWDPKT IRLVKNTAIG IAVFSFYCLD FALNALQASL RNLVLDITPG EQLATANAWH GRFNHVGNIV GFTMGFLNLS HVPIIRLVGG GQFRKVCIVA LVLLVMTVWI TCWTQEEKET DSIFGERRSK IRDVVGTIYE AVLHLPKPIR RVCIVQIAAF MGWFPYLFYS TTYVAEVMAK ELHHKPDIDR ATRAGSLALL IYSFVAIIAG TLLPYLAARD RRLLKPTSEK LRDGEIEIEN EDEEDEEHVE MERIREMVQQ WKAEAAREGR PLKLPTMPFM LRNIWTAGLV IFGCLMMSTF FITKVWQATV MIALVGICWA IACWVPFAII MEFLKELDDK PPPRISDGRP RPTHARTAST PLGWRSHPTS PAGRASPDER TPLARSYSTA DLDGANEMEY TGQGPVAGGT IMGIHNLAIV FPQFIIAVVA SIIFKLADTQ PDIQPTSPEI GGPHGQDKNG VAWVLRFGGL MAFVGALVSR KVPPTKTEKA MRRRLADMRE ESAE
|
| |