Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNL04830 |
Symbol | |
ID | 3254807 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006681 |
Strand | + |
Start bp | 352509 |
End bp | 355337 |
Gene Length | 2829 bp |
Protein Length | 801 aa |
Translation table | |
GC content | 51% |
IMG OID | 638253954 |
Product | conserved hypothetical protein |
Protein accession | XP_568024 |
Protein GI | 58261228 |
COG category | |
COG ID | |
TIGRFAM ID | [TIGR00803] UDP-galactose transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.269217 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGTGTTC ACATCGTTCA CATCGTCCAA ACACCCACAT ATCAATACAT ATTTGAGCAA GTGAGCCATT CCTGTAGCTG TATTTATTCT CCGCTCTGTG GACCGAGCCT CGCCCAAGTC ACCCTTTCCA CCCACGCCGT ACAGCCTGCA CGCGGCCAGC CACATCCAAT TTCATCTCCA CTTTCATCTC TCGATTCTTC TTCACTCCTA CCAGTCTCCA TAAACAACCT CTCCGCACGC AGAACCGACT TGCCGGTATT TTAATTTTTC ACAGTCAACA TTCACTGGTT TGGGAAGACA GCCGCGCACA AATATATACC GCTTTTCCAC ATAACCGACC TTGACGGTAG TCAATAACGG CAACTTCCTG TAGTCTTGGA TTCCCTGGTA TGGCCCATCG AACCAACACT CGATCTCCAT CGGGGTTGAA TCGCCGCCCG ACGGACCGAT GTACGTCCCA GTCTGGGTCC CTTCAAACCG CATCCTCGCC CGTGCAATCG ACATTTGGAG GGAGGATATA TGCTGCTCCG GAGGAGGACC GAGGGTTAAT AATGAGAGAT CGAAGCGAGA GAGATCGGGG AGAGAACGAC TGGGCTGACG AGAAGGGGAA CAGCATGGGG AAGACGCGCG GTATGGATGT GGGAGTAACC CGGCCTAAAA GGAGGCTAAG CAGGTAAGCC ATCCTCCCTC CGCCCTTCCC ATATGTCCAA GTGCGATCTG TGCGCTGGGC TAGGAATGTT TTGAGCGGGT CCTTTTGGCC CTGGCTCTGA GGCATTTCCC CTTCCTCCAT CTCCATTGCA CAATGTCTTA CTGACCTTAG TCTTCCCAAT CTCCTGGTCC CGCCTCCGCT CCACAGAGCC ATGTCCTCTA CCTCTGTCAG CTCTCCGCCC CAGTCTCCCA TAGGTCAACC AGGGCATGGC TCATACCATT CTCTCCAGCA ACCATTCCAT AACCAGAAGG CCCATGGATC ATCTACTGCG AACAATTTTC CTTCTTACAG CCGGAGTAAT CCGCCCAAAT TGGACGATAA GGTGGGCATG GTGGGCTATG CTACTGCGAT GGCCGCCGCT AGCAGAGAAA AGGAAGGGCC GCCTTCTCTG TGGGGGATAG AACTCAAGTG GATCTCGTGA GTGGCTGCAT GATAGCTGGC CAGACCACCA TTAACATCAG CCTTTAGACT GATCACGCTC GCTCTTCAAA ATGCATTCCT CACCATCATC ATGCACTATT CACGGATATC TACCGCTCCC AATCGCACAT ATTCCGCTGC CGCCGCAGTT TTGCTCAACG AGCTTCTCAA AGGTGGCATC TCAGTTTTTA TTGCTCTCAA ACGTATCGAC AATGAGATGA CTGCATCTCC TCCTCCCCCG GTCTATTCAG AAAAGCTTGA CGATAAGGAT TTCGACAAGC GATCTGGACA AAAGCTCCCT TCGATCATTC ACCCCACGAG ACTGCAAGCT CTATCGAAGG CGGTATTTTC ACCCGACTGC TATAAGCTTT CTGTTCCCGC CATCCTCTAT GTCATCCAAA ACAATCTTCA ATACGTCGCC GCATCAAATC TCGACGTCGC AACTTTCCAG GTCACATACC AGATGAAGAT CCTTACTACT GCGTTCTTTT CAGTTCTCTT GTTGCGCAAA CGACTCTCTC GAACCAAGTG GGCTTCCTTG ATTCTCCTGG CTATCGGTGT TGGTATTGTT CAGATCCAAT CCTCTTCAGC ACCTGCTGCA TCTCACCACA CCCACGTCAC TGTCAGCCAT GAACGTCAGT TGCGATCGGA GATTCCGGTT TCTGATGAGC CCATCATGTC CCCGGAAAGA GTGATGCATC CTGTCAGGGG ATTCGTCGCT GTTACACTTG CATGCATGAC CTCAGGTCTT GCGGGTGTGT ACTTTGAATT TATCCTCAAA TCGTCTTCTG GGTCCAGCGC ACCTGATTTG TGGGTGAGGA ATACCCAGCT GTCCTTGTTC TCCCTTGTCC CTGCGTTGGT ACCCATTATC GTCAACCCTT CGGGGCCGAA TGGCATGGGT TACTTTTCAA AAGTGATGTC TTGCTTCGAC AACTTCAACG GATGGGCGAT TGGTACAGTA TTGACTCAGA CTTTTGGTGG ATTGATTACT GCGTTGGTCA TCAGATATAG CGACAATATC ATGTGCGTTT TCCGTTTTGG TATCTTTGCC CTTTGCTAAT GGCCGATTAC AGGAAAGGAT TTGCTACGTC TCTTTCCATC ATTATCTCCT TCCTTGCCTC AGTCGCTCTC TTCTCCTACC CCATCACTCT TAGTTTCATC GTCGGTGCTT CTATTGTTCT TTTCGCTACC TATACATACA ACAGCCCCGC TCCACCTGCC TCTTCTACTC GCAAAGAAAT CGCAGTCCCT GGCTCGCCCA TTTCCACTTC TGCACCTATA CTGGGTGAAC CTGAAAAGCC TAGTCGTGCG TCAAGCGTAA TCAATTTGCT TGGCTTGGGA TCCAACAATG GGTCCAGAAA GCCTAGCGTT TCAGACATCA AATCATATGC CTCTAGTCAG TTGGGCTTAT CGTCATACCC CGTGTCTGCC TCTGTATCAG CACCCGGTAC ACCGAGGACA AACATGAACG ATTATGCGGA AAGCGGAAGG AGCAGCCCTG CAAGCTTTGG TGCCGTACAG ACGAGTCATG GTGGATCTGG AGCTGGATTT GGTCGGGGTA ATGTGGGGGA TAAGGTCAGG CCAATCTTGA GTTTGGACAT TGATAGAAAG CATGGTTAAG GGTTTGGTAT CTGCCTGAGA ATTTTTTTAC AAGGGTTTCA AGTGGGTTTA GGTGCAAGTG GCAATGGTAA TGTGGAGCC
|
Protein sequence | MSVHIVHIVQ TPTYQYIFEQ VSHSCSCIYS PLCGPSLAQV TLSTHAVQPA RGQPHPISSP LSSLDSSSLL PVSINNLSAR RTDLPSITAT SCSLGFPGMA HRTNTRSPSG LNRRPTDRCT SQSGSLQTAS SPVQSTFGGR IYAAPEEDRG LIMRDRSERD RGENDWADEK GNSMGKTRGM DVGVTRPKRR LSSLPNLLVP PPLHRAMSST SVSSPPQSPI GQPGHGSYHS LQQPFHNQKA HGSSTANNFP SYSRSNPPKL DDKVGMVGYA TAMAAASREK EGPPSLWGIE LKWISLITLA LQNAFLTIIM HYSRISTAPN RTYSAAAAVL LNELLKGGIS VFIALKRIDN EMTASPPPPV YSEKLDDKDF DKRSGQKLPS IIHPTRLQAL SKAVFSPDCY KLSVPAILYV IQNNLQYVAA SNLDVATFQV TYQMKILTTA FFSVLLLRKR LSRTKWASLI LLAIGVGIVQ IQSSSAPAAS HHTHVTVSHE RQLRSEIPVS DEPIMSPERV MHPVRGFVAV TLACMTSGLA GVYFEFILKS SSGSSAPDLW VRNTQLSLFS LVPALVPIIV NPSGPNGMGY FSKVMSCFDN FNGWAIGTVL TQTFGGLITA LVIRYSDNIM KGFATSLSII ISFLASVALF SYPITLSFIV GASIVLFATY TYNSPAPPAS STRKEIAVPG SPISTSAPIL GEPEKPSRAS SVINLLGLGS NNGSRKPSVS DIKSYASSQL GLSSYPVSAS VSAPGTPRTN MNDYAESGRS SPASFGAVQT SHGGSGAGFG RGNVGDKVRP ILSLDIDRKH G
|
| |