Gene Information |
Locus tag | CNB01000 |
Symbol | |
ID | 3256101 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006684 |
Strand | - |
Start bp | 301086 |
End bp | 304024 |
Gene Length | 2939 bp |
Protein Length | 597 aa |
Translation table | |
GC content | 44% |
IMG OID | 638254751 |
Product | hypothetical protein |
Protein accession | XP_569106 |
Protein GI | 58263392 |
COG category | [S] Function unknown |
COG ID | [COG5594] Uncharacterized integral membrane protein |
TIGRFAM ID | |
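The coordinates and lengths in the table above can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes that Start bp and End bp are 1-based inclusive coordinates (which reproduces the listed Gene Length) and that the stop codon is counted inside the gene span. The helper names `span_length` and `coding_length` are hypothetical and not part of any database API. The record marks the gene as spliced but lists no exon coordinates, so the non-coding remainder is only an estimate of total intron length under those assumptions.

```python
# Values copied from the Gene Information table.
START_BP = 301086
END_BP = 304024
GENE_LENGTH_BP = 2939
PROTEIN_LENGTH_AA = 597


def span_length(start: int, end: int) -> int:
    """Length of a 1-based, inclusive coordinate range (assumed convention)."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases needed for a protein, optionally counting the stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


gene_bp = span_length(START_BP, END_BP)
cds_bp = coding_length(PROTEIN_LENGTH_AA)
noncoding_bp = gene_bp - cds_bp  # estimate only: assumes no UTR in the span

print(gene_bp == GENE_LENGTH_BP)  # True
print(cds_bp)                     # 1794
print(noncoding_bp)               # 1145
```

Under these assumptions, about 1145 bp of the 2939 bp gene span are not translated, which is consistent with the "Is gene spliced | Yes" entry.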
Plasmid Coverage information |
Num covering plasmid clones | 23 |
Plasmid unclonability p-value | 0.77991 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
Sequence |
Gene sequence | ATGGTTTATC TTCTTTTTCT CCGACTTTTG AAATACCTCT TCTCCGCAAT GTCCGTTCTG GCAGGTCTTT TGGCCATCAC AAACTACTAC CTCAACACGC AAACCACATA CGGCAGCACG AGTACGATCT CTTCTTCTGG GGGCGAGGAT GATATAACAA AACGAGACAG TCCAAGTTCA TCAAGCTCAG AAACTTCGAA CAACACCTCC ATAATAGATA ACCCCCAGCT ATTAACTGCT GCCAATGTCA CCAGCAATGG TCTTTTGGTC CACATCTCGT TCGAATGGAT CGTGACAATG ATGATCATAG TCTTTGGTTA GTTACATATT TCAACGTGAA AATCTTCGTG ATTGACTAAC GCGAAACAAT CTGAAGTCCT CAAGACATCT GCTCATCACT TGAAAATTGT GCAAGAATGG ACTCATTTGT AAGCTCTTGA TACCCATTCA TGATTTTGGG TCCTAACGGC AATCCAGGAA TTATAATGAA GTCGCTTTCA AAACGTTGAT GATCACCAAT CTATCGCTTC GCCAGAACAA AGCAAAATCA ATTGTGACTG TTGCAGATGC CAAACGTGAG ATAAAATCTC TTGTTCTCGG TGCAGAAAGG GGCAAAATCG ATGCCAGGGT TTGGTTCGCC ATACATAACA TGAATCCTTT GCATGAGAAG ATGGAAACAT TCAAGAAGAA GCATTTCAAC TTGGCGATTA AAGCCGTTGC CATGGAAACT TTCCATGGAA GAGGTGCGGG AGTTCTTTAC GATAGCTGTA CAGGAAGAAT GTGTGGGCGA TCCAAAGTAA ATCTTTCAAT TATCGGATGG AAAGTTGTTA CTGATGGGCC TTGAGCCAAA CAGTCTGCTT CGAGTAGAGT GTAGGTAAAG TCCATTTACA AAAGAGGGCT AATTGTGTCT GACATGCCCA GGGTTGAAGC GTTCAAGGAG AAACTCGAAA TTGAAGAGCT TCAAGATCGT ATCCGCCAAG GACAAGTTGA TGTTCGTCAC ACGGATCTTT CCGGCACAAT CACATCAGCC TTTGTCACTG TTCCCAGTGC CAAACAAGCT CGTGAAATCT TGAAAAATGT GAAAGACGAC ATGAAGCGGG CAGGTTACCA TATTCAACGA GTGTGTCGGA TTTCGCTTGC CTTCTAAAAA AGCTCGCTGA TAGACATATA TTTAGGCACC ACGTTCTCAC AACGTTGTAA GTGACTTGTA ACTTCATATC ACGTGGGACC GACCTTGATG AACGATGTCT GTGCAGCTCT GGAAGAACCT TGAAAAGGAT GTCAAGTCAC GCCATTCACA TGCAATCATA GGCAAATTTG CTCTCGTAAT TATTTGTTTC GTGAACACTA TTCCGCTCAT GATTGTAACT GTCTTAGCCA ATCTGGGTAC AGTAAGTCTT AGTTTTCTTA CGGCAAATCC CAAGTAGATT ACTTGAGAAG TATTAATACG TGAAATCCGG AAGGCCATAG ATCGCTGGCC AACTCTGGCA AAGCTCGAAG ACTCCTCTGA GATCTGGAAA GCCATCTTCA CCGTCCTTGC AGGAGTTCTT CCAGCCACTA TTTCGGCCAT GTTCTCCTAT ATCCTTCCAT ATATCATGCG ACGGCTTTCT CGTTGGTCAG GCGCTCTTAC TCGGGGTCAA TTGGATAAGG CCGTCATCAG ACAGCTCTTC ATCTTTCAAC TAGTATCCAA TTTCATTGTG TTTTCTTTGC TTGGCGTCGT GTATGAAACA TATCTAACCA TCTCGGAAGA CATTGGGAAA GAAAGCTGGT CCACTATCTA TGCAGGTCTG GGTGATGTCC 
CAGCCAAAGT CACTCAAGCA TATATCTCTG AAAGCCTGTA CTGGCTGTCA TGGTACCCGT CAGTCATATT TCCTGACTAC GAGGAGTATC ATGCTCATGC GTATTTTAAG GATTCGCTCA GTAGTGGCGT GCTTACAGCT CCTCCAAATA CCAAGACTCA TTTTAAAGAC GCCTCAGTTA CTGATGATCA AAACACCTCA TGACCTGGCG GAAGTGGCGC AGCCAGAAAA TTTTGAGGTA AGTGCAGAGA TATGTGTCTC CTATACAAAC TTATTCTTGC TCTTAAGTAC GCGATCGAGT ATTCACACGT GGTGAGTTCT ATTTTATAAT GCATGAGGCT CTATTGATCC AGATTGAGCT AGCTCTTTGC TATGGTAGTA GGGTAAGCGA AGACATGATA GGCTGTTAAG TTCTTGCTCA CCACTTTCTA GTCTGATGTA CGCTCCACTG GCCCCAATCA TTGTTATATG CGCGGCCATT TACTTTTGGG CACTATACAT CATTGTGAGT CGTGGTCATA TCGAGAAATT GAAAATGTCA TCTGATGCTG CGATGAACTG ACTTCTTCAC CTCGGTAGCA CAACAATCAG CTTAAATTTG TATTTGACTC CAAGGAAACA GATGGAAAGT GCTGGAAGAT CTTGATAAAT CGCGTCCTTA TCGCGACCGT CTTCATGCAG CTGTTCATGG TGTTAAGTGA GTTTTCCTAT AGTGCTGATG TACTATGACT CCAGTTAATT CCTTCACTGA TAAATAAATC TTCAGCCTGC ACTCTTAAGA CGCAGTCGGC GGCGATGGCA GTTGGTGCTG GACTTCCGGT TGGCATTATT TTCCTTTTTA AAATGTATCT TCGGCGTCAT TACCATCCGG ATGGCGAGGT TTTCTCGCAG TATATCGACA AGTATGAAGA CGATGATACC AGACATGGGG AATGGGCCCC TGAGTATGAG CATGAGTTAC TGAGAGAAGA TTGGATGCCA AAAATCAAGA CGGTAAAGAA TGCCAAGCTC ATGAGTGTCG CTATGCGTGA ATTCCCCAAG TTGAAAGAGC TATTAAGGGT TGGCAGGAAA GCGGACGGTG AAAAATATAG AGGCTTGATG GACAAAAAAC GGCGTAAAAG GGTGCGAGAG AAGGGATGA
|
Protein sequence | MVYLLFLRLL KYLFSAMSVL AGLLAITNYY LNTQTTYGST STISSSGGED DITKRDSPSS SSSETSNNTS IIDNPQLLTA ANVTSNGLLV HISFEWIVTM MIIVFAFKEK LEIEELQDRI RQGQVDVRHT DLSGTITSAF VTVPSAKQAR EILKNVKDDM KRAGYHIQRA PRSHNVLWKN LEKDVKSRHS HAIIGKFALV IICFVNTIPL MIVTVLANLG TLEDSSEIWK AIFTVLAGVL PATISAMFSY ILPYIMRRLS RWSGALTRGQ LDKAVIRQLF IFQLVSNFIV FSLLGVVYET YLTISEDIGK ESWSTIYAGL GDVPAKVTQA YISESLYWLS WYPIRSVVAC LQLLQIPRLI LKTPQLLMIK TPHDLAEVAQ PENFEVSAEI CVSYTNLFLL LSTRSSIHTC LMYAPLAPII VICAAIYFWA LYIIHNNQLK FVFDSKETDG KCWKILINRV LIATVFMQLF MVLTCTLKTQ SAAMAVGAGL PVGIIFLFKM YLRRHYHPDG EVFSQYIDKY EDDDTRHGEW APEYEHELLR EDWMPKIKTV KNAKLMSVAM REFPKLKELL RVGRKADGEK YRGLMDKKRR KRVREKG
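Both sequences are displayed in space-separated blocks of ten residues, and the gene sequence also wraps across lines, so they need cleaning before any length or composition check. The following is a minimal sketch of that step. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, and the short `prefix` string is just the first three blocks of the gene sequence, used here in place of the full text.

```python
def clean_sequence(text: str) -> str:
    """Strip the spaces and line breaks used for display and upper-case the result."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string (display whitespace ignored)."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First three display blocks of the gene sequence, as an illustration.
prefix = "ATGGTTTATC TTCTTTTTCT CCGACTTTTG"

print(len(clean_sequence(prefix)))   # 30
print(round(gc_percent(prefix), 1))  # 33.3
```

Applied to the full gene and protein sequences, `len(clean_sequence(...))` would be expected to match the Gene Length and Protein Length fields, and `gc_percent` would be expected to round to the listed GC content, assuming that value is computed over the whole gene span.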
|