Gene Information |
Locus tag | CNH02340 |
Symbol | |
ID | 3259322 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | + |
Start bp | 473075 |
End bp | 476060 |
Gene Length | 2986 bp |
Protein Length | 809 aa |
Translation table | |
GC content | 58% |
IMG OID | 638258251 |
Product | hypothetical protein |
Protein accession | XP_572409 |
Protein GI | 58270506 |
COG category | |
COG ID | |
TIGRFAM ID | |
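The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that Protein Length excludes the stop codon; the record states neither convention explicitly. The function names `gene_span_bp` and `coding_bp` are hypothetical.

```python
def gene_span_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def coding_bp(protein_aa: int, include_stop: bool = True) -> int:
    """Coding bases implied by a protein length, optionally counting the stop codon."""
    return 3 * protein_aa + (3 if include_stop else 0)


# Values copied from the Gene Information table.
start_bp, end_bp, protein_aa = 473075, 476060, 809

span = gene_span_bp(start_bp, end_bp)  # 2986, matching "Gene Length | 2986 bp"
cds = coding_bp(protein_aa)            # 2430 coding bases including the stop codon
non_coding = span - cds                # 556 bases not accounted for by the protein

print(span, cds, non_coding)
```

Under these assumptions, the 556 bp difference between the gene span and the coding length is consistent with the "Is gene spliced | Yes" field, since intron sequence would occupy part of the genomic span without contributing to the protein.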
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.501661 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCATCTC CAGCTCCCCA AGGCGACCAG CGGGATCTCC CCCCTCCATG GTAAGTTGTC CTCCAACAGC GCAGCTCACT CCCCAAGGAT CAGACAGTTC GATCCCAGCT ACCAGACATA CTTTTACGTC AACCCGACCA CAAACCCGCC CACCACTTCC TGGACCCACC CCGGCCTTGC AGAGGGACAA GTCCATCCGG AGCAGGCACA GGCCATCCAC GAGGCTGGGC AGACAGGGGG CGACAACCAG GGAGAGGCTG CAAAGTTCCT GAACTCGGGA AGCGTGGCCG ATCCTGCGAC CGGGAGCTAC AACCAGCCAG GCGAGATCCA GGGTGCTGGC GGGCAGACTC CCGAGGCTGG CGAGCGGGGA TTGGGCAGCA TGGTGAGCGG CCTGATGGGC AAGACGAACA ACAACAACGT ATGTGTCTAC GATCCATGGA CGGGTGCGCA GAGCTCATGT ACGTCTCCGC AGCAATATGG CTACAACCAG CAGCAGTACC CGCAGCAGTA CCCGCAACAG CAGCAGCAGC AATCTGGCGG TAGCAAGTTT GGGTTCGGGA CGGGGATGGT CGCGGGCGGC GCTGCGTTGC TCGCCGGAAA GCTGATCTCC AACGTCGTCG GCGGGGTAAG TCGGGCGGTG TGATTGACGC TGGAGTCGGA GCTGACGAAT GCGACGTTGA CGTGCAGCGC CACAACTCAT CGGGCGGCGG CATGTTCGGC GGGGGAGGAG GCTACAGCCA CAACATGGGT CCCCCTCCCT TCATGGGCGG TGGGCACCAT GGCGGGCACG GCGGGCACCA CGGCGGCGGC GGGATGTTTG GCGGCGGCGG CGGCGGCCCC GGTGGCTTTG GCGGTGGTCC CGGGCGATGG TGAGCTGTAG CATAGTATGG TAGATGCATA TGCATGTAGC GAATTCGTAG ATACCTGGCG CATGTATCCC GTCGTCGCTC CCGCTGTCTC CGACGCCATC ATCCACGCCG CCGTCGACGT CTGTTCCGCG TTCCGCATCA TCTCTGTTTC CATCTACTCT TTATACTGCC ATAGACTCAT TCATTCATTT GTCCATACAT TCACTCCCAC TCAGCAGCCA CCACCCACCC TCGGCCGCCA TGACAGAATC CGCAGGCCAC TCCAGTCCAC CCATGTCCCC CGGCACCCAG CCTGCCGCCC AGCCGCCCGT CACCCTCAAA CTCAAACACT CGATGGCCGA CCGCGCCAAC TACTCTTCGT CTGAAGAGGA AGAGGAAGGA GAAGAAGAAC AACTGGCAGA AGACAGACTT TCTACATCTC CACCACCTAA AAGGAAAAAG CTCTCTTCCA CACCTACCAA TTCATCCGCA GCGTCAAAGG GAAAACAGAG CATAAAACTT ACTCTTGGAC CTCAACATGC CCATCTACAA CAGCCTTCCT CGTCCTCATC GGCTTCTGCG GGCTCTGCGG CGAATCAGGG CAAGAAAAGT TATGACTGGC TACAGCCCTC TGCAGCTGGT GCTTCGCACA GCGGACCTCC CGAACGTGAG CGTGAACGCG AGCGGAGCGC ACTCTCACCT GGTGACCTCC CCAGCGCACT CTCACCGGGA GACCTTCCCA TCCCATCTGT CTCTGTCGCC TCTGCCGCCT CGGGCTCGTC GACGAAAAGC CTAGGCATGT CGCCCGCTGA AGAAGCGATT GGTGGTCTCC TCGATGAGTC TGTTGATGAC AACACCAGCA CCAACGACAA CGGTGATCCG TCTACTCTGA AAAAGGAAAA TCAAAGTGCA CCGAAAGCGA AGCGGAGCCA TCATAAGAAG AAGGCATCTG 
ATGCACCCCC CGGGCCTGGA AGGAATTGGA AGAAGGGCAT GAAAAAGTGG GTCTATACAT CATCCCCCCT CTGTTTCTCC CTCCCCAATC CCCTCGGTCT GCCATCGTGC AAAGGAAAAT ATTCAAGAGC TGACACCATT ATCCTGTTGG TATAGGGCCG CACCAGGAGC ACCAGGGGTC AAGCTTGAGA ATGAGGGCAC CCCGGCAAGT ACGCCTGCGT TTTCAGCCAT CAGCCGTGAA ACCTCGCCGG ATCCTCTCGG TGAGTTTTCT CCTCTTCAAA ACGCACGCCA AAAGTTGCAT TTTACTAAAC GAAAATGTCG CAAAAGGCCT TCCCTCCCCA CGCCTCGCCC CCGACACCCA AAGTATGACT ATAACTCCGG CTCCCGTTCC ACTCCCTTCA GCTTCCTGCC CCCCGTCCCC GCCCTTCATC CCGGCCGACC CCACCACCCT CGGCTTCCCC GTTTTCTCCC ACCCCATCGT CCCTCCCAAA ATCCATCTCG GCACATTCCC AAAAGTCACT TCCTTTTTCG CACCCATCAA CGGAGGCGAT TCCGGGCCCT TTCCGAGAAA AGAAAAAGTT AGGAGCTGGA CGTTTCAGGA AAAGGGGATT GTAGGTGTTG GCGGGGGTGT GATGAAGTAT AAATCTTGGG CAAGAGGTCT GTTTCCTCAT TTTATCTTCC GCTGTCGATT GACCACACGT TTACTAATTT GACCATATGC AGGCCCAACA TCTGAACTCG AACGAGCACT TCAAGAAGAA AAAGACGCGC AGACGCCACA ACGGCAACCG AAAGCAGCCA AAGGCACCAA CGCAACATCC ACCCCTGCAC CTCAGACCGG TGCTGACCCC ACTGCATCTG CATCTGCATC CTCCACCCCC GCCCCTCCCA ACGCTGCAAA CGCAGCAGAT ATCACCACCG TCAATGACGA CCGCCCGAAC GTGAGTAGGG CAGACTCGTT TGATATGAGT ATGAATGCCA GTCCTGGTCC TCCGGGCGAT GATGAGAGTG AGAATGGAAG CGAGATTGCG GGTCCGACGA GTACACCCCC TGCAGGTGGG AAGAAGAAGA TGGGAAGCGC TCCTGCGAAG AAGAAGGGGA AGACGCCAAA GTCGAAACTG GCACAGGAAA TTGTCATCAG GGAGGATAAT GAAGGCGCAC CGGTTGAACA GGCGATTGCA GAGTAA
|
Protein sequence | MASPAPQGDQ RDLPPPWIRQ FDPSYQTYFY VNPTTNPPTT SWTHPGLAEG QVHPEQAQAI HEAGQTGGDN QGEAAKFLNS GSVADPATGS YNQPGEIQGA GGQTPEAGER GLGSMVSGLM GKTNNNNVCV YDPWTGAQSS CTSPQQYGYN QQQYPQQYPQ QQQQQSGGSK FGFGTGMVAG GAALLAGKLI SNVVGGRHNS SGGGMFGGGG GYSHNMGPPP FMGGGHHGGH GGHHGGGGMF GGGGGGPGGF GGGPGRCSHH PPSAAMTESA GHSSPPMSPG TQPAAQPPVT LKLKHSMADR ANYSSSEEEE EGEEEQLAED RLSTSPPPKR KKLSSTPTNS SAASKGKQSI KLTLGPQHAH LQQPSSSSSA SAGSAANQGK KSYDWLQPSA AGASHSGPPE RERERERSAL SPGDLPSALS PGDLPIPSVS VASAASGSST KSLGMSPAEE AIGGLLDESV DDNTSTNDNG DPSTLKKENQ SAPKAKRSHH KKKASDAPPG PGRNWKKGMK KAAPGAPGVK LENEGTPAST PAFSAISRET SPDPLGLPSP RLAPDTQSMT ITPAPVPLPS ASCPPSPPFI PADPTTLGFP VFSHPIVPPK IHLGTFPKVT SFFAPINGGD SGPFPRKEKV RSWTFQEKGI VGVGGGVMKY KSWARGPTSE LERALQEEKD AQTPQRQPKA AKGTNATSTP APQTGADPTA SASASSTPAP PNAANAADIT TVNDDRPNVS RADSFDMSMN ASPGPPGDDE SENGSEIAGP TSTPPAGGKK KMGSAPAKKK GKTPKSKLAQ EIVIREDNEG APVEQAIAE
|