Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNN02190 |
Symbol | |
ID | 3255322 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | - |
Start bp | 677148 |
End bp | 679980 |
Gene Length | 2833 bp |
Protein Length | 773 aa |
Translation table | |
GC content | 49% |
IMG OID | 638254629 |
Product | conserved hypothetical protein |
Protein accession | XP_568704 |
Protein GI | 58262588 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG1331] Highly conserved protein containing a thioredoxin domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.129596 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTTCGAT CTTTGCCTCG CACACTGAAA CCTATAATAG TTCCTTTTCC ACCCCAAATT CGGCCCACTC CACGCGGAAT ATATCATCTC AGAATGTCTT CGACGTCCGC TACCGACCCG ACCCCACGGC TGAGCAATGT GCTGGCCAAA AGCAAATCAC CGTATCTGTT GCAACACAAG GACAATCCTG TAGCTGTAGG CCCCTTTCAA TTTTGGCACA ACGTGCAAGG CTAACCAAGT GACGCAGTGG CAGGAGTGGT CTCCGGAAAC TATTGCTCTG GCTCAGAAGC TTGACAAGCC CATCTTCCTT TCTTCAGGCT ACTCAGCATG TCACTGGTGT CATGTGTTAG CCCATGAATC TTTTGAAGAT GAAGAGACTG CCAAAATGAT GAATGAGTGG TTTGTCAATA TCAAGGTGGA TAGGGAGGAG AGGCCGGATG TGGATCGAAT GTATATGAGC TACCTGCAAG TGGGTGTCAA TAATGTTCAG TCAGAAGGAG GAAACTCATG TTACCCAACT GCGTAGGCTG TATCTGGAGG TGGCGGCTGG CCCATGTCAA TCTGTAAGTT TAGTGTTTCT CCTCGGGTAT ATACTGATGT TCACAAGTCA TGACCCCGAA GCTTGAACCC TTCTTTGCTG GTACGTCACA GAATCTTCAG CTACGGTGGT ATTATAACCT ATGTTTTCAC AGGAACATAC TTCCCACGAC CCAATTTCCA TCAACTTCTC AACAAAATAC ATGAGGTGTG GGAAGAAGAC CGTGAAAAGT GCGAGAAAAT GGGAAAGGGC GTTATTGAAG TCTTGAAAGA TATGAGCCAT ACTGTAAGGT TTCGCCTTTC ATCCAGGATT TGTCAACTAA CGCAGACGTC CAGGGCCGTA CTTCAGAATC CCTCTCTCAA CTCCTCGCAA GCTCTCCTGC TTCCAAACTC TTTTCCCAGT TATCAACTAT GAATGACACT CGCTACGGCG GTTTCACCAA CTCTGGCTCC TCCACCCGAG GCCCCAAGTT CCCTAGCTGT AGTATCACTC TCGAACCCCT CGCACGTCTC GCATCCATTC CAGGTGGAGG TGCGAGAAAT GCCGAAATCC GGGAAGATGC AAGAGAGATG GGAATGAAGA TGCTTAGGTC AATGTGGTCT GGCGGGATAA GGGACTGGGT TGGAGGTGGA ATGGCGCGGT ACAGTGTGGA TGAGAAATGG ATGGTTCCGC ATTTTGAAAA GATGCTATAC GACCAAGCAC AGCTTGTTTC GTCTTGTCTT GATTTTGCCC GTCTTTATCC TGTCGACCAT CAGGATAGGT TGCTTTGTTA CGATTTGGCG GCAGATATCC TCAAGTACAC CTTGAGGGAT CTGAAATCGC CAGAGGGTGG GTTCTGGAGC GCGGAAGATG CTGACTCGGC AGAATACAAG GGTGCGAAGA AGAGTGGTAT GTTGTCCCTT TCCTTCTATA CCATTGATGA CTGATAGCTT GCCAGAGGGA GCATTTTACA TCTGGAAGAA GACCGAGATC GACGAAGTCC TGGGTGATGA CGCCCCATTG TTCAATTCAT TCTTCGGTGT TCAGCCTGAC GGGAATGTTG ACATCATTCA CGATTCCCAT GGCGAAATGC GAGGCAAAAA CATTTTGCAT CAACATAAGA CCTACGAGGA GGTTGCGCTT GAGTTTGGCA AGCGGGAAGA TCAGGCGAAA GGTATTATCA TTCAAGCTTG TGAGAAACTC AGGTTAAAGA GAGAGGAAAG GGAGAGACCG GGTCTTGATG ACAAAGTAGG TTGCATGAGC GAATGGAGTA TTCATACTGA CACCTTATGT AGATCCTCAC TGCCTGGAAT GGCCTGATGG TGCGTCAATC GTGTATTCTG TATATGCTTC TGCACAAGTC CTCACAACTA ACCATCCCAC AGCTCACAGC TTTATCGAAA GCGTCAACCC TTCTTCCGCC ATCCTATGGT ATTAGATCTC AATGCCTTCC CGCGGCTTTA GGCATCGTCA ACTTTGTGAA ATCTCACATG TGGGACTCTT CCACACGCAC CTTGACAAGA AGTTATCGGG AGGGCAAAGG ACCCCAAGCC CAAACTGATG ATTACGCGTT CCTTGTTCAA GGTCTTTTGA ACCTGTACGA GGCTACTGGA GATGAGAGTC ATGTTCTCTT TGCTGAGGAA CTCCAGAAAA GGCAAGACGA ATTGTTCTGG GATGATCATG ATGGAGGGTA CTTTGCGAGT GCGGAGGATG CGCATGTTCT GGTGAGGATG AAAGATGCTC AGGTGAGCCC TTACATGACT CATACTCTGT CTTATCTGAC ACAAACTCCC AGGACGGTGC GGAGCCCTCT GCAGCGGCGG TGTCAGCACA CAACCTCTCC CGCTTTTCAC TTCTGCTCTC ATCCGAGTTT GAAAACTATG AAGCTCGTGC CGAAGCGACT TTCCTCAGCA TGGGACCCCT CATTACTCAG GCACCGAGAG CAGTGGGATA CGCTGTATCT GGGTTGATCG ACCTTGAGAA GGGATACAGA GAGGTCATTG TCATCGGGTC TGCCAGTGAT GAAGTGGTAA AGAAGTTCTT GGAAGCTGCT CGAAAGACGT ATTTCTCCAA CCAGGTCATC GTTCAAATCC AACCGGAGAA CCTGCCTAAA GGACTTGCGG AGAAGAACGA GGTGGTGAAG GCTTTGGTAA ATGATGTAGA GAGTGGGAAG GAAAAAGGAG CGAGTTTACG AGTGTGTGAG GGGGGCACAT GCGGTTTGCC CGTAAAAGAT TTGGAGGGGG CAAAGAATTT GTTGAAGGGT GTGTAGGTAC CTTCTTGTGT CTTTTAAAAA TTAATGTATC GCTATTGGCT GTT
|
Protein sequence | MFRSLPRTLK PIIVPFPPQI RPTPRGIYHL RMSSTSATDP TPRLSNVLAK SKSPYLLQHK DNPVAWQEWS PETIALAQKL DKPIFLSSGY SACHWCHVLA HESFEDEETA KMMNEWFVNI KVDREERPDV DRMYMSYLQA VSGGGGWPMS IFMTPKLEPF FAGTYFPRPN FHQLLNKIHE VWEEDREKCE KMGKGVIEVL KDMSHTGRTS ESLSQLLASS PASKLFSQLS TMNDTRYGGF TNSGSSTRGP KFPSCSITLE PLARLASIPG GGARNAEIRE DAREMGMKML RSMWSGGIRD WVGGGMARYS VDEKWMVPHF EKMLYDQAQL VSSCLDFARL YPVDHQDRLL CYDLAADILK YTLRDLKSPE GGFWSAEDAD SAEYKGAKKS EGAFYIWKKT EIDEVLGDDA PLFNSFFGVQ PDGNVDIIHD SHGEMRGKNI LHQHKTYEEV ALEFGKREDQ AKGIIIQACE KLRLKREERE RPGLDDKILT AWNGLMLTAL SKASTLLPPS YGIRSQCLPA ALGIVNFVKS HMWDSSTRTL TRSYREGKGP QAQTDDYAFL VQGLLNLYEA TGDESHVLFA EELQKRQDEL FWDDHDGGYF ASAEDAHVLV RMKDAQDGAE PSAAAVSAHN LSRFSLLLSS EFENYEARAE ATFLSMGPLI TQAPRAVGYA VSGLIDLEKG YREVIVIGSA SDEVVKKFLE AARKTYFSNQ VIVQIQPENL PKGLAEKNEV VKALVNDVES GKEKGASLRV CEGGTCGLPV KDLEGAKNLL KGV
|
| |