Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNI00810 |
Symbol | |
ID | 3259543 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006694 |
Strand | + |
Start bp | 196833 |
End bp | 200109 |
Gene Length | 3277 bp |
Protein Length | 988 aa |
Translation table | |
GC content | 48% |
IMG OID | 638258566 |
Product | expressed protein |
Protein accession | XP_572741 |
Protein GI | 58271170 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | AAACCTTCAT CTGTCATCTT TTCGCCTCCA CGACCTACAA CAAATAATCT TGAGAGCCCA ACATCAGTAA CAAATAGCGC AGTCACGAAG GTTGATTAAT ATTGTTGTTG ACTGACAAAA TGGGACCTCT ACGTCGTCAG CCAGCTATGC TCGACTTGCA CAGCGCCAGT CGCGCTGTTA GCGTCGAATG GGATGACTGG GACAAGGATC CATGGGGACA GGAAGAGATA AACCATGGAA GTCTCAAGAG ACAAAGCACT GATGTACTGG GGGAGACTGA GGATCGCAAA AGAATCAAGG AGGTGAGGCA GGTAAGCAAT ACCCTGTGTT GATTGTTCGC TCCATAAGCT GACATAGACT GACCAGAGCG ACAAGTATCG GAAGGCATCA TCCATTGCGA AGAGGTCTGA GCCTCTCTTG AGAAGACCGG TAAGGTCATC AAACTCGCCA GACACCACCA CCATTGTGCT CGAACCACGA AGTCGGTCGT TTTCATCCGC CCGCTTCGAA AAAACTGTCA CTTATGACCA TGCATCGGCC ATATCGCCTT GTTCCGAGTA TGGCGAAGGA ATCTCTTTGG TTCTAGGAAT TTCAAAACCT CCCCAACGAT TCCACGCATC AGAGTATCCC GAGTCAGCCT ACTCCACTGA ACGACCAGAC CTTGCTGTGT TTGGACAACC CGCCTCTCAA CCAATGGTCA GAGAATATAC TTCTATGGGC TGGGATGGAG AGAATGTTGA GAACCGCTGT GATACCGGCA AAGTTCTTGA AGGGCTCCCA CTGATTAATA CCAGTAGATT AGAGCGAACG AGGAAGGGTA ATTCCAAGTC GCCATGGCAT CAAGTGTCGC GCAAATACTC TCGGGCGTCA CCTTTGTCAA ATTCTACTGG AGACCATGAT GCTCGAACGC ATACCCTTTT GCATACAATG AAGAGAGGCA AATCCAAGTC CTATCAAGGT GGCCTTTCTC TCAATTTTCA GAGCGTTTCT CTCTCACCAT CACCTATCGC ACCCACACCC TCCCCAACAT CCTCATCTCA CTCTAGTCAA TCATTCCTTT CCGCCTCCAT GTCTGATCAT TCCGTCCGTA TTCTCACAAC GATCACAGCC CAGACGCGAC CACATATGAA TTTCTTGGCC AATGCATCTG ACGTCTCGAT CCTGGCGCCT TCCACCATCG GAATAGGCTA CTCTTCGTCT GCTGATACGG GCTCTCCAGG TAGTTTGCCA GAAAGTCTGG AAGCGGAGTG TGTCGATCTG ACGGAGACTG GTCGTCGGGC CTCGCATGCG AGTAAGAGAG ATAAGCGACA TTCGTGGATG AGCCACACGA ACACGGTAGA CCCGGAAACT GTCGAGATAA TCACCGAAGA ATTACCATCT ACCCAGCCTT TGAAGTCCGA TCCAGTATAT CTCAGGCCCT GCTACATTCT ACCTGTTGGC TCTCAAACCC CTTTACATAC TTGTTCGTCT GCGCAAGGAC CAAAATTGAA AAGAGCACAT ACCTATGCCA ACTTGAAGGA TCTCTGGAAA CTCGCTTCAT TCCCCGGTGG GCTTATCGAG GTGCAGAAAG ACGAAGAACC CGTAAAGCAA GAAAACCAAG GTCTATCCTC TCAAATTCTC GAACCTCCTG CTATCATCCA ACCACCATCG CTTGGTCCGC CAATCGATAT CCTATCCCGT CCAGTCTCTT CTGCCATATC GCGTGACACC ACCACAACAG CTCGAGGTCC TCCAACTACC CATTCTTACT CTCCCTTCCG ATCAGTATCC ATGTGCTCTC CGCCGGATAC AGTCGAAGAG CTCTTTCCTG AATCCCCATT CAACTCCCCC CAACCTGGCA ATATCGGCAA AATCAAACGA TCCCCTTTTC AAGCAACTCG TAATTCCTTG AAAGGGATGC TAAACAAGTC TTTCAAAGGC CGATCACTGG CCCATGCGTT TGATAATGCG GAACAGAACA GTCAGCAAAG AGAGTCGTCT TTGAAATCAA AGATATCTGG GCCTTTGCTG ATCAGAGGTA TGCGAGGTGA GATGGATAGG AAGAAAAGCG ATAAAGAATG GAGAGAAGAG GTCTTGAAAG ACGTTGTAGG AAGGACGCTG TCTTCGCAAT TCAAGCTCCT TGAAGAGGTA GCTAAGCAGC AGCGAGAATC TTTACTTGAT AAAGAGAAAA TTAGCCAGAA GACTCCGTCG TTTGGGATTG GCCTGAAAAA CAGGAGCCCA ATACCAGAAA CACTTTTGAG CGCCGTTTCG ATATGCGACA AAGAAAATCC CATCACTGCT GATTCGCACG CAGACAGAAT GAAAATGAAG AGCGGAACAA GACCGAGTAT GGAAAGTAGC TTGAGCTTGA GGATGGTAAC AGATGAGACA TTCAACAACC CGAAAACCGA GGGTCTGACA TCGTGAGTCT ACAGCCCACA GATGAAGGGC AATCGTTGAC ATGTCTTAGT GGACATCTTG CCAATTCCAT TGGCCGTAAA AATCTCACAA ACCAAACCCC TCCGCATCGG CTATCGGGCA AAGGCCGTCC TACCTCTGCC CCACTTCTCT CTGCATGGTC TCCACCACGT CCCAAATCCG AAAAGCTGTC TCAAGTGATC AATCCACCAA TTGTCGTCAA TCATGCGACA CCCCGTCCCG AGGAGTCTCC TAAGACCAAC CCGCATCATC CGCCTTCATC TCCTTCACCG TCGCCAAAAG CGGGAAAAAG TAAGAGCGGG AGGATCTCTA GAAGGTCAAG TATTCTGAAC TTGCTCAAAT CCAAAGATTC AAAACACCAG AACGCCTCCC AGAGTCCTCC TAGACCAAAG AAGATGGCAA GCGGGACGTT CACTCTCACA GGTAGCATTC GCTCCTCTTT CTTGGCTATC ATTAAGAATC ATATTCAAAC CATGCAGCAA GAAAAAAGAG GTAAATATTC TAGGACACCC TGCATCATCC CTCCTTCCAC GTTACCCCTT CGTACACCTA TCCATCAACA TACTGAACTT TATCGACCTC TTTCAAGAGC GAGATCATCG TTCGAGCTGA CTTTGAACCT TGAGCCTGCA AAGCCATTAT TGGATGAGCT GCTGGCGAGG GATGACGCGC TGGGCTTTTT GGAAGGAAGA AATAAAAGGG AGCAGAGAGT AGAGGTTGAT ATTGAGAAGG TGCTGGAATG GCGTAAAGAA GTGGAAGAAG ATATCTAGAG ATTAGCATGT GGTGTGCGAA ACGATGTGGT TTTGGGCTCT GTGTAAAATC TTTAACTGTT GGGTGGCCTA ATGAGTAATG ACGATTA
|
Protein sequence | MGPLRRQPAM LDLHSASRAV SVEWDDWDKD PWGQEEINHG SLKRQSTDVL GETEDRKRIK EVRQSDKYRK ASSIAKRSEP LLRRPVRSSN SPDTTTIVLE PRSRSFSSAR FEKTVTYDHA SAISPCSEYG EGISLVLGIS KPPQRFHASE YPESAYSTER PDLAVFGQPA SQPMVREYTS MGWDGENVEN RCDTGKVLEG LPLINTSRLE RTRKGNSKSP WHQVSRKYSR ASPLSNSTGD HDARTHTLLH TMKRGKSKSY QGGLSLNFQS VSLSPSPIAP TPSPTSSSHS SQSFLSASMS DHSVRILTTI TAQTRPHMNF LANASDVSIL APSTIGIGYS SSADTGSPGS LPESLEAECV DLTETGRRAS HASKRDKRHS WMSHTNTVDP ETVEIITEEL PSTQPLKSDP VYLRPCYILP VGSQTPLHTC SSAQGPKLKR AHTYANLKDL WKLASFPGGL IEVQKDEEPV KQENQGLSSQ ILEPPAIIQP PSLGPPIDIL SRPVSSAISR DTTTTARGPP TTHSYSPFRS VSMCSPPDTV EELFPESPFN SPQPGNIGKI KRSPFQATRN SLKGMLNKSF KGRSLAHAFD NAEQNSQQRE SSLKSKISGP LLIRGMRGEM DRKKSDKEWR EEVLKDVVGR TLSSQFKLLE EVAKQQRESL LDKEKISQKT PSFGIGLKNR SPIPETLLSA VSICDKENPI TADSHADRMK MKSGTRPSME SSLSLRMVTD ETFNNPKTEG LTSGHLANSI GRKNLTNQTP PHRLSGKGRP TSAPLLSAWS PPRPKSEKLS QVINPPIVVN HATPRPEESP KTNPHHPPSS PSPSPKAGKS KSGRISRRSS ILNLLKSKDS KHQNASQSPP RPKKMASGTF TLTGSIRSSF LAIIKNHIQT MQQEKRGKYS RTPCIIPPST LPLRTPIHQH TELYRPLSRA RSSFELTLNL EPAKPLLDEL LARDDALGFL EGRNKREQRV EVDIEKVLEW RKEVEEDI
|
| |