Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNC04670 |
Symbol | |
ID | 3256188 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006685 |
Strand | + |
Start bp | 1411245 |
End bp | 1414467 |
Gene Length | 3223 bp |
Protein Length | 881 aa |
Translation table | |
GC content | 50% |
IMG OID | 638255686 |
Product | conserved hypothetical protein |
Protein accession | XP_569724 |
Protein GI | 58265136 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GCTCAATCCT CCATCCATCC TTTCTAGAAG CCCTAATCCC GACCTCCTGG CTCCGCTTCT CTTCTTCTTC ATTCTCCGTA CAACTGGAAC CTGGTGTTCT GTACAGCCTT GACGGCATCT ACTCTCGTTT AATCTGTTTG CTGGTCAAAC TAATCTGGAG TTTCAAACCA GTAACCGATC ATTCCCTAGT ACCTCACTCT TTACTCATTG ACACCCATAA TGGCCCCCAT CATTGAAGCT GCCGTGGTTC TCCTAGCTCT TACGGGCAAC ATTGTGGAGG CTAAGCCCCT GCGCACTCCC GGTCGACATG CGCCCCCAGG TGTGGCTAAT GTCCTCGCTT CGTCCAAGAG GTCTCTTCAT AGTCTTCTTG CCAGGTACTA CGGAACTGCT CACGGGCTGG TGGGTCTTTG TCTACTCTTA TATACATCGT TTCTGATCAC GTCCCTCCGT AGAGTAAACC GCCTTCTCTT CCTACCAAGC GTGACACGTC CTTACCGAAC GGGTGGTCAA CCTTTGGTTG CGTCGCCGAG TCTTATGACG AGCGTTTACT GCAAGGATTT GCATTTTCAT CTTCCAGCCT TACTCCTTTT CTTTGCGTCA CCGAATGTAC AAAATTAGGG TATACTATGG CAGGCACCGA GTACGGCGAT GAGGTGAGTC GTGAAAGTTT GTTCGCAACT TGTCGCTAAC TGTCCGTAGT GTTACTGTGG CGATAATTTC GTCGGTAATG GCGGTGGCAT GGCTCCTTCT TCCTTCTGCA GTATGCCATG CGAGGGCGAC ACAAGTGAGA TGTGCGGCAA CAGTTGGTAT CTTAATCTCT ACACGTACAA CTCTAGCGCT CTTCCTCTTT GTAGCGGGCC TACCAGCACG GTTTCTGCTC CTGTGGAAGA AACTAGTAGT CTTTTGTCTA CGTATTACTC GACCTTTGAA TCCTCTGTAA CCGCCACCTC ATCTTTGGCG TCTGCTTCTT CAACAGATGC CACCATTAGT GCCAATTCAT CGAGCTCCGT GATGTCTGCT TCCGCGACTG CGACGGTATC CTCAAGCGCT ATAGAGACAG CCGGGTCTGT GACCAATAGC GTCTCTAGTG ATACCGCTCC TGCAGCGACA TCCATTTCGA CTTGTCCCAT ACATGAGGAT TCCGACGACT CTTCCGAGTG GTATGCGCTT GGCTGTGGCT TGGACTCCGA GGATCGGATT CTATCGTCAT ACTTTATAAG CCTTGACAAT ATGACTGTCG ACTCTTGTCT CACAATCTGC GAAACCCGTG GATACGTGTA TGCGGGGTAA GCATGCTCGT TAAATGGTGT TCGATTAATT AGCTGACATT GTGGATCAGC CTGCAATACT CCGATGAGTG TTACTGTGGA AACTCATTAT CCTCGTCTAC AAGTTACGAC AGCACTCGGT GTGATATGGA CTGCGCTGGG GACTCTGAGG ATACTTGTGG TGGTACTTGG GCTATTGAAC TCTTCAGTCT CATCTCGTCA TCTTCAAGTT CCTGTACCGA TAGTCTGTCC ACCGAGAGTG CAACCACAAC TCTTGTTACC TCAACTACTA GCGGGTTCAA CACTGCAAAT ACCACCGCGA TTGCGAGTAG CACAGACTCT GCTTCTTCAG TCATTGTTAC CTCCTCAGAA CGCGCTACAG AAGCTACTTC TGTTACCGAG TCTACGGCGG GATCTGAGAC AGTCAGTGTC CCCGTCACGT CGGCATCCGT CATTTCCCCA ACTAGTACGA CTTCTACGGA GTCCACCGCT TCTGCAGCCT CAACTTCTGT CCCATCTTCT TCCAGCACTC ATCAAGTCTG GGCTCACTAT ATGGTCGGTA ATACCTACCC TTATACTGTT TCAAATTGGG CTAGCGACAT TTCTGCCGCT TTAGCGGCTG GTATTGATGG GTTCGCACTC AACATGGGTT CCGACGACTG GCAACCCGCT CGTGTAGCAG ATGCGTACTC TGCCGCCGCT TCTACAGGCT TTAAGTTGTT CCTGTCTCTT GACATGACCG TTCTCAGCTG TTCGTCATCT TCGGATGCCG CAAAGCTCGT CTCTATCGTT GAAGGATACG CAACTGCGAC CGCTCAAGCC ACCTACGAGG GCAAGGTACT CGTCTCCACC TTTGCTGGTT CGGATTGTGC TTTCAGCTGG CAGACAGACT TTGTAGACGT TCTCTCGTCT GCTGGAATCA ACATCTTCTT TGTACCTAGT ATCTTCTCCG ACGTCAGCAC GTTCTCTTCC AACACTTGGA TGGACGGTGA GCTCAATTGG AATTCTGGGT GGCCGATGGG AGCCGAGGAC ATCACTACTA CGTCAGATGA CGCGTACATG GCCGCCCTTG GCAGCAAAGA ATACATGCCT GCTGTGTCTC CGTTCTTCTA CACTCACTTC GGTCCCAATT CCTGGAATAA GAACTGGCTT TACCGTTCCG ATGATTGGCT CTACTGCACT CGATGGGAAC AGATTATAGC CATGCGTGAC AGTGTGCAGA TGACGGAAAT TCTTACTTGG AACGATTTTG GAGAATCCTC GTACATTGGT CCTATTGAAG GTGCTCTTCC TGCAGGCTCT GAGGCCTTTG TTGATGGTTT CACACACACT GGGCTCTACT CCCTCACCTA CTATTACGCA ACTGCATTTA AGACCGGTGC CTACCCGACT ATCACAGAAG ACGAAATCAT CGTATGGGCC CGCCCTCACC CGCACGATGC AACTGCCTCG TCCGACTCCA TTGGCAGACC TACAGGCTGG TCCTACACAG AGGATTACCT GTACGCAGTA GCCTTGACGA CAGACGCTGC TACCGTGACT CTTACATCAG GTTCCACCAC TGAAACGTTC ACTGTCTCAG CCGGTCTCAC TAAGCTCAGG GTCTCCTTGT CCGAGGGTTC CATCTCCGGT TCAATCTCTC GTTCGGGCAG CACAGTGGCT TCTTATGATG CAGGCTCCGC CTTCACGTAC ACTACCTCAC CCACCACTTA TAACTTCAAT TACTTTGTTG GATCTAGCTC TTCATAGTAT TTTTTGTGGT TGACTGGGTT GTGGTATATC AGGTGTTAAT AGTTGTTCTT TTTTCTTCTT GATCTACAGA ATTTTTTCTT ATCTATATTG ACCCTTTTTA ATCTTCATAA TGTCTCGCTG CCTGTAATGC CGGCATCCTG ATCATATTAT TACTTATTTA GGACACATTT ATCTAGCAGA TGATGTACTT AATTTCAGCT GTG
|
Protein sequence | MAPIIEAAVV LLALTGNIVE AKPLRTPGRH APPGVANVLA SSKRSLHSLL ARYYGTAHGL SKPPSLPTKR DTSLPNGWST FGCVAESYDE RLLQGFAFSS SSLTPFLCVT ECTKLGYTMA GTEYGDECYC GDNFVGNGGG MAPSSFCSMP CEGDTSEMCG NSWYLNLYTY NSSALPLCSG PTSTVSAPVE ETSSLLSTYY STFESSVTAT SSLASASSTD ATISANSSSS VMSASATATV SSSAIETAGS VTNSVSSDTA PAATSISTCP IHEDSDDSSE WYALGCGLDS EDRILSSYFI SLDNMTVDSC LTICETRGYV YAGLQYSDEC YCGNSLSSST SYDSTRCDMD CAGDSEDTCG GTWAIELFSL ISSSSSSCTD SLSTESATTT LVTSTTSGFN TANTTAIASS TDSASSVIVT SSERATEATS VTESTAGSET VSVPVTSASV ISPTSTTSTE STASAASTSV PSSSSTHQVW AHYMVGNTYP YTVSNWASDI SAALAAGIDG FALNMGSDDW QPARVADAYS AAASTGFKLF LSLDMTVLSC SSSSDAAKLV SIVEGYATAT AQATYEGKVL VSTFAGSDCA FSWQTDFVDV LSSAGINIFF VPSIFSDVST FSSNTWMDGE LNWNSGWPMG AEDITTTSDD AYMAALGSKE YMPAVSPFFY THFGPNSWNK NWLYRSDDWL YCTRWEQIIA MRDSVQMTEI LTWNDFGESS YIGPIEGALP AGSEAFVDGF THTGLYSLTY YYATAFKTGA YPTITEDEII VWARPHPHDA TASSDSIGRP TGWSYTEDYL YAVALTTDAA TVTLTSGSTT ETFTVSAGLT KLRVSLSEGS ISGSISRSGS TVASYDAGSA FTYTTSPTTY NFNYFVGSSS S
|
| |