Gene Information |
Locus tag | CNN00670 |
Symbol | |
ID | 3255354 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006683 |
Strand | - |
Start bp | 216190 |
End bp | 218270 |
Gene Length | 2081 bp |
Protein Length | 469 aa |
Translation table | |
GC content | 50% |
IMG OID | 638254484 |
Product | expressed protein |
Protein accession | XP_568602 |
Protein GI | 58262384 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 0.94962 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GGAAATCCTC ACCGTTTCGT TTGATTAGTC TGTTAGCCCT CCTTTCCACC TTTGAAAATC GTGCACTTCT CGCCTCGTAT TGTTTCACTC AAGCATGTCC CTGTCTCCCG TGGCAGCAGT ACCAATTCTC CCCCAGGAGC AGCCGTCCGC TTCAACGTCA GCAAGAGAAC AAGACGACAC GTGGTTGAAG CCGTTGTGGG AGTACTCTTT CCCTCCTGTG GCTGATGATA AGCGGCTAGA CAAAGAGGGG TCTGGACAAG CCCCTCAGCA AGCGCAGACG AGTTCTCCGG GGCCTTCGTC CGTGGGCAAT GTATCGCCTC GCACCAAGGC AACCAGTGGA TTGGCATTCA AAAAAAGACG CAGATCAAAA GGTGATCCTG CCCGTGAAGG TGAGAGAAGT AACCGAAGAC GAGGAGAGAA TAAAGGTAAG TTTCTGTCAT GCCGGCGTGG GACATAATGC TCAAGATAAT AAGCAGAAAA AAGAATTATG GAGAGATGCC GTGGTGCGCC ATCCTGGCTT ACTGGCAAGA CAAACGCTAT TCACCCTCCT CCTTCCAGCA GCTTCACCGG TTTATCAGCT CCTATCATCT ACCCACCTCA GATTCCCATC CCACCCTCTT CACATCGCGC CACAAAACCT CCATTTCCTG TGCAACAAAC CAATCATGTT GGCCTGTATG ACAGAAGAGG TTCTATGCCT TCTTCATTCA CTCAGTCTAC ATTAGGCATC ATTGGGCCAA TGGAGAGCTA TGGCACAGGT ATGACGAGAA CACAGTCGTG GGGTGGACCT GATACGCCTA CAAGCAGTGT CTTGGCCACG GCAAGTGACT ATTCGGGGGT CCGGATGAGT GACGACATAA CCAGAAGGAG GGCAAGCGAG CCTCACACCA CCATGTTGGG TGTAACGTTC AAGTCCAACT GCTATACGAA CTCGTCACAG GCATCGATGG GTCTCTTGGA TACTCAGAGT ATGGGTAGAA ATAGAGAAAC CAGTCGGGAC AGAAGTGTTG GCGGCGATTG GCCTGGGCAA AATTCAGGCG TGAGTTTATT GTCAGAGGCC GTGAAGCCAA AGTGCTGAAG TTCTCTTCAG GTGTGGGCGA ATGGTATGGG TAGAAGTAGC CCCCTCCCGA CAGTCACAGA GCCCCAACAG CTCAACAGCT CGTACCCAGC ATGGCTTCTT ATGAATACGG ATGCTGCAAA CTGTCAAGAT GGTTCCGTTC AGTGTGTATT ATTAGACCTG AGAAGAGGGC AGATGCTGAT AACGTACAGC TCTTTTGGTC CCCTTACTCC ATCGTCCATT CAGACTGACC TTCAAACCTT CACGTCCAGC ACTTCCTATC CCCCTCCACC GACAACCTCC TTTAGCCCTG GTCCGACATA CGTGCCCGAG CCCATGAACA TCCAAATGCA TTCCTCAGGG TTCTCTTCCG GGCATTGCTT CTCCCATTGG CAGCCTGACC CTGCACCTTC AATCCATATC TACAATCTGG ACGGATGTGA ATCATACCCG TCCTATTTCA ATGGTATGAA TGGACCTGCG CATCCTCAGA CGGATCATGC ATACCAGCAT CAGGTTCTTA ATTATGATAG GGTAGCTTTT AGGCGTAGGT TGTGCGACGA TGTAGAGGCG GAGGAGCCGG TAGAGCGCAG ATAATGTAAT AGAAGAGAAA AAAGGAGAAA CTGTTGAGGG TCACTGCAAA AAGGTGAATT ATCTTAGTGG TCGTTTGGAT TATCCTGGAA CTGCGAGCTG GTGACGGGCT TGTAGATTGG AACTGTAAAC TGTGACTAGT TTCGGGATTC 
GCGTAAGTTC AGGTTCAACC TTTATCGAGC TCGCAGGATG AACAATATAG GCGGAGGTGG GAGTGGAGGT TTCGTGACCT TTTTCGTATA AAAGTTTGCT GTGAGGTGTT TTCCCGAAAC TTCTCCGAGA ACGGGGGTGG AGAGAAAGGA TTCTAATAAT TGAGAAAAGA CCAAGAGAAT GGAGGATCGA TGGCTGGTGG TGTTGTATAT TTGTGCAGCA TGGTGTGTAG GATAGTATTA CACTCTATCT CTGATGCTAT ACCCGGCAGT TTAACACGTA G
|
Protein sequence | MSLSPVAAVP ILPQEQPSAS TSAREQDDTW LKPLWEYSFP PVADDKRLDK EGSGQAPQQA QTSSPGPSSV GNVSPRTKAT SGLAFKKRRR SKGDPAREGE RSNRRRGENK EKRIMERCRG APSWLTGKTN AIHPPPSSSF TGLSAPIIYP PQIPIPPSSH RATKPPFPVQ QTNHVGLYDR RGSMPSSFTQ STLGIIGPME SYGTGMTRTQ SWGGPDTPTS SVLATASDYS GVRMSDDITR RRASEPHTTM LGVTFKSNCY TNSSQASMGL LDTQSMGRNR ETSRDRSVGG DWPGQNSGVW ANGMGRSSPL PTVTEPQQLN SSYPAWLLMN TDAANCQDGS VHSFGPLTPS SIQTDLQTFT SSTSYPPPPT TSFSPGPTYV PEPMNIQMHS SGFSSGHCFS HWQPDPAPSI HIYNLDGCES YPSYFNGMNG PAHPQTDHAY QHQVLNYDRV AFRRRLCDDV EAEEPVERR
|
| |