Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | CNH00520 |
Symbol | |
ID | 3259163 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006693 |
Strand | - |
Start bp | 1035730 |
End bp | 1038561 |
Gene Length | 2832 bp |
Protein Length | 742 aa |
Translation table | |
GC content | 50% |
IMG OID | 638258434 |
Product | intra-Golgi transport-related protein, putative |
Protein accession | XP_572244 |
Protein GI | 58270176 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.100818 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTCCATACAG TCAACCGGTA TTCAATACAG TCCTCGAGGG CACAGCCCAT CCCAGGAATC ATGTCCACCA ACCCAAACAC ACCTGCACCA CCAGCGTCGA GGACGAATCC CATCTCGCTC AGGATCTACA AAGCCATCGG CACATCTTTC GACGATGTAT CGTCAAGAGA AGCCTTGGAA ATTGCTTCAG GGATGTATGG TCCTGAAGAC CCAAAAGCGA AGGTCAAAGC GCAGCCAGAA TATGAAGAAC TAGAAGAAGA CGATGACACT CTCCCAAAGA GACGTACATT AAAAGGTCAA AGTGCTGCTA TAGCGAGGAA GTACCTTAAA CAGGACATCG AAACGTGCCT TGCGACTGGG AGTACCAAGT TTCTCGAAGC TTTCGCAGAG GTTGACCAGG TGGGTTTACT GGGTAGCAGA AGCATATCTG ATGGGGTAGA AACTCAATGT CCTGAGGGAA CATATGCAAG AAATGCAAGT GCGGTGCGAC CAAGTTCAAT CCGAACTTGA TCAAGCGAAT AGCGGTACAA AGTTTCTCCT CGAACGAGCG GATGTCCTCA GGTCTCAACG GTATGTATCG TACCCTGTGA ATCAAGGCTA ATGATTAATA GTGACTCTGC CCAACTTCGT GCCCATCTTA TCACCCTCTT CCTCTCGCGA TTCACTCTGT CCAACTCTGA GCTGACAGCT CTTACATCCC GTGAGGTAAC CATCGGTCAA CCCTTGTTCG ACGCTTTGGA CCATGTGGAG AAGATTAGGA CGGATTGTGA GGTGCTTCTC AGCGGTGAAG AAGGGAAGGC ACAGGCAGGG TAAGTCATCA CGACGGAAGA AGGAGGGATT GTGCTTACCC GCACGCCACT CGTAGCCTTG ATATAATGTC GCTCACTTCT GAACAGTTGG AATCGGGATA TTCGAAGATC CATCGATATT GTCAATTCGA ATTCCGACAG TTTACCCGTG AAGCACAGCT CGAAGCGTCC TCCGTGATGC GTCAGGCTAT CTGTCGTTTG CGCGATCGTC CTGCTCTACT TGCGTAAGTT CAAGTCCCCA TTCACCTGCT GTCATACATA ATCCTGATAA TCTTGTGCAC CCCTGCTGAC ATCGATTCGC GCAGTGATGC AATCCAAACC CTTACATCTA CCCGTCAATC ATCCATCCTC CACCAATTCC TCGACGCTCT CACTCGAGGA GGTCCCGGTG GTCTCCCCCG ACCTATCGAA ATCCACGCAC ACGACCCAAC GCGATATGTC GGTGATATGC TTGCTTGGGT CCATCAGACG ACTGCGACGG AACACGAGTT TTTGGAAGGG ATGTTTGGGG TAAAGGAGAA GAAACGGTGG GTGGGTCAGG AGAGAGGTGG GGAAGAGGGG GAAGAGGAGA GGATGGCTAG TGAGGTGTTG GATAAGGATC TTGAAGGGTT GAGTCGGCCT TTGAAGGTGG GTGTTTTAAA GACGGTGACT AGAAGCTAAC AAGTTTTCTC AAGTTGCGTA TCCAAGAAAC GATAAAATCA CAAGAAGGTA TCATCATGAC GTACAAGATT GCCAACCTTT TACATTTCTA TCTTGTGACT ATGCGAAAGA CTATTGGGGG AAAAGCCATG CTGGTTCAGA CATTACAAGA GTACGTCTTC GTTGGCAGGA AACCATGCTG ACATGTAAGG ATCCATGACC AAGCTTATAT CGCCTTCTAC GAAACTCTCG ACGCTCAAGG TCGAGGTCTT CTCCGTTTTC TCCACGTATG TATCACCATC CGCTCTTCCG GCGGACCATC ACTAACATAA AAAACAGCCC CCAGATGCGA CACTCACTCC ACCCATCACA CTGCGCGACG CCGCTCAAAT CCTCCGCGAA CTCTTATTTG TCTACTCTAC TTCCCTCATC GACCCTGCTG AACGCGAATC AGATGCGGAT CTGGCAAAGT TATTGGATAA AGCAGTTGGA CCTTGTGTGG AGATGTGTGA GAGGATGGCA GAGATGAGAA GGGGAAAAAG CGGTGGTGGG GAGTGGGAGA GGGATATCTT TATGGTTAAC AGTTTGGGTT ACCTAGAGGT AAGTTTGTGG ATGGAGCATG AGCGTGAATG ACGGGGGTTA ACGACGTGTA GCATACATTG GAGATGTATG ACTTTACCAC AAAGACGTTG CATATGTTGG ATGAGAAGAT CAAGACTCAT GTGGAGAGTA TGACTTTTGA ACATGTGAGT TTTATTTTGT CATGCCTCGT CACTGACCTT TGTAGCATGG TAAACTTCTG GAGTCTTGTG GCCTTGCCGC CGTCATGCGT ACCATCCGTA CTCGTCCAGA AGATGCAAGT CCCATTCACC CCCGCCATCA AAGGAAGAAC CTTTCAAGCT AACTTTTGCT TTTGTTTTTC AGACCCCTCT ATCCCGTCTT CACGCCACAT CCCCCAAATC ACTCACGTCC GCCCTCTCCA AGTTCTCTAC CTGGATATCC ACCGTCGACC CTTCCACCTC CCCTCGCCTC GCGCTCTTGA CTTCTCCACG ACTTGCAGTA GAGATTCATC GGAAAGCTTT ACGTAAGATA TATGATGCTT ATGGAGAAAT TTGTGAGAGG GTGCTGGATA AAGCGGAGGG GTATGAGTTT GGGGAGACGA TGTTGAGGAG AGGGAGGGAC GAAGTTGGGG TTGCACTCGG GGTGGGGGAA GACTGGGAAC TGGAAGAGGA CACGGAGGAG AAAAGCATGA AACAGAAAGA ACAGCAGGAT GAGGATACGG AAGATCAAGG GGAGAAGGGC ATCATGCAAG AAGAACATAA GGCACAGGAC GCTGGGAACA CAGAAGACAA GGCATAGAGA AGGAGATATA CGCCGCATGT AG
|
Protein sequence | MSTNPNTPAP PASRTNPISL RIYKAIGTSF DDVSSREALE IASGMYGPED PKAKVKAQPE YEELEEDDDT LPKRRTLKGQ SAAIARKYLK QDIETCLATG STKFLEAFAE VDQKLNVLRE HMQEMQVRCD QVQSELDQAN SGTKFLLERA DVLRSQRDSA QLRAHLITLF LSRFTLSNSE LTALTSREVT IGQPLFDALD HVEKIRTDCE VLLSGEEGKA QAGLDIMSLT SEQLESGYSK IHRYCQFEFR QFTREAQLEA SSVMRQAICR LRDRPALLAD AIQTLTSTRQ SSILHQFLDA LTRGGPGGLP RPIEIHAHDP TRYVGDMLAW VHQTTATEHE FLEGMFGVKE KKRWVGQERG GEEGEEERMA SEVLDKDLEG LSRPLKLRIQ ETIKSQEGII MTYKIANLLH FYLVTMRKTI GGKAMLVQTL QEIHDQAYIA FYETLDAQGR GLLRFLHPPD ATLTPPITLR DAAQILRELL FVYSTSLIDP AERESDADLA KLLDKAVGPC VEMCERMAEM RRGKSGGGEW ERDIFMVNSL GYLEHTLEMY DFTTKTLHML DEKIKTHVES MTFEHHGKLL ESCGLAAVMR TIRTRPEDTP LSRLHATSPK SLTSALSKFS TWISTVDPST SPRLALLTSP RLAVEIHRKA LRKIYDAYGE ICERVLDKAE GYEFGETMLR RGRDEVGVAL GVGEDWELEE DTEEKSMKQK EQQDEDTEDQ GEKGIMQEEH KAQDAGNTED KA
|
| |