Gene Information |
Locus tag | CNG03380 |
Symbol | |
ID | 3258883 |
Type | CDS |
Is gene spliced | Yes |
Is pseudo gene | No |
Organism name | Cryptococcus neoformans var. neoformans JEC21 |
Kingdom | Eukaryota |
Replicon accession | NC_006692 |
Strand | - |
Start bp | 951894 |
End bp | 955275 |
Gene Length | 3382 bp |
Protein Length | 1039 aa |
Translation table | |
GC content | 51% |
IMG OID | 638257963 |
Product | cell wall surface anchor protein, putative |
Protein accession | XP_572069 |
Protein GI | 58269826 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.644715 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCCGAG CTTCACTAAC TTTCACCTCC CTCGCCTTCC TCATTTTCAT TACTCCGTTA GCTCTCGCCC AATCCTCTGG TACCATTAAC CTTTCATCCA CCTCAGTATG TGTCATTATC GCTCAAATCG CCTCCCTATC TTCACCCAAC GCCATCTCGG CGATGATGCC CATCTGCCCA GACTCTGGTG ATACCGCCAT TGCCTGGCCG CTTACGACTA CCGGAGATGG TGCATCTTCC AGCATCGCAA TCTACGCCCA CACAACTGCC ATTAATGGAG GGGTCAGTCT CGGTTGTGTG ATGGGCCTCA GCCAGATGCA GGCGATGTAT CTCTTGGGAA GGGATCTGAC GTGGGACGAC AGTGCTGGTA CGCTGTACGA CAACACTGCT GGCGGGAATC TACTCGATCC TACCACCTAT GCTACTATAA CCTGGCAAGT GACCTTTTAA ATGAAAGCCA GACATTTAAA TGCTGATATA TGGACGATCA GTGGATCATC CGAAACGCTC CTTCCATCAG GGCTCGTGCT GCCTGACCCA TCATCTAACG CCGCTGGAGC TTCAGTCTGG GATCAAACCA TTACTCTGAC CTCTGGTGCC AAAGCTTCCG GTGGAAACAA CGATGATGTT GTCGAAAGCA CTGTTGCCGA TGTTACCTCG GTTGCCGCGC CGGCCAGCTC TACCATCCTA ATGACTTCTG CAGCTGCCAA GACCTCAGCT GCTCCTGTCA CTTCAGTTTA CCGTATCACA TCGGCTGTGC CTGAGACTTC TGCTGCAGCA GAGACCTCGG AGGCTCCCTC AAGTACCGAT CAATTGGCAG TCCAGACTTC CAGCCCTCTC TCTCCCTCTC CTAGCTCCAC TATAAATGAG GCTTCGCAAA CTAAAGCAGT TGTCACTTCC CCTAAGTCGG TCATTAGCGA CCAACTTAGC TATTCCGCTA CCCCTTCGCT TACTGCTGTA GAGCCTTCTA GTGTTTCAGA AGTGGTTGAC GTTCTTACGA CGTCTTCGAC CACTTTCCAC GAAGTGGGCA GCTCCGGCTC GCCTACTTCC TCGGAGAGCG ACTCGGACCA GGCGAACTTT GAATCTTCAG GCCAGCCGGT GGCGGTACAA ACCAGCTCGC CTGCCCCTCA ACCAATCTCT GATACAGTTC AGCAACTCTC GTCTACCCAA CAAGAGACAA CAGTTGAAGC TCAAATTTCT GCTCAGCCAG AGACCTCTGC TCAACCTCAA ACTTCTGGCC AGCCTCAGAC TTCTGCGCAG CCCCAAACAT CTGACCAACC TCAAACATCT GGTCAGTTTC AATCATCAGC TCAGCCCTCA GGGTCGTCTT ATCGATCGGA AAGTAATGAA CAACCTACCA GGGCCGACTC TTCGTACGCC TCCGATCCTA CTTCTGCTTA TCAGGCTAGC TCTAGTCACC CGGAAGACTC TACACCTACA TTGATTCAAT CTTCTGACAG CGCTTCCTTC TCTACCACTG ATGGGAATGA TTACTCCTCC GATAATGGCG ATTACGATAA CACCAGTGTC ATTGCTGTAC AGACTAGCGC TAGTTTGAAG GCATCTCCGT CAGTTGTTCA ATCAGAAAAT CAACGTGTCA CCTCTTATGA TGCCTCAACT TATCAAGTGG CTACGGCCAG TCTTTCGAGT TCGGCGTACC TTACCGACAC CGCCTACGAA AGTAGTAACG ATGATAATTT GAGCGATTCT GAGGAAGTAC CGCAGACTTC TCCTTCGCAG TCGCTGAGTG CACCAACCAG CGCATCAGCC GCTATAGTGT CAAACACTTA TCTCATCTCT 
TCGAATAGCT CTGATAGCTC AGATGGAGGT GTCACCGTCA CTGCCCTTGA CGCTAGGCCT TCATCCACCG CTCAAGCGTC ATCCTCGCAA ACTGACGTCA ACGAAGCTGG ATCATCATCC TCTTCTGCAT CATCGTTCGA AAGCAAAATC TCCCCCAGCG GTAGCGACGA CAATGGCAGC CCATCCGAGA CCGATCAGTC TCAATCCCAA TATTTTGAGC CGAGCTCGTC CGATGTCTCC ACTGAACAAT TTACCACGCC TACCTCTTCC CACCAGCCGA CAACATACGC AGCCTTCATT ATGCCTGCAT CTTCCAGCTC TGCAAGCTCA GACGAGCCAT TTGTCTTTAC TGTCGGAGGA ATGACCATTG GACAGTTCAC CGAAGCTGGT GCCGATGCCA TGATAGCTGT GCAATTCAGT TCTACTTCGA CGACTGATAG CCCGGCGGCT GAGAATAGCG CGTGGGCCGA GACTGCTGCT GTTGAAAACA TTGAAGGCAC TGATGCAACG AGCACAAGTG CGTCTGAAGT GGTCGCTGCT TCGGCTGCAC GCACAAGCAG TGCCTTCTTA GCAGATGGTA ACAATGGTAG TTGGGTTAGT TCAGTGAATG AAAGCGTCAG TCCGGGTGCA CCCACGAGTT CGGAAGAAGC AAGGGAGACA AGCGACAGTA ACCGATCTGG CGAAGACAAT TATGGTACCT ACTCGTCGGC CAGCGCGAGT ACAGAGGAAA CAAGCAAAGA CAACAACCAA AGAAGTGGCG CCGGCGAGAA CACTGCGAGT GTCACTGTTG ACTATATAAC CACCTCTCCC ACGCAAAGTT CAAGTATCGT CAGCGCCTCC TACACTGACC AGCGATGGTC GGCTTCTCAA GCCAAAGCCA CTGTAGGGGG AGAGAGCGAA GGCTCCAAGG TGTCTGCTGC ACCGACGATA TTTAGCATAT TGTCAAACAG CACCAGTTCC AATGATTCTT TTGGGCGAGA ATCGTCGGAT GACAAACGTC AAAGTCTGTC AGCGACTTCT TACCAAGCAA ACTTTACCAT CACTGGTGAA AGCGCAATGG TAAGCGCCAG TGGTCACTCA CACCACCAGA GTGATACAGC TGTCAATGAC ATGATTTCCC AAATCATTGC CGTGGAAACT TCTGCAATCT CGTTCGACAG CTCTCCTCTT TCGTCTGTAT ACAGGTTTTC CAAAGGCGAC GACACGTATA CAAAATCAGG TTCCCCTGGG GCCGAGCCTA CTAGCATGTC ACCTTGGGCT CAAGCTCTCA GCAACAATAG AAACAGCAGC GCATCTATTT CGGAAATTAT CCGCACTGCA TCTTCGCTTG TAAGTGGCGC GAGTTACTCG CTGGAAAGCG CAAGTACCCA GAAAGCGGTT GTGACGGGAG GTCAGAGCAG CGGAAAGACA TGTGCGAGGA AGAGGAAAGA GAAAGCGAGG AGAGCTAGGG CTTTGCTTAT AGAAGCTGTA TGATGTTTAC GATGGACGTA AGAGGGAAGG ACCCATTGTC ATAATTTTCT TCTGGTGAAT ACTGAACCTA AAACCCAATA GTATATTTCA TACTGTTCGT TGACATAATA CA
|
Protein sequence | MARASLTFTS LAFLIFITPL ALAQSSGTIN LSSTSVCVII AQIASLSSPN AISAMMPICP DSGDTAIAWP LTTTGDGASS SIAIYAHTTA INGGVSLGCV MGLSQMQAMY LLGRDLTWDD SAGLVLPDPS SNAAGASVWD QTITLTSGAK ASGGNNDDVV ESTVADVTSV AAPASSTILM TSAAAKTSAA PVTSVYRITS AVPETSAAAE TSEAPSSTDQ LAVQTSSPLS PSPSSTINEA SQTKAVVTSP KSVISDQLSY SATPSLTAVE PSSVSEVVDV LTTSSTTFHE VGSSGSPTSS ESDSDQANFE SSGQPVAVQT SSPAPQPISD TVQQLSSTQQ ETTVEAQISA QPETSAQPQT SGQPQTSAQP QTSDQPQTSG QFQSSAQPSG SSYRSESNEQ PTRADSSYAS DPTSAYQASS SHPEDSTPTL IQSSDSASFS TTDGNDYSSD NGDYDNTSVI AVQTSASLKA SPSVVQSENQ RVTSYDASTY QVATASLSSS AYLTDTAYES SNDDNLSDSE EVPQTSPSQS LSAPTSASAA IVSNTYLISS NSSDSSDGGV TVTALDARPS STAQASSSQT DVNEAGSSSS SASSFESKIS PSGSDDNGSP SETDQSQSQY FEPSSSDVST EQFTTPTSSH QPTTYAAFIM PASSSSASSD EPFVFTVGGM TIGQFTEAGA DAMIAVQFSS TSTTDSPAAE NSAWAETAAV ENIEGTDATS TSASEVVAAS AARTSSAFLA DGNNGSWVSS VNESVSPGAP TSSEEARETS DSNRSGEDNY GTYSSASAST EETSKDNNQR SGAGENTASV TVDYITTSPT QSSSIVSASY TDQRWSASQA KATVGGESEG SKVSAAPTIF SILSNSTSSN DSFGRESSDD KRQSLSATSY QANFTITGES AMVSASGHSH HQSDTAVNDM ISQIIAVETS AISFDSSPLS SVYRFSKGDD TYTKSGSPGA EPTSMSPWAQ ALSNNRNSSA SISEIIRTAS SLVSGASYSL ESASTQKAVV TGGQSSGKTC ARKRKEKARR ARALLIEAV
|
| |
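The coordinate and length fields in this record can be cross-checked with a few lines of Python. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates (which is consistent with the reported Gene Length) and that the sequence fields are grouped into space-separated blocks of ten residues, as they appear here. The function names (`inclusive_length`, `clean_sequence`, `gc_fraction`) are hypothetical helpers introduced for illustration, not part of any database API.

```python
def inclusive_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def clean_sequence(grouped: str) -> str:
    """Remove the spaces/newlines between 10-residue blocks and upper-case."""
    return "".join(grouped.split()).upper()


def gc_fraction(grouped: str) -> float:
    """Fraction of G and C bases in a (possibly space-grouped) DNA string."""
    seq = clean_sequence(grouped)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# Values copied from the record above.
START_BP = 951894
END_BP = 955275
PROTEIN_LENGTH_AA = 1039

gene_length = inclusive_length(START_BP, END_BP)

# Minimum number of coding bases: three per residue plus one stop codon.
# The gene is flagged as spliced, so the genomic span may be longer than this.
min_coding_bp = 3 * (PROTEIN_LENGTH_AA + 1)

print(f"gene length: {gene_length} bp")
print(f"minimum coding bases: {min_coding_bp} bp")
print(f"non-coding bases within the span (at most): {gene_length - min_coding_bp} bp")
```

To check the reported GC content, pass the full text of the Gene sequence field to `gc_fraction` and compare the result, rounded to a whole percent, with the GC content row.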