Gene CNG03380 details


Gene Information

Locus tag: CNG03380
Symbol:
ID: 3258883
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 951894
End bp: 955275
Gene Length: 3382 bp
Protein Length: 1039 aa
Translation table:
GC content: 51%
IMG OID: 638257963
Product: cell wall surface anchor protein, putative
Protein accession: XP_572069
Protein GI: 58269826
COG category:
COG ID:
TIGRFAM ID:
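The listed gene length matches the start and end coordinates if both positions are counted inclusively. The short Python sketch below checks that arithmetic and, under the further assumption that the coding sequence is the 1039 codons of the protein plus one stop codon, estimates how many bases of the gene span are not coding (consistent with the gene being flagged as spliced). The function and variable names are illustrative and not part of the record.

```python
# Values copied from the record above.
START_BP = 951894
END_BP = 955275
PROTEIN_LENGTH_AA = 1039


def span_length(start: int, end: int) -> int:
    """Length of a feature whose start and end positions are both inclusive."""
    return end - start + 1


def coding_bases(protein_length: int) -> int:
    """Bases needed to encode the protein, assuming one stop codon is included."""
    return (protein_length + 1) * 3


gene_length = span_length(START_BP, END_BP)
cds_length = coding_bases(PROTEIN_LENGTH_AA)
non_coding = gene_length - cds_length

print(gene_length)  # 3382
print(cds_length)   # 3120
print(non_coding)   # 262
```

The 262-base remainder is only an estimate of non-coding sequence within the span; the record does not say how it divides between introns and any flanking untranslated sequence.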


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.644715
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCCCGAG CTTCACTAAC TTTCACCTCC CTCGCCTTCC TCATTTTCAT TACTCCGTTA 
GCTCTCGCCC AATCCTCTGG TACCATTAAC CTTTCATCCA CCTCAGTATG TGTCATTATC
GCTCAAATCG CCTCCCTATC TTCACCCAAC GCCATCTCGG CGATGATGCC CATCTGCCCA
GACTCTGGTG ATACCGCCAT TGCCTGGCCG CTTACGACTA CCGGAGATGG TGCATCTTCC
AGCATCGCAA TCTACGCCCA CACAACTGCC ATTAATGGAG GGGTCAGTCT CGGTTGTGTG
ATGGGCCTCA GCCAGATGCA GGCGATGTAT CTCTTGGGAA GGGATCTGAC GTGGGACGAC
AGTGCTGGTA CGCTGTACGA CAACACTGCT GGCGGGAATC TACTCGATCC TACCACCTAT
GCTACTATAA CCTGGCAAGT GACCTTTTAA ATGAAAGCCA GACATTTAAA TGCTGATATA
TGGACGATCA GTGGATCATC CGAAACGCTC CTTCCATCAG GGCTCGTGCT GCCTGACCCA
TCATCTAACG CCGCTGGAGC TTCAGTCTGG GATCAAACCA TTACTCTGAC CTCTGGTGCC
AAAGCTTCCG GTGGAAACAA CGATGATGTT GTCGAAAGCA CTGTTGCCGA TGTTACCTCG
GTTGCCGCGC CGGCCAGCTC TACCATCCTA ATGACTTCTG CAGCTGCCAA GACCTCAGCT
GCTCCTGTCA CTTCAGTTTA CCGTATCACA TCGGCTGTGC CTGAGACTTC TGCTGCAGCA
GAGACCTCGG AGGCTCCCTC AAGTACCGAT CAATTGGCAG TCCAGACTTC CAGCCCTCTC
TCTCCCTCTC CTAGCTCCAC TATAAATGAG GCTTCGCAAA CTAAAGCAGT TGTCACTTCC
CCTAAGTCGG TCATTAGCGA CCAACTTAGC TATTCCGCTA CCCCTTCGCT TACTGCTGTA
GAGCCTTCTA GTGTTTCAGA AGTGGTTGAC GTTCTTACGA CGTCTTCGAC CACTTTCCAC
GAAGTGGGCA GCTCCGGCTC GCCTACTTCC TCGGAGAGCG ACTCGGACCA GGCGAACTTT
GAATCTTCAG GCCAGCCGGT GGCGGTACAA ACCAGCTCGC CTGCCCCTCA ACCAATCTCT
GATACAGTTC AGCAACTCTC GTCTACCCAA CAAGAGACAA CAGTTGAAGC TCAAATTTCT
GCTCAGCCAG AGACCTCTGC TCAACCTCAA ACTTCTGGCC AGCCTCAGAC TTCTGCGCAG
CCCCAAACAT CTGACCAACC TCAAACATCT GGTCAGTTTC AATCATCAGC TCAGCCCTCA
GGGTCGTCTT ATCGATCGGA AAGTAATGAA CAACCTACCA GGGCCGACTC TTCGTACGCC
TCCGATCCTA CTTCTGCTTA TCAGGCTAGC TCTAGTCACC CGGAAGACTC TACACCTACA
TTGATTCAAT CTTCTGACAG CGCTTCCTTC TCTACCACTG ATGGGAATGA TTACTCCTCC
GATAATGGCG ATTACGATAA CACCAGTGTC ATTGCTGTAC AGACTAGCGC TAGTTTGAAG
GCATCTCCGT CAGTTGTTCA ATCAGAAAAT CAACGTGTCA CCTCTTATGA TGCCTCAACT
TATCAAGTGG CTACGGCCAG TCTTTCGAGT TCGGCGTACC TTACCGACAC CGCCTACGAA
AGTAGTAACG ATGATAATTT GAGCGATTCT GAGGAAGTAC CGCAGACTTC TCCTTCGCAG
TCGCTGAGTG CACCAACCAG CGCATCAGCC GCTATAGTGT CAAACACTTA TCTCATCTCT
TCGAATAGCT CTGATAGCTC AGATGGAGGT GTCACCGTCA CTGCCCTTGA CGCTAGGCCT
TCATCCACCG CTCAAGCGTC ATCCTCGCAA ACTGACGTCA ACGAAGCTGG ATCATCATCC
TCTTCTGCAT CATCGTTCGA AAGCAAAATC TCCCCCAGCG GTAGCGACGA CAATGGCAGC
CCATCCGAGA CCGATCAGTC TCAATCCCAA TATTTTGAGC CGAGCTCGTC CGATGTCTCC
ACTGAACAAT TTACCACGCC TACCTCTTCC CACCAGCCGA CAACATACGC AGCCTTCATT
ATGCCTGCAT CTTCCAGCTC TGCAAGCTCA GACGAGCCAT TTGTCTTTAC TGTCGGAGGA
ATGACCATTG GACAGTTCAC CGAAGCTGGT GCCGATGCCA TGATAGCTGT GCAATTCAGT
TCTACTTCGA CGACTGATAG CCCGGCGGCT GAGAATAGCG CGTGGGCCGA GACTGCTGCT
GTTGAAAACA TTGAAGGCAC TGATGCAACG AGCACAAGTG CGTCTGAAGT GGTCGCTGCT
TCGGCTGCAC GCACAAGCAG TGCCTTCTTA GCAGATGGTA ACAATGGTAG TTGGGTTAGT
TCAGTGAATG AAAGCGTCAG TCCGGGTGCA CCCACGAGTT CGGAAGAAGC AAGGGAGACA
AGCGACAGTA ACCGATCTGG CGAAGACAAT TATGGTACCT ACTCGTCGGC CAGCGCGAGT
ACAGAGGAAA CAAGCAAAGA CAACAACCAA AGAAGTGGCG CCGGCGAGAA CACTGCGAGT
GTCACTGTTG ACTATATAAC CACCTCTCCC ACGCAAAGTT CAAGTATCGT CAGCGCCTCC
TACACTGACC AGCGATGGTC GGCTTCTCAA GCCAAAGCCA CTGTAGGGGG AGAGAGCGAA
GGCTCCAAGG TGTCTGCTGC ACCGACGATA TTTAGCATAT TGTCAAACAG CACCAGTTCC
AATGATTCTT TTGGGCGAGA ATCGTCGGAT GACAAACGTC AAAGTCTGTC AGCGACTTCT
TACCAAGCAA ACTTTACCAT CACTGGTGAA AGCGCAATGG TAAGCGCCAG TGGTCACTCA
CACCACCAGA GTGATACAGC TGTCAATGAC ATGATTTCCC AAATCATTGC CGTGGAAACT
TCTGCAATCT CGTTCGACAG CTCTCCTCTT TCGTCTGTAT ACAGGTTTTC CAAAGGCGAC
GACACGTATA CAAAATCAGG TTCCCCTGGG GCCGAGCCTA CTAGCATGTC ACCTTGGGCT
CAAGCTCTCA GCAACAATAG AAACAGCAGC GCATCTATTT CGGAAATTAT CCGCACTGCA
TCTTCGCTTG TAAGTGGCGC GAGTTACTCG CTGGAAAGCG CAAGTACCCA GAAAGCGGTT
GTGACGGGAG GTCAGAGCAG CGGAAAGACA TGTGCGAGGA AGAGGAAAGA GAAAGCGAGG
AGAGCTAGGG CTTTGCTTAT AGAAGCTGTA TGATGTTTAC GATGGACGTA AGAGGGAAGG
ACCCATTGTC ATAATTTTCT TCTGGTGAAT ACTGAACCTA AAACCCAATA GTATATTTCA
TACTGTTCGT TGACATAATA CA
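
The gene sequence above is printed in blocks of 10 bases separated by spaces. A minimal Python sketch for joining such blocks into one contiguous string and computing a GC fraction is shown here; the helper names are hypothetical, and the example runs on the first sequence line only, so its result should not be expected to equal the 51% reported for the whole gene.

```python
def clean_sequence(blocked: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("sequence is empty")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCCCGAG CTTCACTAAC TTTCACCTCC CTCGCCTTCC TCATTTTCAT TACTCCGTTA"

seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 3))
```

To apply this to the full gene, pass all sequence lines joined by newlines to `clean_sequence`; the whitespace split handles spaces and line breaks alike.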
 
Protein sequence
MARASLTFTS LAFLIFITPL ALAQSSGTIN LSSTSVCVII AQIASLSSPN AISAMMPICP 
DSGDTAIAWP LTTTGDGASS SIAIYAHTTA INGGVSLGCV MGLSQMQAMY LLGRDLTWDD
SAGLVLPDPS SNAAGASVWD QTITLTSGAK ASGGNNDDVV ESTVADVTSV AAPASSTILM
TSAAAKTSAA PVTSVYRITS AVPETSAAAE TSEAPSSTDQ LAVQTSSPLS PSPSSTINEA
SQTKAVVTSP KSVISDQLSY SATPSLTAVE PSSVSEVVDV LTTSSTTFHE VGSSGSPTSS
ESDSDQANFE SSGQPVAVQT SSPAPQPISD TVQQLSSTQQ ETTVEAQISA QPETSAQPQT
SGQPQTSAQP QTSDQPQTSG QFQSSAQPSG SSYRSESNEQ PTRADSSYAS DPTSAYQASS
SHPEDSTPTL IQSSDSASFS TTDGNDYSSD NGDYDNTSVI AVQTSASLKA SPSVVQSENQ
RVTSYDASTY QVATASLSSS AYLTDTAYES SNDDNLSDSE EVPQTSPSQS LSAPTSASAA
IVSNTYLISS NSSDSSDGGV TVTALDARPS STAQASSSQT DVNEAGSSSS SASSFESKIS
PSGSDDNGSP SETDQSQSQY FEPSSSDVST EQFTTPTSSH QPTTYAAFIM PASSSSASSD
EPFVFTVGGM TIGQFTEAGA DAMIAVQFSS TSTTDSPAAE NSAWAETAAV ENIEGTDATS
TSASEVVAAS AARTSSAFLA DGNNGSWVSS VNESVSPGAP TSSEEARETS DSNRSGEDNY
GTYSSASAST EETSKDNNQR SGAGENTASV TVDYITTSPT QSSSIVSASY TDQRWSASQA
KATVGGESEG SKVSAAPTIF SILSNSTSSN DSFGRESSDD KRQSLSATSY QANFTITGES
AMVSASGHSH HQSDTAVNDM ISQIIAVETS AISFDSSPLS SVYRFSKGDD TYTKSGSPGA
EPTSMSPWAQ ALSNNRNSSA SISEIIRTAS SLVSGASYSL ESASTQKAVV TGGQSSGKTC
ARKRKEKARR ARALLIEAV