Gene CNG00620 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG00620 
Symbol 
ID3258972 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp168934 
End bp172270 
Gene Length3337 bp 
Protein Length870 aa 
Translation table 
GC content49% 
IMG OID638257678 
Productconserved hypothetical protein 
Protein accessionXP_571775 
Protein GI58269238 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.716502 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
AGCCCGCACC CTATATTAAT ACCAACGTCG CATATTACGA AGGATGAGCG AGCAAGTGGG 
ACAAAGAGAA GCAGTCGATT CTCCTCAAGC TACTGGCGTG AAGACACCTC CGGAGCCTTG
TCCTTTATGT CGAGAAACCG GTCCGCCTCA GCCTCCCTCT ATCACAGAAG AAGGCAAGAC
AAACGAAGAC ATCGACTTCA TATGGGTAGC GTGTAACAAA TGTGATGAGT GGTATCACTC
GGCTTGTTTA TTCCTTGGTG ATGAGAAGTG GAGAGGGACG ATACCCAAAG AGATTATTTC
GACAGTCGAG ACGAACTTTG GCGACGAAGG GGCATGGACG AATTGGGTTG AGTGGATAGG
AAAATGGTAT GCGTATTTCT CATGACTAGG TTTTTCACAT CTGACAATCA GCCACCGGGG
ATCAGGTATT GTGCCCCTTG TCTGGCTCGC TCAACTTCGC CGTCGAACCC TCGCCCCCCT
CGGCATCCAC TAGTTGCCAC TATGAAGCGA GCATCGATTC AACCTAAAGA CATCGATCAA
GCTGGAAAGC CTCTTAAAAG GTCTGCAAGC ACTTCAGCTC CTCTACTCAA GTCCAATATC
AAGCGACCTC GCACCAGTAC AAAAGGTCAG GAAACAGCGT CACCTGAGAT TGATATGAAG
AGCGAGAGGG AGCAACAAGC GGAGAGTACT GCGGGGACAC CTGCTTCGGA TGCTCCTCAA
GGGCGACCGA AACGGAAAAC TGCCCAAATT GATTACCGCA ACTTGAACAA CTCGATTGCT
ACGCCTACTC ATCAATGGTT GGAGTTGATT GCGGATCCTG AGAAGTATGG ACGAACAATC
TTAGACGGTC AGTCAATCAA TTTTACTGGA TAGGCGAGGG TTGATCAAAA CATGTATAGC
CAACTACCCT GCTCTTCCTG GGAAACTTTT GACGCGTGCG TGGCTGGAGT CACAACCTTT
ACCCGGGCAA CCTTCTTCTA TCTCGCCTGA TCTACTACCA ACTCGATTCT GGGGTCCCGA
TAGGGAGCCG CTCATCGTCA GGCCGGAAAA TGGCGGATTC TCAAGTCTAG GAGGACATTT
ACCGTCGAAG GACTTGACCG TGCAGGATGT GGCTAATCTG GTCGGTCCAG ATCGAATGGT
AGATGTGATT GGTATGCCCC TTTTTACTCT CAACACAACG ATCATAGATA CTAATTGTTA
AGGTTACTGT AGACGTCTCC TCTCAGCACT CTTCACAATG GACCCTTCAG AAATGGGCCG
AATATATTCA ATCATCTTCT GGAAATACCA GTGTCCGCAA CCCCAAGGTC TACAACGTTA
TCTCCCTCGA AATATCTGGT ACAGAACTAG CCAAAAAGGT GAAGCCGCCC AAGATCGTCA
GAGAGATCGA CTGGGTGGAC AATTTTTGGA GATTCAATGC CGGAGCGGGA GGGAAGGATG
TGAAGGAAAA GGGTAGGGGG AATGATAGCA GAGAAGGCAG TGAGATAAGA AAGGAGGGAA
GCCACTTAAC CGAAGGGGAT AATGGGGGCG AGATTGAAGA GGATCTTGAA GGTTTGAAAG
AGAAAACAAA TACGCCTTAC CCCAAGGTCC AACTTTATTG CCTCGTAGGT TCCTGCACTT
CAAACTGATC TGTCGGAAGA TATTTACAAC GAACTGATTT TCGGAATTCA GATGGGTATG
AAGGGTGCGT GGACAGTAAG CTACAACTAT AATTGCTCTG GTAGATTTAT TCGTTAATGG
ATCGTCTATA GGACTGGCAC GTTGATTTTG CCGCAAGCTC TGTATATTAC ACGATCCATT
CCGGCGCAAA GGTAAAACTC TCCTGCTTTG TGTCGTTCTT CTCTCCCGGC TCATACCATT
CTCAGGTCTT CTTTTTCGTC AAGCCCACCG AACAAAACCT CAAAGCCTAC GCAGAGTGTA
CGTCAAACTC TCACAAAATC GCCTGGCCCC TTATCCCTTA CTCCTTATTT CTTACCTCGC
GGATCCCCCA GCTGAATCCA TATCTTAGGG TCTGGTTCCT ACGAGAAACA GCAAGATACA
TGGTTGGGCG ACATGGTTGA CGAAGTCCGA AAGGTGGAAT TGCACGCTGG TGATACGATG
TAAGTGATAG TTATGACAAA AAAGCGCGTG TTACCTGACG GGCTTGTCTA GGATAATACC
GACAGGTTAC ATCCACGCCG TTTATACACC CATGGACTCT ATCGTTTTTG GAGGGAATTT
CCTGCATTCG TATAACGTTG ATACTCGTGG GTTTTCAGAT GCTTGAATGC CCCGCGCAAT
TCGATTGTCA CTGACACTTT CATTTTCGTC TGGTAAAGAA TTACGACTAC GTCAGATTGA
GATCGATACT AAAGTCCCTC AGAGATTCCG TTTTCCCATG TTTGACCGGT ACGTCATTTC
CCCCTACAGT TTTGTGCGAT CCATTCATTG ATGATCTTTG GTCATTTCCA GGCTCTGCTG
GTACGTCGCC GAAAAATACT GTAGCGACCT CCGTCATCTT CGTGCCTACC GCCCTCGAGC
AACCACCACT CCGAAACCGC CACATTTTCG CGTCCTCCAA TGTCTCTCCT ATCTTGCCAA
TTTCCTCGTC TCCCAAACGG GAATTCTCGA GGACCCCGAG GCGGAAGATA AAGCTCGGAA
ACTGGTACAC GATAGGATAC CGGGAGATAT CGTCAAGGAT CCCGAGGGAC TAGCAAAGGA
GCTGAAGTGG AGGGTGGAGA GAGAGTTGGG AGCGTTGGGT TTATTAGGGG AGGAGGCGTC
TGGTGTCGAG GCGGAGGAGT TCAAGTCCAA TGGCACCGCG AATGGGAGCG TGAAAATCAA
GGGGAAGGAG GTTAGCAGGA AAAGGGATCG GTTGAGTAAA GTGTTTGATA AGAAGGCAAT
ATCTCGAACT TGGGATTTCC ACCCTCCGTG AGTCCCTTTC CCACAATCCA CTTTGAACTT
GAAGCTGATT TGATTCCAGA GCATGGTCAG AAAACAGACA ATCCCCTCAA ATAGAAACGA
CAACCGTCCA ATTACCTCGC CCTAGCACAT CTTCTTCCGA CGCTATCTCA GGTTCTGGCC
CTGGCGCTAG CCCGGGTGCG AGTGCGAATG GGGGTGCAAA CGAAAACGAA CAAGCAGAAT
TGACGACAAT GCTCGTGAAG CAGACTCGTA AACGTATGCG GGAACTGGAC GATGGTACGG
TTATTGAAGA GAGTCAGGAA ACCACCTTTG TGGAGAAGAA GACGATTTGG GGACCGAAGC
TCGACAAAGA GAAGATATCA CAACCGCAAG GGAAAGTAGA GGAGGATATG GACATTGACC
ACTGATCATT TGGCGTTGGC TATTTGCAGA AAGTTGA
 
Protein sequence
MSEQVGQREA VDSPQATGVK TPPEPCPLCR ETGPPQPPSI TEEGKTNEDI DFIWVACNKC 
DEWYHSACLF LGDEKWRGTI PKEIISTVET NFGDEGAWTN WVEWIGKWYC APCLARSTSP
SNPRPPRHPL VATMKRASIQ PKDIDQAGKP LKRSASTSAP LLKSNIKRPR TSTKGQETAS
PEIDMKSERE QQAESTAGTP ASDAPQGRPK RKTAQIDYRN LNNSIATPTH QWLELIADPE
KYGRTILDAN YPALPGKLLT RAWLESQPLP GQPSSISPDL LPTRFWGPDR EPLIVRPENG
GFSSLGGHLP SKDLTVQDVA NLVGPDRMVD VIDVSSQHSS QWTLQKWAEY IQSSSGNTSV
RNPKVYNVIS LEISGTELAK KVKPPKIVRE IDWVDNFWRF NAGAGGKDVK EKGRGNDSRE
GSEIRKEGSH LTEGDNGGEI EEDLEGLKEK TNTPYPKVQL YCLDWHVDFA ASSVYYTIHS
GAKVKLSCFV SFFSPGSYHS QVFFFVKPTE QNLKAYAEWS GSYEKQQDTW LGDMVDEVRK
VELHAGDTMI IPTGYIHAVY TPMDSIVFGG NFLHSYNVDT QLRLRQIEID TKVPQRFRFP
MFDRLCWYVA EKYCSDLRHL RAYRPRATTT PKPPHFRVLQ CLSYLANFLV SQTGILEDPE
AEDKARKLVH DRIPGDIVKD PEGLAKELKW RVERELGALG LLGEEASGVE AEEFKSNGTA
NGSVKIKGKE VSRKRDRLSK VFDKKAISRT WDFHPPAWSE NRQSPQIETT TVQLPRPSTS
SSDAISGSGP GASPGASANG GANENEQAEL TTMLVKQTRK RMRELDDGTV IEESQETTFV
EKKTIWGPKL DKEKISQPQG KVEEDMDIDH