Gene CNG02640 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNG02640 
Symbol 
ID3258607 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006692 
Strand
Start bp740445 
End bp743839 
Gene Length3395 bp 
Protein Length868 aa 
Translation table 
GC content55% 
IMG OID638257886 
Productconserved hypothetical protein 
Protein accessionXP_572021 
Protein GI58269730 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CAACGTTTCT CCTGCGAGCA GTATCCAGCC CAGTACGTGT ACGACAGACC GAGTGCGCAC 
CGCCTCGCCT CCACATTCCC GCTTCCCACC GTTCCCTCGT AGCAACAATA TTTCTCACCT
TGCTCATCCC CACCGCCGAG CGCATCTCCC ATGTCCCCCG CCCCTGTAGC ACCCATCCGC
CAGCCGCGCG GCCCGTCCAC AAACTCCTTC TCACAGGCCG CCTCCCCCCC GTCATCCACG
TCGGCCGGCA ACTCGGGCAC TGCAGCAGCC ACAGGTCCGC CCGCCCCCGA CACAGACTCG
TTGCGGCGGG CGAACACCGT CTCCAACCCT CGCCACCAGC CGTCCGCATC CATCTCTGCC
AGCTCCGCCG CCTCGTCCGC TTTCCATGGC GTCCCGCCAT CCGCCGCCAG CACATCGGCC
AGCTCCAGCA GGGGCGTCAG AGGCGTCAGC CGTTTCAGGA GTGGATCGCT GTCCAACGCA
TTGCCAGACC CGGGCCTAGT AAGAAGGGCC AGTGGGAGAG AGGTAGCGAA GGAAGTAGTG
CTGGAGAATG AAGGCGAGGG CGAAGGCATC GAGGCGGCGA GCTGGGGCAA GGGTCTGGGC
AGGCAGAGCA GCCTGCCCTC CAGGCGAGGT ACATTCTGAG CAGTCCGGCA GTCGTTCCAG
GAGCTGACAT CCAGTTAGGT CTCAGCCTTG CTCAGGCTGC CAACCAGGGC GCTCCTGAAC
CAGTACCGCC GCCCCGCCCG CCACGCCGCA TCGCTATAAA CACCACCACT GTATCCGGCC
CCACACCGCC CGTTTCTGCC CATTCGGCAT CGCACTCTCT GTCCTCTCTC TCCATGCTCC
ACCGTCCGAC CTACTCTTCC CAACTGTCCC AGGACGAACC TATCCCTTCA GCAACAAGCG
GTGGCGTATC GCGCACGCAG AGCCTGAGGG CACAGGCCAA ACATAGTGAT GTCGGCGGAC
TGGGAAGAAG CATTAGTCTC AAGGCTGTAG GCGAGGTGTG TTGATTGTCT GCCGACAGTG
AGCAAAGGCT AACCTGCATA GCATACGTAC CGCCAGTCAA CGAGCCAGCA GGCGAGTAGC
GGCCAGTCTT CAGGTGTCAT TTCCTCTCCA TCTTCTGTTG CATTCTCTCC TCCCGCCCCT
CCGCCCGTTT CTTCCACAGC AGCGCTCCCC CCGTTCCAGC TTCCACTTCC TGACTCCTCC
TATGTTTCGA CAACGACGGG CGGCGCGGAC GTCAAGCGCC ACCAAAGTCT TACTCAGGGG
TACGGCTCAT CGTCCCGCGT CCGTGATCGA TTGGAGCGCA GTCCCGCCAT CACGCTTGAG
CCCCGTGAAC CTCGCGATAC GTCGGGCGTG AGGGTGAGGA TCGATACCAG GTTTGAAGAC
GAACCGCCTA CGAGCCCCAT TGGTCCCAGC GTGTGGTCCA ACACCCTCTC CAGGACGGAT
GACGGTTGGG GCTCTGCGCC TCTGAACCAA AGCGCCAGGA ATGCTTCGCA GCAGCTGCAA
GACGCCTTTG AAGCAATGAC TCTTGGACGG AATATGATGG CTCCCGACGG TAACGCAATG
CTTGCGAATG ATTTCGATCC CAGAGAAAGG CTCAGCCTGG AGACAGACAT CATGCGTCAC
CCACAACCAT CCTCCAATAA TACTTTACCG TCCGGTGGGC AAGGTCTCCA GAGGATCAAC
TCTGGCGCCG AAGAACCTAG CTGGGTGAAC AAGCTCGTGG GAGGTGATTC AACGCCCATG
GTCGGCAACG TGATGCCTCG CTCCTCCAGT GCGCTCGGTT GGAGCGAACG TGCGCGCCAG
AATGAGACTC AAAGGTTCCC GTACGGGGCT TACAACATGC CCTTTGGCAA TGGATTTAAT
ATGGGTCTAA AGCCTCAAAT GGGTCAAATG GGCATGCCTC AAATGGGTCC GGTGGGTATG
AATGGTATGC CTATGGGAAT GCCCATGGGT ATGGGCCCCA TGAACATGGG CTTCCCGATG
GGGATGAACA TGGGTTTCGG GCAGATGGGC GCAAACGGAT TTGGTCAGTT TGCCCCTGGC
CATCCCGGTC AAGCTCAAAA CCAGCATCCC GGTCAAGGAT ACCCCCCACC ATCTAGTTCG
GCTGCACCCA CTGGCCACGG CATGGGCGCG CAGGATAGGG AAGTGATCGA GTTGGCCAGA
AAGAAGGGAT TAAACCCGGC CACATTCAAC TGCAAACCTC AAAATGCCAG ATTCTTTGTT
ATCAAATCTT ACACTGTAAG TGTTGTTGCC ATAACGAAAT TGCAGAATAC GCAGCTGACG
GATTTTGAGC AGGAGGAAGA CGTTCAGAAA TCGCTCAAAC ACGAGATTTG GTCTTCTACC
GTCCTCGGCA ATAAACGTCT GGACGCTGCG TACAGAGAAA CGGCAAACAA GGGTCCGATA
TACTTGTTCT TCAGCGTCAA TGGGTCCAGG CATTTCTGCG GTGTGGCCGA AATGACAACG
CCGTGAGTCT AGAGTCCCTG ACCAGGGAAC CATCTCGTCG AAAAGTCCAT GTTATAGTAC
TGACCGCCTA GTAGTGTGGA TGAGACAAAG ACGTCCAAAG TGTGGGCACA GGACAAGTGG
AAAGGTATCT TTGAAGTCAA ATGGATCTTT GTCCGTGATG TTCCCTCAGC GTACGTCCTA
TTGCTCTAGT GTTATGCTGC TTCTTTCCGC TTATACCCAT TGTTAGCGCC CTTCGACATA
TCCGACTGAC CAACACTCCC GAGTGCAAAC CCATCACCAA CTCGCGCGAC ACGCAAGAAT
TGCCCTATGA AGCTGGTACA GAGGTCCTAC AGATCTTTTT GGACCACCAA ACCAAGAGCA
AGACTAGTCT GTTGCAGGAT TTTGCTTATT ACGAGGTTTG TCTTTTTGTT TTCTTTGACT
TCTGTATATA ATACTGACGG ATGTTCATAG CAACTTTCTG TAAATCGCCA AAACCAAAAC
TCTCCTCAGA ATGGCAACAG CCAGAACCAG CAGCCATCCT CGTATCAGCA GTTCCAACAG
AAGCCAGGGA ACGCTATCGC TCCTCCGATG AGCAATAGCC ATTCTGGTCC CACTATTCCT
CCTGTTCCCG CCATCCCTGA TAAATTCCGA TAAATCCAAA GTATACTTTA TCAGTCTTGG
TTTTTTTTTT TACTACCGAT TGGAACGACC TTGCAATGAA CAATATGAAC GTTCGGGAAA
CCAAATCTGG GTAATAATAT TCTGTGCAAA CTTTTACCGA ACGACCGCAG TTAATGGATA
AAGAAAAGGA ACGTTATATG TGAAAGCGAA AATCGTATTA ATTCAAAGTA ACATGTGTGG
AATTGGAGAT GAATAGTACA TGGGTCTCTT ATTATAGAGC TGGATATTTA TGTGTGTAGC
TGTCGGATTA TGGTGAGATG TATAAATTTA TCTTG
 
Protein sequence
MSPAPVAPIR QPRGPSTNSF SQAASPPSST SAGNSGTAAA TGPPAPDTDS LRRANTVSNP 
RHQPSASISA SSAASSAFHG VPPSAASTSA SSSRGVRGVS RFRSGSLSNA LPDPGLVRRA
SGREVAKEVV LENEGEGEGI EAASWGKGLG RQSSLPSRRG LSLAQAANQG APEPVPPPRP
PRRIAINTTT VSGPTPPVSA HSASHSLSSL SMLHRPTYSS QLSQDEPIPS ATSGGVSRTQ
SLRAQAKHSD VGGLGRSISL KAVGEHTYRQ STSQQASSGQ SSGVISSPSS VAFSPPAPPP
VSSTAALPPF QLPLPDSSYV STTTGGADVK RHQSLTQGYG SSSRVRDRLE RSPAITLEPR
EPRDTSGVRV RIDTRFEDEP PTSPIGPSVW SNTLSRTDDG WGSAPLNQSA RNASQQLQDA
FEAMTLGRNM MAPDGNAMLA NDFDPRERLS LETDIMRHPQ PSSNNTLPSG GQGLQRINSG
AEEPSWVNKL VGGDSTPMVG NVMPRSSSAL GWSERARQNE TQRFPYGAYN MPFGNGFNMG
LKPQMGQMGM PQMGPVGMNG MPMGMPMGMG PMNMGFPMGM NMGFGQMGAN GFGQFAPGHP
GQAQNQHPGQ GYPPPSSSAA PTGHGMGAQD REVIELARKK GLNPATFNCK PQNARFFVIK
SYTEEDVQKS LKHEIWSSTV LGNKRLDAAY RETANKGPIY LFFSVNGSRH FCGVAEMTTP
VDETKTSKVW AQDKWKGIFE VKWIFVRDVP SAALRHIRLT NTPECKPITN SRDTQELPYE
AGTEVLQIFL DHQTKSKTSL LQDFAYYEQL SVNRQNQNSP QNGNSQNQQP SSYQQFQQKP
GNAIAPPMSN SHSGPTIPPV PAIPDKFR