Gene CNG02160 details

Gene Information

Locus tag: CNG02160
Symbol:
ID: 3258952
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006692
Strand:
Start bp: 608654
End bp: 611604
Gene Length: 2951 bp
Protein Length: 746 aa
Translation table:
GC content: 51%
IMG OID: 638257834
Product: conserved hypothetical protein
Protein accession: XP_571924
Protein GI: 58269536
COG category:
COG ID:
TIGRFAM ID:
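
The length fields in the record can be cross-checked against the coordinates. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive (which matches the listed gene length) and that the coding sequence is the protein length plus one stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the record above.
start_bp = 608654
end_bp = 611604
protein_length_aa = 746


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def coding_length(protein_aa: int) -> int:
    """Nucleotides needed for the protein plus one stop codon (assumption)."""
    return (protein_aa + 1) * 3


gene_length = span_length(start_bp, end_bp)      # compare with "Gene Length"
cds_length = coding_length(protein_length_aa)    # bases that encode the protein
non_coding = gene_length - cds_length            # bases left over in the span

print(gene_length, cds_length, non_coding)
```

Under these assumptions the span is longer than the coding sequence alone, which is what one would expect for a gene marked as spliced; the remainder would be introns and any untranslated sequence included in the span.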


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAATA CCAACCTCCA CGGTTACCCT TTCCAACCCG TCCGTCGCTT TCTGAAATCA 
ACAATGCCGA CATCCTCAAC CTTGATGATG ACGATAATCA TCCTCTTCTA TGCCCTGCCT
CTCCTATCCC ATCTGACATA TCCAATACAT ACTCATACCA ATCGGGTGCC TTTGGCGATT
CCAATTCACA GCTTGTCAAC GACCACAACC ATGTTGGAGC CGGCCACAGT GTCCAGGCCC
GACCTTATGT CTGCACTTTT GGTACTTGTG ACAAGGCGTT TGCCAGAAAA AGTGATCTTG
CAAGGCACTT TAAGATTCAT ACCAATGACA GGTGAGTCAG CTCATCCGTT GTGGTAAACA
AAGCTAACGC TACCCCAGAG CCTTTGTCTG CACCTATCGC GGTTGTGGCA AATCTTTCAT
CCAGCGATCA GCGTTGACAG TTCATTACCG TGTCCAGTGA GCTTGATCAG CTCTGTTGTC
AAGGTCGATG CTGATGTTTT ATACAGTACC GGGGAACGAC CTCACCATTG CGAGACCTGT
AACAAAGCAT TTGCGGACTC GAGCTCGCTC GCGAGACATA GGCGTATTCA GTAAGTCAAT
GGTTTGTTTC AGCGATGAAA TAGTATTGAC AAAGGCTAGC ACTGGGAAAC GTCCTTACAC
CTGCCACGCT GCTGGTTGCG GAAAACCTTT CGCTCGACGC AACACCTTGC TCAAACATTT
CAAGCGGCAG CATCCAGGAC TACCTCCGCC TTCTACTGCT GCCCAGCGTA CCTCTATTCG
AATGCCTATT CAGTCTCTTG GTCCAAGATC CTCCACTGCG TCTATGTTGT CCCAGCAATC
ATTCAACTCC AACGCTGACC GCTACACCGC TAGTCCTTCC AATGCTGGTA CCCCCCACGG
CTTCGCCGCT CCCCACCCGC CGGAGGGAGC GGCGTACGCC TTCCATGGCG GGTTCCCCGA
ACAGGTACTC GGAGGACATC CAAGTGCCCA ACAGCCCATC ATCTTTCAAG GATCTGGAGG
TATCCGACCT CACTTGCAAC AATCCCCGGT CCCTCCCGGA TCCGTCAGCC TCACCCCAGT
CTCTACTTCT GGACCCCATT TTGCAGGTGG GTATGGTACC CAGACGCCAA CGAGCCCAGC
GCACCATGAC CGAGACAAGC ACGGTATCTC CCCAGTATCT TCAATCGCAA CGAATAGCTT
TGGTTCCGGA CAGTACGCTA GCCCGCTCTC TGCCTACCCG CATGCAAGCA GCTATCCGCT
TACGCGCATC ACAAGCGATT CAGGAATAAT CTGGCATAGG TCACTCTCAG CTCCAGAAGA
ACCTCGTTAT GAGGCTTCTC AGTGGAGCGG CGGCAATGTC GGGGCTGGTT TCCATGCATC
ACAATTATCC ATGCCCCAGA CACCAACCCA TCTCTCGCCA TCTTACCCCT ACCATCTACC
TATGCAACAA AGATCGGCCA CCAACCCTCT TCCCAAGACT AGGCAGCTGT ATTCTCCTTC
TTGTCATGGT TCAGATGATG AGCGGGATGA ACCTCTCGTA TCTCTACCGG ACGCCCACCC
TACCTTTGCC ATCCATCCTC CCCAGGGCAT TGTCAGCGTT CCTATGTCCA GCATTGAAGG
TTCCAACATC AGCCACATAT CGAATCATGG GGGCCAGATT CTGTTCGCTC CATCTCAGAA
CGGTCCACTC CACAGCGCTC CACCCGCTAT TCAGCGTTTC AATTCTATGC CAGCGGTGCC
AACTATGAGC TCCTGGGGCC AAATTCCTCA ATACCAGACT CAAAGTGTCG GCAGTGCTAA
AAGCGCAGAT GAGGAGTGGG AGGAATTGCA GAAGGAGATG CTCAGTCGGG AGGCATCAGT
CGGGACTGAC AAGGAATTGA GTCCTGCTAC CGAAGAAGGG AAAACCCCCA AGGATGGTGA
ATCCATCAAC CACTGGGGTG AAGCCATTGC ATATCCAGTG CACCCCCATC TGCAACATGC
CAAGGATCCG TTCCATTCAA CCTCTTCCAC CTTTATTGAT CACATGCACA GCAGCGATCC
TCTCCCTCCT ATCCACGTTT TCTCCAATCA ACCTCACCAC ATGGTTCTCA CTCCTATCAA
TCCCAATGGC ATGTACCCCA CCCCCATTAC GCCTGGTGAA GAGTGGGCTC AACGTCACAT
CAAGCCTACT GCAATGCTTG GCCGTGGATA TCCTCGGCAA CACATTGACG TAGAGAACAA
AGAAAATGGC AATGAAATCG CCGAGCATAT CACCCTTACT ACGCCTCCCA AATATCAGGG
ACATCGCAAG GACAGCCGAT CCGTGACTGC CGTTGGCTTA GGGATTGCCA ATGTTCATTT
TACCGAGCCA CAAGCGATTG AAGCGGGCGA TGCGTCAGAA GTGAAAATGG AGGATCTTGA
GAGCGAAGAA AGTGATGTGA CTCCAGAGGA CGATAGCGAC GATGAATTTG TGCTTGGGAG
GAAACCGAGA AGGAGTGCAA GGAAGGGTGG TGTGAGGAAG AGGGGATCTA GAACCGCCAC
AAAGAGAAGA CGTTCGTAAC GTTAATCCAA AGTATTGACT GGTCCTTTGC CATATACCCA
TCTCATGATT CGCCATACGG TTTTATATCT CACAATTGTT CTCCTTTACG AGCTGTCAGA
TCACGACTTC GGTGTCAGGA AGGAAAAAGA AAAAAGAGAA GTAGTAGTAT ATTCGCCTGT
TACTCTAATT CGGCCTGAGC TTCATACGTT TTTAATAGCT TGTTAGCTGC GCTTGGCCTA
GTCTTCAAGT CTATCTTTCG CATCACCACA TTTTTCCGCA TTTGCCACAT GTTAACGCTT
GCATCGGCCG TTCCCTTCGT TAGTTGAAGA TTTTTATGTG CCTATTTATA CGATTTTTGA
CGACTCATTT TTACGCTGTG TTATATGATT ATTGTTTGAT ATACGATAAC TGAGAGACGT
TCCAAAAGAC A
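
The GC content field reports the share of G and C bases in the gene sequence. A minimal sketch of that calculation follows, assuming the sequence is pasted as text with the spaces and line breaks used in this record; the helper name gc_percent is hypothetical, and the database may round or compute the value differently.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters in the text.

    Spaces, line breaks, and any other characters are ignored, so the
    blocked layout used in the record can be pasted in unchanged.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotide characters found")
    gc_count = sum(1 for b in bases if b in "GC")
    return 100.0 * gc_count / len(bases)


# First row of the gene sequence above, used only as sample input.
first_row = "ATGATCAATA CCAACCTCCA CGGTTACCCT TTCCAACCCG TCCGTCGCTT TCTGAAATCA"
print(round(gc_percent(first_row), 1))
```

To compare against the reported figure, pass the full gene sequence rather than a single row; one row is not representative of the whole gene.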
 
Protein sequence
MINTNLHGYP FQPLVNDHNH VGAGHSVQAR PYVCTFGTCD KAFARKSDLA RHFKIHTNDR 
AFVCTYRGCG KSFIQRSALT VHYRVHTGER PHHCETCNKA FADSSSLARH RRIHTGKRPY
TCHAAGCGKP FARRNTLLKH FKRQHPGLPP PSTAAQRTSI RMPIQSLGPR SSTASMLSQQ
SFNSNADRYT ASPSNAGTPH GFAAPHPPEG AAYAFHGGFP EQVLGGHPSA QQPIIFQGSG
GIRPHLQQSP VPPGSVSLTP VSTSGPHFAG GYGTQTPTSP AHHDRDKHGI SPVSSIATNS
FGSGQYASPL SAYPHASSYP LTRITSDSGI IWHRSLSAPE EPRYEASQWS GGNVGAGFHA
SQLSMPQTPT HLSPSYPYHL PMQQRSATNP LPKTRQLYSP SCHGSDDERD EPLVSLPDAH
PTFAIHPPQG IVSVPMSSIE GSNISHISNH GGQILFAPSQ NGPLHSAPPA IQRFNSMPAV
PTMSSWGQIP QYQTQSVGSA KSADEEWEEL QKEMLSREAS VGTDKELSPA TEEGKTPKDG
ESINHWGEAI AYPVHPHLQH AKDPFHSTSS TFIDHMHSSD PLPPIHVFSN QPHHMVLTPI
NPNGMYPTPI TPGEEWAQRH IKPTAMLGRG YPRQHIDVEN KENGNEIAEH ITLTTPPKYQ
GHRKDSRSVT AVGLGIANVH FTEPQAIEAG DASEVKMEDL ESEESDVTPE DDSDDEFVLG
RKPRRSARKG GVRKRGSRTA TKRRRS