Gene CND04970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCND04970 
Symbol 
ID3257296 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006686 
Strand
Start bp1361853 
End bp1364819 
Gene Length2967 bp 
Protein Length813 aa 
Translation table 
GC content54% 
IMG OID638256433 
Productconserved hypothetical protein 
Protein accessionXP_570440 
Protein GI58266568 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.96924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTTTCC ACGCCGCCTC AAACGCCCCC GCGACGAGCC CTCCGCTGCC GCCCGTGCGC 
CACGCCCTTT CGATCAAACT GCCCGAGCGC TCCTTCTCGC ACTCCACCTC CACCGACGCA
TACACCCCCC TCACGCCCAC CAACGAGAAC GGCCACATCG GCATCACACG CAGGCGCCAC
TCCGTCTCCT CGCCAAAGTC CCGCGCCGCA TCGCTAGACC ACCAGCCACG GCTCGCACCG
AAAAACAGCA CCGCCTCGGA CTGGAGGATG AACGGCTGGA CAAAGCACGG TGAACCTTCG
TCCTCCCCCG CGCCCGACGT GAGCGAGCAC GAGACCACCC CGGTCGCCCA TCCGATCGAC
CTCACGCCGC CGCTTCTCCC TGCCGCTTCC AGTCTCCACC CGCTTCCCCG AGACCCAAAC
AACATCCTTC CTCTCTCCGT CGCGTCCTCG CCGTTCGTCA CGCCCTCCGT CTCGCGCGCG
CCGTCCCCTC GTCCTGCGCC CAGAGTGGAC AGCAGCGCCG AACAGATCAC AGTCTCTACT
GGCATCGTTC CCGCGTCTTC GTCTGCGCCC GGCTCGAGCT CCAGTCCGAG GACTTCAATC
TCGACGTCCT CGTCCCGTGT TTGGCCGGCA GGCCGATCGG CCAGCAGGAG CTCAGCCGAA
GACGATGATA CAGGTCCCTA TGGCTCAGCG GCCGGACCGT CGTACCATCC CCGTTCATGG
TTTTCTCGCG CAGCAGGGTC CATATCCCCA AAGATAAATG CGCCTACAGC ACCCCGCATC
ATGCGACTCA GCTCGGGGAG GTTGACTGGG ACGAGACGGT GGGGATGGAT CTTTGAATGG
CTTGGGGTGC AGTCTAAGCC TGAACTGCCG AAACGAGGAG GGCTGAGCAA GAGGAGCGAT
CGAGAGAGGG AGAGATTGAT GGGTCAAGGG GTAAGGCGAA GAGGAGATGT AAAGATCCTT
GGGTCAAAAT GGCTAGCCAG GGTTATAGCC TTTATACCCA CAGAACCGTG GAGCATTGTG
CGTTCATCTT TTTTCTTCTT GCCTATTGCA ACACCGCTAA CTCGTCGCAG AGCCTCTTCC
TTATCTTCTT TGCAGTCTTT GCAATCACAC TAACGTTTAC CATAAAGCAC ATCCTCAACC
CAGATAAAGA AGCATTACCA TGGCGTCAAT ACTGCACGAC CACCTACCCA TCTCTCTATT
CTCTGCAATA CCCCTCCGAC CAGCCTCATA CCAAACTCAC CCTCTCACCT CTCTCTCCTG
ATCACCCCGC ATGGCCTTAC AAACCCCATA CATCTCCGCT ATGGACAGCG GACATGCCGC
AAGCGGATCT TGATGCGGCG CTCGAACCTG TAGGCGTACT CCTCGGTGTC TTTACCACAG
ATGCCGGTCT CGAGCGTCGG CATATGATAC GGCAAAGCTA TGCGAGTCAT TGGCGAAGTC
GTCGAAAGGG GACGGAAGGA GTGAGAATCA AATTCGTGAT GGGAAGACCT AGGAAACGTT
ATGAGAAGGC TGTCCAGCTC GAAATGGAAG GTGAATTCCT AGGCGCTCTG TTTAAAAGGA
GGAATAACGC GCTAACAAAT TGCAGCATTC AATGACATTC TGCTACTGGA CATTGATGAA
AATATGAACA ATGGCAAAAC GCACGCGTTT TTCTCTTGGG CTGCCGAAAA TGCATCTGTA
CCGGACTGGG AATATCCATC CCATCCCCGA TCCGACTCTG ACTATGCCAA TTCTGGCACG
GCAATCGAAG CTGCGCAAGG AGGAAATTTG CACGCCCCTG TTTGGCGAGG TGAAAAGAAG
CCGCAATATG TTGTAAAAGC AGATGAGGAT TCGTTCATTA TGCTTGGAGA GCTGGAGAGA
AGGCTAAGGG TAGTGCCGAG GATGAAAACC TACTGGGGCT GTGAGTTGTG CATCCCTCCG
TACGGAAAAA AGATGGTGTA AGCTTACAAT AAACGTAAAT GTGTAGATCT GGTGAAAAAC
AAATTCATGG CGGGTGAATG CTATGCTCTG TCTTTTGATT TGGTCGAGTA CATTGCTGCC
TCCCCAGCGC TCAAAACTCT CACCAAAGGC AAGGAAGATA AGCTTGTTGC CAAGTGGATA
GGGATGCATC CCCAAAAGGA GGAGATTGTC TGGTCGACGG ACAGATGTTG GATATATGAC
CACCCCAAAG CTGGTACCGT GTAAGTGTCC AGATTTTTTC TTTTCTTTCT TTTCTTGCAC
TCGACGACTT GCGCTGACAT CGAGAACAGT TACTCACACG GTTTTCTGTA TCCTTCCACA
GTCGAACAAG TCCGCGTAGA AAATCAAACT GGACTTTCAC CTTTGACTCT TGCCCAGCGC
GGCGGGCCAG GAGCGGCCGA CGCTTATTCC ACCGTCTCCA AATTCGGAAC CGCCTACCGA
CCGCTTTCCA CCGACATGTC TGCGGCCGAG CAAGTAGAAG CTCTTGTCGA AGGCTCTCCT
CTCTCCAGAT TAAATGAAGA TGAGCTATCC TCATCGAGTC GCAAAGTCCA GCAAGCATTT
TCTCCCACAG AGTCTCTTCG TCAAAAGATC GATCGACTAT ACTCCTCAAG GCCGACTAGG
ATAGAGAGAT TCTTGGGCGA TGAAGAGGAA CGAGGAGGAA CGGTGGTGGT ACATTATATA
AAAAAGGCAG AATGGTTTGT GGAGACCATG ATAGCCATGC TGGGCACGGC CGAAGAGCAG
AGAGTCTGGC ATCGTGGTGT AGGGAGTGGG CTGGGCGCTT TGGAGAGGCG AAAAGGGCGA
GTGCCTGTAT CAGGAAATGG ACAGGAAGGA TTCGATGCCG GAAACAGGGT CAGACTTAAA
AAGGAAGACG GCCTGTAGAT ACGTTGTAGA TTATATGACT CGAGTTTAAG AGTCGAGATT
TGCTGTCGAA GATCACCAAT GTTATTCGAC CCACGCACGC ACGCTTCGTG CTGTAAAATA
TCATAAATCT ATATCATCAT ATCATAG
 
Protein sequence
MPFHAASNAP ATSPPLPPVR HALSIKLPER SFSHSTSTDA YTPLTPTNEN GHIGITRRRH 
SVSSPKSRAA SLDHQPRLAP KNSTASDWRM NGWTKHGEPS SSPAPDVSEH ETTPVAHPID
LTPPLLPAAS SLHPLPRDPN NILPLSVASS PFVTPSVSRA PSPRPAPRVD SSAEQITVST
GIVPASSSAP GSSSSPRTSI STSSSRVWPA GRSASRSSAE DDDTGPYGSA AGPSYHPRSW
FSRAAGSISP KINAPTAPRI MRLSSGRLTG TRRWGWIFEW LGVQSKPELP KRGGLSKRSD
RERERLMGQG VRRRGDVKIL GSKWLARVIA FIPTEPWSIP HTKLTLSPLS PDHPAWPYKP
HTSPLWTADM PQADLDAALE PVGVLLGVFT TDAGLERRHM IRQSYASHWR SRRKGTEGVR
IKFVMGRPRK RYEKAVQLEM EAFNDILLLD IDENMNNGKT HAFFSWAAEN ASVPDWEYPS
HPRSDSDYAN SGTAIEAAQG GNLHAPVWRG EKKPQYVVKA DEDSFIMLGE LERRLRVVPR
MKTYWGYLVK NKFMAGECYA LSFDLVEYIA ASPALKTLTK GKEDKLVAKW IGMHPQKEEI
VWSTDRCWIY DHPKAGTVYS HGFLYPSTVE QVRVENQTGL SPLTLAQRGG PGAADAYSTV
SKFGTAYRPL STDMSAAEQV EALVEGSPLS RLNEDELSSS SRKVQQAFSP TESLRQKIDR
LYSSRPTRIE RFLGDEEERG GTVVVHYIKK AEWFVETMIA MLGTAEEQRV WHRGVGSGLG
ALERRKGRVP VSGNGQEGFD AGNRVRLKKE DGL