Gene CND04940 details

Gene Information

Locus tag: CND04940
Symbol:
ID: 3257398
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006686
Strand:
Start bp: 1354082
End bp: 1355122
Gene Length: 1041 bp
Protein Length: 308 aa
Translation table:
GC content: 52%
IMG OID: 638256430
Product: conserved hypothetical protein
Protein accession: XP_570419
Protein GI: 58266526
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0689] RNase PH
TIGRFAM ID:
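
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the stop codon lies inside the gene span; the function names are illustrative and not part of any database API. Under those assumptions, the gap between the gene span and the coding length is the total intron length, which is consistent with the gene being marked as spliced.

```python
# Values copied from the record above.
START_BP = 1354082
END_BP = 1355122
PROTEIN_LENGTH_AA = 308


def span_length(start: int, end: int) -> int:
    """Length of a closed interval given 1-based inclusive coordinates."""
    return end - start + 1


def coding_length(protein_aa: int, include_stop: bool = True) -> int:
    """Nucleotides needed to encode the protein (assumption: 3 nt per codon,
    plus one stop codon when include_stop is True)."""
    return 3 * (protein_aa + (1 if include_stop else 0))


gene_len = span_length(START_BP, END_BP)      # matches "Gene Length"
cds_len = coding_length(PROTEIN_LENGTH_AA)    # coding nucleotides incl. stop
intron_len = gene_len - cds_len               # remainder is non-coding
print(gene_len, cds_len, intron_len)          # prints: 1041 927 114
```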


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value: 0.189992
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAGTT ATAGAACTTG CACAACCATG GCCACCGCAG GCCCCAGTAG ACGCCCCGAT 
GGCAGAACGC CTGCCCAGCT TAGGCCTTTG CATCTGTCCA TTGGTGAGCT CGATCGTGCA
GACGGCTCGG CTCGCTTCGC TTTCGGTATG TCTCTTCTTG TATACCTCCT CCGCAGACTG
CTGAACACAT GCCTCGTAGG GTCAAATGCC GTCCTTGCAA GCTGCTCTGG TCCTATAGAG
GTCCGCCTCC GTGAAGAACT CCCAGACAAA GCCACTTTTG AAGTAAATCA TCGCCCTCTC
GAGGGCGTTG GTGCAACTCC TTCCCGAGCT CTTGTCACCA CCCTTGAAAC TATTTTCCCT
CCCATCTTAT CATTAGAAAA GCACCCGAGA TCCCTTGTTC AGCTTGTAGT GCAGAGCTTA
GTGCCATCTA CAGGTAGGGT TGTGTACGGG TCTGTCTTTG GGGCGGAAGG GGTGGGAGCA
GAGCAGAACA CATGGCCGGC GACGGATAAG GACGATTACG CCTATATCCC AGAAAGTAGA
AAAGATGCAG CTAGGATATC TCCTGCAGCG GGGTATACTT TTACTGCTCG AGCCGCCTCT
ATCAACGCTT CGACATTAGC ACTCCTCTCC GCGGGTACAA TATCGATCTT AGCACTTCCC
GTCGCTGTAG CCCTCGTGGT GACTACCAAA GGGAGAGTGA TGTTGGATCC AGAAGCCGAT
GAGGAGAAGC AGGCAAAGGC GAGACTCGGG TTCGGCTGGG CCTGGGGTGC AGTATTTGGG
ACGGCCAATG AAGAGAACAA TATGGGAGTT GCTGGGCAGA ACGACGGTGG GGCAGAACTT
GTTTGGATCG AAAGTGAAGG TAGCTTCACT AGGCAGGAAG TGAGTATTTC ATATTTTTTT
TTTCAAGCAA GACTTGAGCA CTGATGATGA GCACCTCAGT GGTCGGAAGC GCTGCAAATG
TCCAAAACGG CCTCAAAGGC AATCCTTGAA TTCATTCGAA TCCAACTTGA CGCTCATCTT
AGTTCACATC AACTCTCATA G
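
The sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before its length or GC content can be compared with the values in the Gene Information section. The following is a minimal sketch of that check; the helper names are hypothetical, and GC content is assumed to mean the fraction of G and C bases over the whole gene sequence.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a grouped listing and uppercase it."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(1 for base in seq if base in "GC") / len(seq)


# First and last lines of the gene sequence listing, copied from above.
first_line = "ATGAGCAGTT ATAGAACTTG CACAACCATG GCCACCGCAG GCCCCAGTAG ACGCCCCGAT"
last_line = "AGTTCACATC AACTCTCATA G"

first = clean_sequence(first_line)
last = clean_sequence(last_line)

# 17 full lines of 60 bases plus the final partial line give the gene length.
total_length = 17 * len(first) + len(last)
print(total_length)                 # prints: 1041
print(first.startswith("ATG"))      # start codon present: True
print(last.endswith("TAG"))         # stop codon at the end: True
```

Passing the full listing (all lines joined into one string) to clean_sequence and then to gc_fraction gives the value to compare against the reported GC content.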
 
Protein sequence
MSSYRTCTTM ATAGPSRRPD GRTPAQLRPL HLSIGELDRA DGSARFAFGS NAVLASCSGP 
IEVRLREELP DKATFEVNHR PLEGVGATPS RALVTTLETI FPPILSLEKH PRSLVQLVVQ
SLVPSTGRVV YGSVFGAEGV GAEQNTWPAT DKDDYAYIPE SRKDAARISP AAGYTFTARA
ASINASTLAL LSAGTISILA LPVAVALVVT TKGRVMLDPE ADEEKQAKAR LGFGWAWGAV
FGTANEENNM GVAGQNDGGA ELVWIESEGS FTRQEWSEAL QMSKTASKAI LEFIRIQLDA
HLSSHQLS