Gene CND04750 details

Gene Information

Locus tag: CND04750
Symbol:
ID: 3256965
Type: CDS
Is gene spliced: Yes
Is pseudo gene: No
Organism name: Cryptococcus neoformans var. neoformans JEC21
Kingdom: Eukaryota
Replicon accession: NC_006686
Strand:
Start bp: 1303299
End bp: 1304619
Gene Length: 1321 bp
Protein Length: 172 aa
Translation table:
GC content: 44%
IMG OID: 638256411
Product: conserved hypothetical protein
Protein accession: XP_570495
Protein GI: 58266678
COG category: [K] Transcription
COG ID: [COG1095] DNA-directed RNA polymerase, subunit E'
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value: 0.551706
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
TCATCCATGC AGGGATAGAC ACCCAACTGT CATCCATCCA TCCGCCCGAA CTCTTCCCAC 
CTCGGTCCAC GGAAGGTATA GGATCATACC CATCTTTCTC TTTCCATCTT TTTTTTCCCC
TATCAGGCCA GAAAAAGAAA AAAAAACTGC AAACAACCTC TCAGGATGTT CTTCCTGGTT
GGTACCACCC TTGCGCGAAA GCCTTGCTAT TCGCGCTGAC ACGTTTACTG CTTTCTTTGC
ATCCGAACCT GCCCGACTGT GTTTTTTCTC TGCCCGAAAA ATTTTGGTTG CATAGCGAGA
ACTTACTCAT ACAATCCTAC TTCATCCATC CTACTTTGGT GCCCAGCTTG AAGACTATCT
CCGTCAGAAG CTTTATGAGG ATGTTGAAGG AACGTGTAGC GGTAAACATG GGTATGTTGT
TCATTTTCGT CACAGCTCGG ACGCTGACAT GGAGGATAGG TATATCATCT CGGTCATCAC
CATAACAGAC ATAGGTGAAG GCAAGATCAT CCCATCGACA GGTCAAGCAA AGTTTAAGAC
AAGGTATACT GCCATCGTCA TGAAGCCCTT CAAGGGTGAA GTAGTGGATG CCAAGGTTGT
CAATGTCAAC AAGGCGAGTC TTTTGGCATC TGTTTTCACA TTGTCCGACG GCTGACTGTT
TTGCAGATGG GCTTCTTTGC GATGGTCGGG CCATTACAAG TGTTCGTCTC TTGTCATGTA
AGTCACAGTT TCATTTTCAC TTGCCCTCTT TGAGCATCCC CACCCCTCTC CATCTATGAT
GAACATTATG CATAGGGACA ACAGATCTTT CGGGAATGTT GATTTATGCG GGACGATCTT
GTAGTAATTG ATCGATTAGT TCCTGATCTT TTTATTTCCT TATTATTTCT AGAGTGCAAT
GACTGAGAGT CTGATCAAAG TTTGCTGACT GCCCGATCCT CAGCTTACTC ACTCGGATAT
GAAATTCGAC CCCAGCGTTT CGCCGCCATG CTATCGTTCA AATGACGAAA TTATTCAAAA
GGATACCAAA GTGCGAATAC AAATTGTAGG TTGTAGAGTA GAAGCGAATG ATATGGTAAG
TCTATACCAC TCGAACCGTG CTCCTGTTGC ATCCCAAACT GAGAGGTGAG TTCAGTTTGC
GATCGGAACT ATTAAGAAGG ACTATCTTGG TCAAATAAGA GATGAGTAAG GTAATTATAT
GGTATGGCTG TGTGCGCGAC ACCAAAGCAT AAATTTGGGA ATCCTTGTAT TAGACTACAT
TAACATATAC ACACTATTGT CAGAAATTGA ATACAACATG CATAAATGAT GATGTAACCA
T
 
Protein sequence
MFFLRELTHT ILLHPSYFGA QLEDYLRQKL YEDVEGTCSG KHGYIISVIT ITDIGEGKII 
PSTGQAKFKT RYTAIVMKPF KGEVVDAKVV NVNKMGFFAM VGPLQVFVSC HLTHSDMKFD
PSVSPPCYRS NDEIIQKDTK VRIQIVGCRV EANDMFAIGT IKKDYLGQIR DE
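
Consistency check

The derived fields in this record (Gene Length, Protein Length, GC content) can be recomputed from the coordinates and sequences shown. The following is a minimal sketch, not the database's own method: it assumes Start bp and End bp are 1-based and inclusive, which matches the reported 1321 bp, and the helper names `clean`, `span_length`, and `gc_fraction` are hypothetical. How the database rounds GC content is not stated, so no expected value is given for it.

```python
def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks that group residues in blocks of 10."""
    return "".join(seq_text.split()).upper()


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive endpoints."""
    return end_bp - start_bp + 1


def gc_fraction(seq_text: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq_text)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# Values copied from the record.
START_BP, END_BP = 1303299, 1304619
PROTEIN = """
MFFLRELTHT ILLHPSYFGA QLEDYLRQKL YEDVEGTCSG KHGYIISVIT ITDIGEGKII
PSTGQAKFKT RYTAIVMKPF KGEVVDAKVV NVNKMGFFAM VGPLQVFVSC HLTHSDMKFD
PSVSPPCYRS NDEIIQKDTK VRIQIVGCRV EANDMFAIGT IKKDYLGQIR DE
"""

print(span_length(START_BP, END_BP))  # 1321, matches Gene Length
print(len(clean(PROTEIN)))            # 172, matches Protein Length
```

To check the GC content field, pass the full gene sequence text to `gc_fraction` and compare the result with the reported 44%. The gene span is much longer than three times the protein length, which is consistent with the record marking the gene as spliced.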