Gene CNB04130 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCNB04130 
Symbol 
ID3256004 
TypeCDS 
Is gene splicedYes 
Is pseudo geneNo 
Organism nameCryptococcus neoformans var. neoformans JEC21 
KingdomEukaryota 
Replicon accessionNC_006684 
Strand
Start bp1212875 
End bp1214142 
Gene Length1268 bp 
Protein Length356 aa 
Translation table 
GC content49% 
IMG OID638255058 
Productnitrilase-like protein, putative 
Protein accessionXP_569238 
Protein GI58264164 
COG category[R] General function prediction only 
COG ID[COG0388] Predicted amidohydrolase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value0.0388964 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
CGATAAGAGA TTCCATTTCC CATACAGCAC CACATCAGTT TCCGCTCGAT CATCTATATC 
ATCGGGTTCC ATCGCCTAAG CGATCGAATG CTTCTCTCCC TTCGCTCCTC TCTTTCCAGA
AACATAATCG GCCAAAGCCG TCATCTTCCT ACCATCTCAT TCTCTGTAAC ATCATCGCGT
ATCCGCAACA TGAGTTCTCA AGCAGCAACA TCATCAGCAA CCGTAGCAGT CTGCCAGCTT
CGTTCTACGA GTGATCCTGT ACACAATCTT AAGATCTCAG AAAAGGTGAT TAGGAACGCC
GTCGCCGCAG GTGCTAAAGC TTGCTTCTTG CCTGAAGCAT CGGATTTTAT CAACCCGTCC
AAGACCGAGT CGCGCAAGTT TTCGCACCCA CTACCAAAGC ACGAATACAC CATTGGGCTG
CAAAGGCTGG CTAAAGAACT AGGCATAGTC ATTTCTGTGG GAGTACATGA AGGACCAGAA
GACGAGAGCG AAGAACGAGT GTATAATACT CATGTGTTGA TTGGGAAAGA TGGTGGTATT
CTTGCTAGCT ACAGAAAAGT ATGATGGTGT CTGACTACAC GCAATATCGA TACTGACAGG
GAAACTAGAT TCACTTGTTT GACGTTGAGT TATCAAAACC TCCTGCGCCT GACGGCACCC
CTCGCCCACC CCAGCGCACG GGCGAGTCCG AACGTATTCT TGCTGGACAA GCTGTCACGC
CTCCCGTAGA GGTAGAGGGT ATTGGAAATA TTGGATTAGA AATCTGCTAC GATATCAGGT
TCCCGGAGTT ATCCATCATC TTGACCAGGT TGGGAGCAGA AGTGCTCTTG TTCCCTTCAG
CATTTACTGT CAAAACGGGA CGTGATCATT GGGGAACCCT CTGTGTATGT TTACTCGCCG
GTCATCCAAT CTGCATCAAC TGATCCCTGA ATAGCGTGCG ACAGCTATTC AGTACCAATC
ATATCTCATC GCCTCTGCTC AATATGGAGC TCACAACTCT AAGCGTACAT CATGGGGTGA
AACTCTTGCT TTTGACCCGT GGGGTCGTCA GCTCGGTCGT CTCCGTAGTG TGGACGATAC
GCCCCCTCCC AAAGAGGGCG AAGAAGGTGA CAAGGGTGTG GAGAAACTGT ATGAGGACAG
TGGGGAATTC TTCCTCTGCG AGATAGACGG TACTAAAGTA AAGGAGACGA GGGGGCAAAT
TCCCTTGGCG ATCCAAAAGA GATCAGATAT TTATGGGGTT GTGGGCGAGG GCGCTTAGAT
GTTATAGA
 
Protein sequence
MLLSLRSSLS RNIIGQSRHL PTISFSVTSS RIRNMSSQAA TSSATVAVCQ LRSTSDPVHN 
LKISEKVIRN AVAAGAKACF LPEASDFINP SKTESRKFSH PLPKHEYTIG LQRLAKELGI
VISVGVHEGP EDESEERVYN THVLIGKDGG ILASYRKIHL FDVELSKPPA PDGTPRPPQR
TGESERILAG QAVTPPVEVE GIGNIGLEIC YDIRFPELSI ILTRLGAEVL LFPSAFTVKT
GRDHWGTLCR ATAIQYQSYL IASAQYGAHN SKRTSWGETL AFDPWGRQLG RLRSVDDTPP
PKEGEEGDKG VEKLYEDSGE FFLCEIDGTK VKETRGQIPL AIQKRSDIYG VVGEGA