Gene Ndas_3234 details


Gene Information

Locus tag: Ndas_3234
Symbol:
ID: 9247091
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3866205
End bp: 3867500
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: pyrimidine-nucleoside phosphorylase
Protein accession: YP_003681146
Protein GI: 297562172
COG category:
COG ID:
TIGRFAM ID:
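
The Start bp, End bp, Gene Length and Protein Length values listed above are mutually consistent. The following minimal sketch checks that arithmetic. It assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 3866205
end_bp = 3867500

# Assumption: coordinates are 1-based and inclusive, so both end points count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1296 431
```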


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGTCA TCGAGATCAT CCGAGCCAAG CGCGACGGGG GAGAGCTGAG CCCCGGCCAG 
ATCGACTGGG TGATCGACGC CTACACCCGC GGCGAGGTGG CCGAGGAGCA GATGTCGGCG
CTGGCCATGG CGATCTTCCT GCGCGGGATG GACCGGGCCG AGGTGAGCCG CTGGACCGAG
GCGATGCTGG CCTCGGGGGA GCGCCTGGAC TTCTCCGACC TGGCGCGGCC CACCACCGAC
AAGCACTCCA CGGGCGGGGT GGGCGACAAG ATCACCCTGC CGCTCACACC CACCGTCGCG
GCCTGCGGGG CGGCCGTGCC GCAGCTGTCG GGGCGCGGGC TCGGGCACAC CGGCGGGACC
CTGGACAAGC TGGAGTCGAT CCCCGGGTGG CGGGCCTCGC TGTCCACCGG CGAGATGCGG
GAGGTGCTGG ACTCCACGGG CGGGGTGATC TGCGCGGCCG GGTCCGGGCT GGCCCCGGCC
GACCGCAAGC TCTACGCCCT GCGCGACGTG ACCGGCACGG TCGAGTCGAT CCCGCTGATC
GCCGCGTCCA TCATGAGCAA GAAGCTCGCC GAGGGCACCG GCGCGCTGGT CCTGGACGTC
AAGGTGGGTT CGGGGGCGTT CATGAAGGAC GCCGACTCCG CGCGCGAGCT GGCCCGCACG
ATGGTCGACA TCGGAAACGA CCACGGCGTG CGCACGGTCG CGCTGCTCAC CGACATGTCG
GTCCCGCTGG GCAGGCAGGT GGGCAACGCC CTGGAGGTGG CCGAGTCCGT GGAGGTGCTG
TCGGGCGGCG GGCCCGCGGA CGTGGTGGAG CTGACCGTGG CCCTGGCCCG GGAGATGCTG
GCCGCGGCCG GGCTCGTCCC CGGTGAGGGC GGGGTCAAGG ACCCGGCCGA GGCGCTGCGG
GACGGCAGCG CGCTGGAGTC GTGGAAGCGG CTGGTCCGGG CACAGGGCGG GGACCCGGAC
GCGCCGCTGC CGGTGGCCGC CGAGCGCCGG GTGGTGCTCG CTCCGGCCTC GGGGACGGTG
ACCCGGCTGG ACGCCTACCA GGTGGGGCTG GCCGCGTGGC GGCTGGGCGC GGGCCGGGCG
CGCAAGGAGG ACGCGGTGTC GTTCGGGGCG GGGGTGACCC TGCACGCCAA GCCGGGGGAG
TCCGTGCAGG CCGGGGAGCC GCTGTTCACG CTGCACGCGG ACGAGGCGGA GCGGTTCGAG
CGGGCGGCCG AGGCGCTGGA GGGCGCCTTC GACATCGAGC CGGAGGGCGG GGCGGGCTAC
GAGGCCCGGC CGCTGGTGAT CGACCGGATC GCCTGA
 
Protein sequence
MDVIEIIRAK RDGGELSPGQ IDWVIDAYTR GEVAEEQMSA LAMAIFLRGM DRAEVSRWTE 
AMLASGERLD FSDLARPTTD KHSTGGVGDK ITLPLTPTVA ACGAAVPQLS GRGLGHTGGT
LDKLESIPGW RASLSTGEMR EVLDSTGGVI CAAGSGLAPA DRKLYALRDV TGTVESIPLI
AASIMSKKLA EGTGALVLDV KVGSGAFMKD ADSARELART MVDIGNDHGV RTVALLTDMS
VPLGRQVGNA LEVAESVEVL SGGGPADVVE LTVALAREML AAAGLVPGEG GVKDPAEALR
DGSALESWKR LVRAQGGDPD APLPVAAERR VVLAPASGTV TRLDAYQVGL AAWRLGAGRA
RKEDAVSFGA GVTLHAKPGE SVQAGEPLFT LHADEAERFE RAAEALEGAF DIEPEGGAGY
EARPLVIDRI A
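
The sequences above are displayed in groups of 10 characters, so the spacing has to be removed before any analysis. The sketch below does this for the first line of the gene sequence and translates it, then compares the result with the start of the protein sequence. The helper names `clean` and `translate` are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares for non-start codons. Start-codon handling is not modeled here.

```python
from itertools import product

# First line of the gene sequence above, display spacing kept as-is.
gene_fragment = "ATGGACGTCA TCGAGATCAT CCGAGCCAAG CGCGACGGGG GAGAGCTGAG CCCCGGCCAG"
# First two 10-residue groups of the protein sequence above.
protein_fragment = "MDVIEIIRAK RDGGELSPGQ"

# Codons in T, C, A, G order (third base varying fastest) paired with
# the standard amino-acid assignments; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove all whitespace (grouping spaces, line breaks) and upper-case."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons from position 0; stop at the first stop codon."""
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


translated = translate(clean(gene_fragment))
print(translated)                             # prints: MDVIEIIRAKRDGGELSPGQ
print(translated == clean(protein_fragment))  # prints: True
```

Applying the same two helpers to the full gene sequence should give the full protein sequence, on the assumption that the record lists the coding strand from start codon to stop codon.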