Gene Ndas_5375 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_5375 
Symbol 
ID9249278 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014211 
Strand
Start bp556451 
End bp557659 
Gene Length1209 bp 
Protein Length402 aa 
Translation table11 
GC content74% 
IMG OID 
Productalanine racemase 
Protein accessionYP_003683261 
Protein GI297564288 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGTCGA CCGACCCCGG GGAGCACTCG CTCCACCTCC CCGGCCACCC GGCCTCGCCC 
CTGGGTGAGG CCCTCGTTGA TATAAGCGCG ATCGCCCACA ATGTACGGTT CATGGCAAAG
CGGACCAATT CCGAAATTCT TGCCGTCGTC AAGGCGAACG GCTTCGGCCA CGGGGCCGTG
GAGGTCGCCC GCGCCTCCCT GGAGGCCGGA GCGACCTGGC TGGGCGTGAC CTCCCTGGAG
GAGGCGCTCG CCCTGCGCCG CGCCGGGCTG CGCGCCCCCG TGCTGTGCTG GCTGCACCGT
GCGGACCAGG ACTTCGACTC CGCCGTCGCC GCCGACGTGG ACCTGTCCGT CCCCTCGGTC
GGCCACCTGC GCGCCGTGGC CGACGCCGCC GTGCGCACGG GCCGTGTCGC GCACGTCCAC
CTCAAGGCCG ACACCGGGCT GAGCCGCAAC GGCGCGCCGC CGGACGCCTG GCCCGGACTG
GTCGGTCTGG CCCGCGTACT GGAACTGGAC GGCCTGGTCC GGGTGCGCGG TGTCTGGTCA
CACCTGGCCT CCGCCGACCT GCCCGGCGCG GCGACCACCG CGCAGCAGGT CACCGCCTTC
GAGGGGGCGC TCTCCCAGGC GCGCGCGGCC GGGCTCGACC CGTCGCTGCG GCACCTGGCC
AACACGGCCG CGATCCTCAA CGAGCCCGCC ACGCACTTCG ACCTGGTCCG GGCGGGCGTC
GGCCTCTACG GAGTGGAGCC GGTCGAGGGC AGGCGCTTCG GCCTGCGCCT GGCCATGACC
CTGCGCGCCC GGGTGGCCAT GGTCCGCCGG GTCCCCGCGG GGACGGGCGT CAGCTACCAC
CACGCCTACA CCACCCCACG CGAGAGCCTG CTCGCCCTCG TCCCGCTCGG CTACGCCGAC
GGTGTGCCCC GTGCGGCGGG GGACCGGGCC TACGTGTGGA TCGCCGGACG GCGGTGCCCC
GTGGCCGGAC GCATCGCCAT GGACCAGTTC GTGGTCGACG TCGGCGGCAT GGACGTGCGC
GAGGGCGACG AGGTGGTCGT GTTCGGCCCC GGCGACCGCG GCGAGCCCAC CGTCGAGGAG
TGGGCCGACT GGGCCGGAAC CATCCCCCAC GAGATTCTCA CCGGCGTGGG TGCGCGCGTG
CCCCGCCTCC ACCAGGACCT GGCGCGGCCC GTGCCGCGCG AACCGAGCAA GGAGAGATCG
AGTGCCTGA
 
Protein sequence
MPSTDPGEHS LHLPGHPASP LGEALVDISA IAHNVRFMAK RTNSEILAVV KANGFGHGAV 
EVARASLEAG ATWLGVTSLE EALALRRAGL RAPVLCWLHR ADQDFDSAVA ADVDLSVPSV
GHLRAVADAA VRTGRVAHVH LKADTGLSRN GAPPDAWPGL VGLARVLELD GLVRVRGVWS
HLASADLPGA ATTAQQVTAF EGALSQARAA GLDPSLRHLA NTAAILNEPA THFDLVRAGV
GLYGVEPVEG RRFGLRLAMT LRARVAMVRR VPAGTGVSYH HAYTTPRESL LALVPLGYAD
GVPRAAGDRA YVWIAGRRCP VAGRIAMDQF VVDVGGMDVR EGDEVVVFGP GDRGEPTVEE
WADWAGTIPH EILTGVGARV PRLHQDLARP VPREPSKERS SA