Gene Ndas_4045 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4045 
Symbol 
ID9247917 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4837459 
End bp4838589 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content75% 
IMG OID 
Productalanine racemase 
Protein accessionYP_003681948 
Protein GI297562974 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.43554 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTCACT TCGCGCACGC TCGCGTCGAC CTCGACGCGA TCTCCCACAA CGCGCGGGTG 
CTCCGCGGGT TCGCAGGGGG CACCCCGCTC ATGGGTGTGG TCAAGGCCGA CGGCTACGGG
CACGGCATGC TCCCGGCAGC CCGCGCCCTG ATCGCGGGCG GCGCGACCTG GCTGGGCACG
GCCTTCATCG GCGAGGCCCT CGAACTGCGC CGCGCCGGAC TGACCCCGCC CGTCCTGGCC
TGGATCATCC CGCCCGGCGA GCCGGTCGCG GAGGCCGTCG AGGCCGACAT CGACCTCGGG
GTGAGCGACC GCGCGGTCCT GGACACCGTG ATCGCCGAGG CCCGCCGCAT CGGCCGCACC
GCCCGCGTAC AGCTCAAGGC CGACACCGGC CTCAACCGCG GCGGCGTGGG TCCCGCCGAC
TGGGGCGCCC TGGCCGAGGC CGCCGCCCGC GCCGAGGACG AGGGGCACCT GCGCGTCACC
GGCGTGTGGT CCCACTTCGC CTGCGCCGAC GAGCCGGGCC ACCCCTCCGT CGCACGCCAG
CTCTCCCGCT TCCACGAGGC CCTGGAGACC GCGGACAAGG TCGGCCTGAC CCCCGAGGTC
CGGCACATCG CCAACTCGGC CGCGCTGCTC ACCCTCCCCG AGGCCCGCTT CGACCTCGTC
CGCGGCGGGA TCGCCAGCTA CGGCCTGAGC CCGATCCCCG GCCTCACGGG GACCGGGCTG
CGGCCCGCGA TGACGCTGCG CTCCCGGCTC GCCCTCACCA AGCGCGTCCC CGAGGGCAGC
GGCGTCTCCT ACGGCCACCG CTACGTGACC GACCGGGAGA CCACCCTGGC CCTGGTGCCG
CTGGGTTACG CCGACGGGGT CCCCCGCGCC GCCACCAACC GGGGGCCCGT CCTCCTGGGC
GGACGCCGCC GGGCCGTCGC GGGAACGGTC TGCATGGACC AGTTCGTCGT GGACGTCGGC
GACGACGCCG TGGAGGCCGG TGAGTACGCG GTGCTCTTCG GCAACCCCGA GGACCACCCG
GACACCCCGA CCGCCGAGGA CTGGGCCGAG ATCCTGGACA CTATCCCGTA CGAGATCGTC
ACGCGGGTGG GCCCCCGGGT CCCGCGCGAG TACGTCGGCG GGGGCGCCTG A
 
Protein sequence
MSHFAHARVD LDAISHNARV LRGFAGGTPL MGVVKADGYG HGMLPAARAL IAGGATWLGT 
AFIGEALELR RAGLTPPVLA WIIPPGEPVA EAVEADIDLG VSDRAVLDTV IAEARRIGRT
ARVQLKADTG LNRGGVGPAD WGALAEAAAR AEDEGHLRVT GVWSHFACAD EPGHPSVARQ
LSRFHEALET ADKVGLTPEV RHIANSAALL TLPEARFDLV RGGIASYGLS PIPGLTGTGL
RPAMTLRSRL ALTKRVPEGS GVSYGHRYVT DRETTLALVP LGYADGVPRA ATNRGPVLLG
GRRRAVAGTV CMDQFVVDVG DDAVEAGEYA VLFGNPEDHP DTPTAEDWAE ILDTIPYEIV
TRVGPRVPRE YVGGGA