Gene Ndas_3670 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3670 
Symbol 
ID9247539 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp4406285 
End bp4407793 
Gene Length1509 bp 
Protein Length502 aa 
Translation table11 
GC content73% 
IMG OID 
Productmethylmalonate-semialdehyde dehydrogenase 
Protein accessionYP_003681574 
Protein GI297562600 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.313455 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAAGC ACGTCACCCA CTGGATCGGC GGGTCCGCCC ACGAGGGGCC GGCCCGGCGC 
ACGGGGGACA TCTACAATCC GGCCTCCGGA CGGGTCACCG GCACGGTCGA CCTCGCCGGA
CGGCAGGAGG TCGACGCGGC GGGGACGGCC GCGCGGGAGG CCTTCCCGGG GTGGCGCGAC
ACCCCGCTGT CCCGGCGGGT GCAGGTCCTG TTCCGCTTCC GCGAGCTGCT CAGCGCCAAC
GCCGACCGGC TGGCGGAGCT GGTCAGCGCC GAGCACGGCA AGGTCCTCTC CGACGCCCGG
GGCGAGGTGG CCCGCGGCCT GGAGGTCGTC GACTTCGCCT GCGGCATCCC GCACCTGCTC
AAGGGCGGCT ACTCCGAGAA CGTGTCCACG GGCGTGGACG CCTACTCGAT CCTCCAGCCT
CTCGGCGTGG TCGCCGGGAT CACGCCGTTC AACTTCCCGG CGATGGTGCC GATGTGGATG
TTCCCGGTGG CCCTGGCCTG CGGCAACGCG TTCGTGCTCA AGCCCAGCGA GAAGGACCCC
TCCGCGTCGG TGCTGCTGGC CGGGCTGTGG GCCGAGGCGG GGCTGCCCGA GGGCGTGTTC
AACGTGGTGC ACGGCGACAA GGAGGCGGTG GACGCCCTCC TGGAGCACCC GGACGTGGCG
GCGGTCAGCT TCGTGGGCTC CACCCCGATC GCGCGGTACG TCTACCGGAC CGCCGCCGAG
CACGGCAAGC GCGTGCAGGC CCTGGGCGGG GCCAAGAACC ACATGGTGGT GCTGCCCGAC
GCCGACCTGG ACCTGGCCGC GGACGCGGCG GTCTCGGCCG GGTTCGGCTC GGCCGGCGAG
CGGTGCATGG CCATCTCCGC GGTGGCCGTG GTGGACTCGG TGGCCGACGG GCTGGTGGAG
CGGATCCGCG AGCGGGTGGC GCGCCTGCGC GTGGGCCCCG GCGACGACGA GCGCAGCGAG
ATGGGGCCGC TGGTCACCAG GGAGCACCGC GACAGGGTGG CCTCCTACCT GGAGTCGGGG
GTGCGCGAGG GCGCGACCCT GGCGGTCGAC GGCCGCGCGC ACCCTGTGTC GGGCGGGAGC
CCGGACGGGT TCTGGCTGGG ACCGTCGCTG CTGGACCACG TCGGCCCCGA GATGTCGTGC
TACCGGGACG AGATCTTCGG CCCCGTGCTG AGCGTGGTGC GCGTGGGCGG CTACGACGAG
GCGGTCAAGC TCGTCAACGC CAGCCCCTAC GGCAACGGCA CGGCGGTCTT CACCAACGAC
GGGGGCGCCG CCCGGCGGTT CCAGAACGAG GTCGAGGTCG GCATGGTGGG GATCAACGTC
CCCATCCCGG TGCCGATGGC CTACTACTCC TTCGGCGGCT GGAAGCAGTC CCTGTTCGGC
GACTCACACG CGCACGGCAC GGAGGGTGTC CACTTCTACA CCCGTACCAA GGCGGTCACC
GCCCGGTGGG CCGACCCGGG CCAGCGGCCC GAAGGGGGCG TCGACCTGGG GTTCCCCACC
AACGGTTGA
 
Protein sequence
MSKHVTHWIG GSAHEGPARR TGDIYNPASG RVTGTVDLAG RQEVDAAGTA AREAFPGWRD 
TPLSRRVQVL FRFRELLSAN ADRLAELVSA EHGKVLSDAR GEVARGLEVV DFACGIPHLL
KGGYSENVST GVDAYSILQP LGVVAGITPF NFPAMVPMWM FPVALACGNA FVLKPSEKDP
SASVLLAGLW AEAGLPEGVF NVVHGDKEAV DALLEHPDVA AVSFVGSTPI ARYVYRTAAE
HGKRVQALGG AKNHMVVLPD ADLDLAADAA VSAGFGSAGE RCMAISAVAV VDSVADGLVE
RIRERVARLR VGPGDDERSE MGPLVTREHR DRVASYLESG VREGATLAVD GRAHPVSGGS
PDGFWLGPSL LDHVGPEMSC YRDEIFGPVL SVVRVGGYDE AVKLVNASPY GNGTAVFTND
GGAARRFQNE VEVGMVGINV PIPVPMAYYS FGGWKQSLFG DSHAHGTEGV HFYTRTKAVT
ARWADPGQRP EGGVDLGFPT NG