Gene Ndas_5451 details

Gene Information

Locus tag: Ndas_5451
Symbol:
ID: 9249354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014211
Strand:
Start bp: 636528
End bp: 637706
Gene Length: 1179 bp
Protein Length: 392 aa
Translation table: 11
GC content: 71%
IMG OID:
Product: protein-L-isoaspartate(D-aspartate) O-methyl transferase
Protein accession: YP_003683336
Protein GI: 297564363
COG category:
COG ID:
TIGRFAM ID:
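
The coordinate and length fields in the record above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 636528
end_bp = 637706

# Length of an inclusive interval.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # 1179 bp
print(protein_length, "aa")  # 392 aa
```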


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.658243
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 24
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCGGAG TTGTTCACGA ACCCTCGTCC GCACTGGTGG CCCGCCTGGT CGAGGAGGGC 
GCGCTGGTGC CTGACGACCG CCTCACCGAG GCGTTCCGAC GCGTGGACCG GGGCGTGTTC
GTCCCGGCCT TCGCCCTGCA CGAGGAAACC CCGCAGGGTG CGCGTTACAG GCTCCTGTCC
GCTGAGGACC CCGACCAGCG TGAGGAATGG GCCTGCCACG TCTACGCCGA CGAGACGCTG
ACCATCGAGA TCGCCGGAGA ACCGGTGACC GACGCCCTGC CCGGGGGACG CGGTACCGGC
CGGTGGACCA GCTCCTCCAC GATGCCCAGC CTCATGGCGC GGATGCTCCA CCAACTCGAC
CTGGGCGGCG ACCCCCGCGT TCTGGAGGTA GGAGTCGGTT CCGGGTACAA CGCGGCGATC
CTGTGCGAGG TGCTGGGTTC GGACCGGGTC ACGAGCATCG ACATCTCACC GCGCCTGGTC
TCCGACGCCG CCCGACGCCT CTCCGCCCTG GGATACACAC CCGTCGTGGC CGAGTACGAC
GGGCACAAGG GCTTCCCCGA CCGCGCTCCG TACGACCGGA TCGTCAGCAC CACGGCCTTC
ACCCACGTTC CGCCGGAGTG GATCACCCAG GCGGCACCGG GCGGGAGCAT CCTCGTCAAC
ATCGCGGGAG GCACCGGAGG CGCCATGCTC GGGCTCCGGG TGCGCGACGA CCACACGGCG
CAGGGGCGCT TCCTTCCGCA GTGGGCCGGG TTCATGCCCG CGCGGAGCAG CGTTCCCCGC
CAACGCGTCA GCGTGGATGA CGCGGGGGAA CGGAGCACGA CCACTCTGAA CCCGGCGCTC
GTTCGCGGGG AACCGGCCAT GGCGTTCCTC GCGCAACTGG CCACGACGGA CGCCGACACC
GTGGTCAGGA CGGCCGATAC CGGGGCGGAC TTCCTCTTCA TGGAGGGCGC CGACGGGGCC
TGGGCCGAGA TCGACATGGA CGCGGACGAG GGTCGCTACC CCGTGGTCCA GGGCGGACCG
CGCCGTCTGT GGACGCGGGT GGAGGAAGCG CACCGGTGGT GGGTCGCCAA CGGCCAGCCG
GGCTGGAGCG CCTACGGCGT CACCGTCACG CCGGGGGACC AGCACGTGTG GTTCGGTTCG
GCGGAGAGCG ACCAGCGCTG GCCGCTGCCG CTCCCGTAG
 
Protein sequence
MTGVVHEPSS ALVARLVEEG ALVPDDRLTE AFRRVDRGVF VPAFALHEET PQGARYRLLS 
AEDPDQREEW ACHVYADETL TIEIAGEPVT DALPGGRGTG RWTSSSTMPS LMARMLHQLD
LGGDPRVLEV GVGSGYNAAI LCEVLGSDRV TSIDISPRLV SDAARRLSAL GYTPVVAEYD
GHKGFPDRAP YDRIVSTTAF THVPPEWITQ AAPGGSILVN IAGGTGGAML GLRVRDDHTA
QGRFLPQWAG FMPARSSVPR QRVSVDDAGE RSTTTLNPAL VRGEPAMAFL AQLATTDADT
VVRTADTGAD FLFMEGADGA WAEIDMDADE GRYPVVQGGP RRLWTRVEEA HRWWVANGQP
GWSAYGVTVT PGDQHVWFGS AESDQRWPLP LP
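
The gene sequence begins with GTG rather than ATG, yet the protein sequence begins with M. This is expected for a bacterial CDS translated with table 11, where GTG can act as an initiation codon and is read as methionine at the first position. A minimal sketch of that translation step follows, applied to the first 30 bases of the gene sequence. The function name `translate_cds` and the small set of accepted start codons are assumptions made for illustration; this is not the pipeline that produced the record.

```python
# Amino acids for all 64 codons, ordered with bases T, C, A, G at each
# position. Translation table 11 uses these same assignments for
# elongation; it differs from the standard code in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

# Assumption: a subset of the initiation codons allowed by table 11.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, reading the initiation codon as methionine."""
    dna = dna.replace(" ", "").upper()
    codons = [dna[i:i + 3] for i in range(0, len(dna) - 2, 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"unexpected start codon: {codons[0]}")
    protein = "M" + "".join(CODON_TABLE[c] for c in codons[1:])
    return protein.rstrip("*")  # drop a trailing stop codon, if present


# First 30 bases of the gene sequence above.
head = "GTGACCGGAG TTGTTCACGA ACCCTCGTCC"
print(translate_cds(head))  # MTGVVHEPSS
```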