Gene Ndas_0794 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_0794 
Symbol 
ID9244639 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp975234 
End bp976448 
Gene Length1215 bp 
Protein Length404 aa 
Translation table11 
GC content70% 
IMG OID 
ProductMethyltransferase type 12 
Protein accessionYP_003678744 
Protein GI297559770 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGGCACA GCGAGATCGC CCTCAGGCGG CACGACCTGG CGCTCCGCTC CCGGCAGACG 
CCCGTCGTCC GCGCGGCGCT GGCGGCGGCG AACGGGCTGG GCCTGCGCGG CGGCCCCCTG
CTCCAGCTCC TCCTCCTGTC CAAGCTCCAG GGGTTCCTCC GGCACATCGT GCTCACCCTC
ATGGACCGGG AGTTCCCCGA CGGCGACCTG TCCGCGCGCC CGGACACCGA CCCCCGGATC
CTCGACGACT TCCTCCGGCT CGCCCTGGAG CTGGGCGTCG CCGCACGGGT GGACGGCGCC
GTCGTCCCCG AACCCGCCTA CACCACCGGG TTCCCCGGCT TCGGCGGCGA CTCGGCCACG
CGCCCGCGCG CCGAGGCCGA GAAGGACTAC GTGTTCGTCC GAGGGCTCAT CCGCAAGGCC
GAGGCGGACG GGGAGACCGG GCCGCTCCAC CGACCGGACG TGCGCTACCT GATCGTCCTC
AGCCGGTACA TCCTCGAACT GGAGGGCATG GGGTTCGACG CCCAGGTCGC GCCCTCCTTC
TCCGAGAAGT TCTACTCCGA CCTCGGGGCC CTGGCCTACG AGCTCTACAC GAAGCGCTCG
TTCGAACGGC TCTGCCGCCG CCTCTCCCCC GCGTCCGTCC TGGACATCGG CTGCGGTGAC
GGCCTGCACA TGAGCTCGGT GCTCTCCACC CTGCCCACCG CACGGATGGT CGGTCTGGAA
CCCCAGGTGA AGGTCGCCGA CGCCACACGC GAGCGGCTGT CCGGCCATCC GAACACACGC
GTGGAATCGG TCCGGTTCAC CGACCACGAC ACCACCGACC GCTTCGACAT GGTCCTGAGC
AGCTTCATGA TCTTCTACAT GCCCGAGGAG GAGCGCGTCC CGTTCTTCCG CAGGGTCCGT
GAGGTCCTGT CGCCGACGGG CACCTACGTC ATCGGCCAGT ACTTCCCCGA CTTCGAGGAC
GTCCAGGAGG TCCTCGTGCG CTCGACCTCC CCGGTGCCGG GCATCCAGCT CTACCTGTCC
GGTGTGGGCA ACTCCCTGGT CAAGGCCGAG GCTCTCCTCA ACCGCGTGCT GTCGGACTTC
CGGTCGGTGG CCTACTGGAG CACGCTCCAG GACCAGCTCT CGGAGGCCGG CCTGGCCGTG
GAGGAGATCG TTCCGGCGGA CAGCATGTAC TACTCGTACT TCCTGCTCGT GCGGCGGGCG
GAGGGCGCCT CGTGA
 
Protein sequence
MRHSEIALRR HDLALRSRQT PVVRAALAAA NGLGLRGGPL LQLLLLSKLQ GFLRHIVLTL 
MDREFPDGDL SARPDTDPRI LDDFLRLALE LGVAARVDGA VVPEPAYTTG FPGFGGDSAT
RPRAEAEKDY VFVRGLIRKA EADGETGPLH RPDVRYLIVL SRYILELEGM GFDAQVAPSF
SEKFYSDLGA LAYELYTKRS FERLCRRLSP ASVLDIGCGD GLHMSSVLST LPTARMVGLE
PQVKVADATR ERLSGHPNTR VESVRFTDHD TTDRFDMVLS SFMIFYMPEE ERVPFFRRVR
EVLSPTGTYV IGQYFPDFED VQEVLVRSTS PVPGIQLYLS GVGNSLVKAE ALLNRVLSDF
RSVAYWSTLQ DQLSEAGLAV EEIVPADSMY YSYFLLVRRA EGAS