Gene Ndas_1549 details

Gene Information

Locus tag: Ndas_1549
Symbol:
ID: 9245399
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 1896135
End bp: 1899605
Gene Length: 3471 bp
Protein Length: 1156 aa
Translation table: 11
GC content: 70%
IMG OID:
Product: methionine synthase
Protein accession: YP_003679484
Protein GI: 297560510
COG category:
COG ID:
TIGRFAM ID:
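
The coordinates and lengths listed above are mutually consistent, which can be checked with simple arithmetic. The sketch below assumes inclusive, 1-based coordinates and a gene length that includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(1896135, 1899605)
print(length)                  # 3471
print(protein_length(length))  # 1156
```

Both values match the Gene Length and Protein Length fields of this record.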


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.217391
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGAGCCA GCCCCACCTT CCGCGAAACG CTGTCCCGCC GTGTGGTCGT GGCGGACGGA 
GCGATGGGCA CCATGCTCCA GGCGCACGAT CTCGACCTCG ACCAGTTCGA GGGGCACGAG
GGATGCAACG ACATCCTCAA CCTCACCCGT CCCGACATCG TCCGCGACAC CCATGCCGCC
TTTCTTGCCG TGGGTTCGGA CGCCATCGAG ACGAACACCT TCTCGGCCAA CCTCGGCGGG
CTCGCCGAGT ACGGCATCGA GGACCGCACC TACGAGATCG CCCACGCCGG AGCGCAGGTG
GCCCGCGAGG CCGCCGACGC CTACTCCACC CCCGACCAGC CCCGCTACGT CCTCGGGTCC
GTGGGCCCGG GAAACCGGCT GCCCACCCTG GGCCACGCGC CCTACACCCA GCTGCGCGAC
TACTACGAGC AGTGCGCGCG CGGCCTGATC GACGGCGGAT CCGACGCCAT CCTCATCGAG
ACCTGCCAGG ACCTGCTCCA GGTCAAGGCC GCCGTGGTGG CCGCCCAGCG CGCCCGCCGC
GCCGCGGGCA GGGACGTGCC GATCATCGCC CAGGTGAGCA TCGAGACCAA CGGCACCATG
CTGCTCGGCT CCGAGATCGG CGCCGCGCTG ACCTCCCTGG AGCCGCTGGG CGTCGACGTC
ATCGGCCTCA ACTGCTCCAC CGGCCCCGCC GAGATGAGCG AGCACCTGCG CTACCTGTCG
CACCACTCGC CCATCCCCAT CTCCTGCATG CCCAACGCGG GCCTGCCCCA GCTCGGCCCC
GACGGCGCCT TCTACGACCT CTCGCCCGCC GAGCTGGCCG ACGCCCACGA CTCCTTCACC
TCCGAGTTCG GACTGAGCCT GGCCGGCGGC TGCTGCGGCA CCACCCCCGA GCACCTGCGC
CACGTGGTGG AGCGCGTCCA GGGGCGCGGC ATCAAGAACC GCAAGCCCCT GGTGGAGGCG
GCCTCCTCCT CCCTGTACCA GAGTGTGCCC TTCCGCCAGG ACGCCAGCTA CCTGGCCGTG
GGTGAGCGCA CCAACGCCAA CGGCTCCAAG AAGTTCCGCG AGGCCATGCT GGAGGGCCGC
TGGGACGACT GCGTGGAGAT CGCGCGGGAC CAGATCCGCG ACGGCGCCCA CCTGCTCGAC
CTCAACATCG ACTACGTGGG CCGCGACGGG GTCAGCGACA TGCGCGAGCT GGCCTCGCGC
CTGGCCACCT CCTCCACGCT GCCGATCATG CTCGACTCCA CCGAGCCGCC CGTCCTGGAG
GCGGGCCTGG AGGCCCTCGG CGGCAGGAGC GTGGTCAACT CCGTCAACTA CGAGGACGGC
GACGGCCCCG ACTCCCGCTT CACCCGCATC ATGGGGCTGG TCAAGGAGCA CGGCGCCGCC
GTCGTGGGCC TGTGCATCGA CGAGGAGGGC CAGGCCCGCA CCGCCGAGTG GAAGGTGCGC
GTCGCCACCC GCCTCATCGA GCAGATCACC GGTGAGTGGG GGCTCAACAC CAGCGACATC
ATGATCGACT GCCTCACCTT CCCCATCACC ACGGGGCAGG AGGAGACCCG CCGGGACGGC
CTGGAGACGA TCAACGCCAT CCGCGAGCTC AAGCGGCTCT ACCCGGACGT GCAGACCACC
CTGGGACTGT CCAACCTGTC CTTCGGCCTC AACCCGGCCG CCCGCATCGT GCTGAACTCG
GTGTTCCTGC ACGAGGCGGT CCAGGCCGGG CTGGACTCGG CGATCGTGCA CGCCTCCAAG
ATCGTGCCGA TCAACCAGAT CCCCGAGGAG CAGCGCGAGG TCGCCCTCGA CATGGTCTAC
GACCGGCGCG AGGGCGACTA CGACCCCCTC TCGCGGTTCA TGGAGATGTT CGAGGGCGTG
GACGCCAAGT CCATGAAGGC ATCCCGCGCC GAGGAGCTGG CCGCCCTGCC GCTGTGGGAG
CGCCTGGAGC GGCGCATCAT CGACGGCGAG ATGACCGGGA TCGAGGCCGA CCTCGACGAG
GCGCTGGAGT CCAAGCCCGC CCTGGCCATC GTCAACGACA CCCTGCTCTC GGGCATGAAG
ACCGTCGGCG AGCTGTTCGG CTCCGGCCAG ATGCAGCTGC CCTTCGTCCT CAAGTCCGCC
GAGGTCATGA AGGGCGCCGT CGCCTACCTC GAACCCCACA TGGAGAAGAG CGACGACGAC
GGCAAGGGCC GCATCGTCCT GGCCACCGTC AAGGGCGACG TCCACGACAT CGGCAAGAAC
CTCGTGGACA TCATCCTGTC CAACAACGGC TACGACGTGG TCAACATCGG CATCAAGCAG
CCGGTGTCGG CGATCCTGGA GGCCGCCGAG GAGCAGCGCG CCGACGTGAT CGGCATGTCC
GGCCTGCTGG TCAAGTCCAC GGTCATCATG AAGGAGAACC TGGAGGAGAT GAACTCCAGG
GGCCTGTCCG AGCGCTTTCC GGTCCTGCTG GGCGGCGCCG CGCTCACCCG CTCCTACGTG
GAGCAGGACC TGGCCGAGGT CTTCGACGGC CACGTGCGCT ACGCCAAGGA CGCCTTCGAG
GGCCTGCGCC TGATGGACGC GTTCATGGCG GTCAAGCGCG GTGACGAGGG CGCCGAGCTG
CCCGCCCTGC GCCAGCGCCG GGTCAAGACG GGGGCCAAGC TCAAGGTGAG CGAGCCCGAG
GAGGTGCCCG CCCGCAGCGA CGTGTCGACC ACCAACCGGG TGCCCAAGCC GCCCTTCCTG
GGGGACCGGA TCAGCAAGGG CATCCCGCTG GCCGACTACG CCGCCTTCCT CGACGAGCGC
GCCACCTTCA TGGGCCAGTG GGGGCTCAAG GCCGCCCGCG GCGGCGAGGG CCCCAGCTAC
GAGGAGCTGG TGGAGACCGA GGGCCGCCCC CGGATGCGGA TGTGGCTGGA CCGCATCCAG
ACCGACGGCC TGCTGGAGGC GGCCGTGGTG CACGGCCACT TCCCCTGCTA CAGCGAGGGC
GACGACCTGG TGGTGCTGGA CGAGGAGGGC GCCGAGCGCA CCCGCTTCAC CTTCCCGCGC
CAGCGCCGGG ACCGCCACCT GTGCCTGTCC GACTTCTTCC GGCCCAAGGA GTCGGGGGAG
CTGGACGTGG TGTCCTTCCA GGTGGTGACC GTGGGCTCGG CGATCAGCCG CGCCACCGCC
GAGCTGTTCG CCAAGAACGC CTACCGCGAC TACCTGGAAC TGCACGGGCT GTCGGTGCAG
CTGACCGAGG CCCTGGCCGA GTACTGGCAC ACCCGGGTGC GTGCGGAGCT GGGCTTCGCG
GGCGAGGACC CGGCCGAGCT GGACGCGTTC TTCAAGCTCG GCTACCGGGG GGCCCGCTTC
TCCCTGGGCT ACGGGGCCTG CCCCGACCTG GAGGACCGCG CCAAGATCAT GCGCCTGCTG
GAGCCCGAGC GGGTGGGCGT GACCCTGTCG GAGGAGTTCC AGCTGGTGCC CGAGCAGGCG
ACCGACGCGA TCGTCGTCCA CCACCCGGAG GCGACCTACT TCAACGTCTG A
 
Protein sequence
MRASPTFRET LSRRVVVADG AMGTMLQAHD LDLDQFEGHE GCNDILNLTR PDIVRDTHAA 
FLAVGSDAIE TNTFSANLGG LAEYGIEDRT YEIAHAGAQV AREAADAYST PDQPRYVLGS
VGPGNRLPTL GHAPYTQLRD YYEQCARGLI DGGSDAILIE TCQDLLQVKA AVVAAQRARR
AAGRDVPIIA QVSIETNGTM LLGSEIGAAL TSLEPLGVDV IGLNCSTGPA EMSEHLRYLS
HHSPIPISCM PNAGLPQLGP DGAFYDLSPA ELADAHDSFT SEFGLSLAGG CCGTTPEHLR
HVVERVQGRG IKNRKPLVEA ASSSLYQSVP FRQDASYLAV GERTNANGSK KFREAMLEGR
WDDCVEIARD QIRDGAHLLD LNIDYVGRDG VSDMRELASR LATSSTLPIM LDSTEPPVLE
AGLEALGGRS VVNSVNYEDG DGPDSRFTRI MGLVKEHGAA VVGLCIDEEG QARTAEWKVR
VATRLIEQIT GEWGLNTSDI MIDCLTFPIT TGQEETRRDG LETINAIREL KRLYPDVQTT
LGLSNLSFGL NPAARIVLNS VFLHEAVQAG LDSAIVHASK IVPINQIPEE QREVALDMVY
DRREGDYDPL SRFMEMFEGV DAKSMKASRA EELAALPLWE RLERRIIDGE MTGIEADLDE
ALESKPALAI VNDTLLSGMK TVGELFGSGQ MQLPFVLKSA EVMKGAVAYL EPHMEKSDDD
GKGRIVLATV KGDVHDIGKN LVDIILSNNG YDVVNIGIKQ PVSAILEAAE EQRADVIGMS
GLLVKSTVIM KENLEEMNSR GLSERFPVLL GGAALTRSYV EQDLAEVFDG HVRYAKDAFE
GLRLMDAFMA VKRGDEGAEL PALRQRRVKT GAKLKVSEPE EVPARSDVST TNRVPKPPFL
GDRISKGIPL ADYAAFLDER ATFMGQWGLK AARGGEGPSY EELVETEGRP RMRMWLDRIQ
TDGLLEAAVV HGHFPCYSEG DDLVVLDEEG AERTRFTFPR QRRDRHLCLS DFFRPKESGE
LDVVSFQVVT VGSAISRATA ELFAKNAYRD YLELHGLSVQ LTEALAEYWH TRVRAELGFA
GEDPAELDAF FKLGYRGARF SLGYGACPDL EDRAKIMRLL EPERVGVTLS EEFQLVPEQA
TDAIVVHHPE ATYFNV
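
The sequences above are printed in space-separated blocks of ten characters, so they need cleaning before any analysis. The following is a minimal sketch, assuming plain uppercase nucleotide text as shown on this page; the helper names are hypothetical, and the start codon set is only a common subset of the initiators allowed under translation table 11.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in {"G", "C"} for base in seq) / len(seq)


def has_start_and_stop(seq: str,
                       starts=("ATG", "GTG", "TTG"),
                       stops=("TAA", "TAG", "TGA")) -> bool:
    """Check that a CDS is a whole number of codons with start and stop codons."""
    return len(seq) % 3 == 0 and seq[:3] in starts and seq[-3:] in stops


# The first two blocks of the gene sequence, used only as a small demonstration.
first_blocks = clean_sequence("ATGAGAGCCA GCCCCACCTT")
print(len(first_blocks))  # 20
```

Passing the full gene sequence through `clean_sequence` and then `gc_fraction` should give a value comparable to the GC content field of this record, although that is not verified here.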