Gene Ndas_2721 details

Gene Information

Locus tag: Ndas_2721
Symbol:
ID: 9246572
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 3242250
End bp: 3245615
Gene Length: 3366 bp
Protein Length: 1121 aa
Translation table: 11
GC content: 69%
IMG OID:
Product: putative DNA methyltransferase
Protein accession: YP_003680641
Protein GI: 297561667
COG category:
COG ID:
TIGRFAM ID:
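
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the unspliced CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


length = gene_length_bp(3242250, 3245615)
print(length)                     # 3366
print(protein_length_aa(length))  # 1121
```

Both values agree with the Gene Length and Protein Length fields reported in this record.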


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGCGC TGCCGCACGT TTCCGAACCG AGCCGCGACT CCGGGAGGGG CCACCTCCGC 
GGACACGGCG TCACCAACCG ATTGGTCCAG GTTTTCGTCA ACGACGTCGT CCGTGTCCTG
GGACGCGGTG GTGAGGACGA GGAGCAGCTC CGGGGCCCGC TCAAGGTGCT CATCGAGAGC
ATGGGCGCCT CGCTGGGGCT GCCGAACAGC GTCTACGGCG AGGTCTCCCT CCCCGACCTC
CAGGCAAGAC CGGACTTCGG GGTCGACCTC ACCGGAATCG ACCAAGGGCC ACACGGCAGG
GTCGGATACA TCGAACTCAA ACGTCCGGGA AAACCGATTC CTCCCGAGAG GTTGACCGAC
AGACGCGACC GCGAGCAGTG GCAGAAATTC CAGAGCCTGC CGAATGTCCT CTACACCAAC
GGAACGGAAT GGTCCCTGTT CCGCCACGGA AACCCGGTCC TGAGAACGGT GCGCACACCC
GACCTGACAT CAGGGCGAAA GAGGTTCCCC CAGGTCGACC CGGTGTTCAC CGATCTGATC
CGCGAGTTCC TCCTCTGGCA GCCGGAGGCC GAACCCGGGC TGGACCGACT GATCACCCGG
GTGGCGGATC TGTGCGCCCA GCTCCGCGAG GAGATCTCCG AGATCCTCAC CGATGAAAGG
GTGCACGGCC GGGGCACCCC CTTCATCCGC CTGGCACACG AATGGCGCGA CCTGCTCTTC
CCACACCTCA AACCCGACAG GGAGTTCGCC GACGTCTACG CGCAGACCGT CACCTTCTCC
CTGATCCTGG CCCGCGAGGA CGGGGTGGAC TTCGGCGGCC GCGACTTGGA CGGCATCGGT
GAACTCCTGG GCAAACGGCA TGCCTTCCTG GGCCAGGCCT TCTCCCTGCT CACCGAGTCC
CGGGACGTGC GGACCATCAC CGTCCTGCCC ACACTCGTCC GGGTCCTGGA GGCGGTGGAC
TGGCTCCGGC TGACACGCGG ACGGCCCAGG GCGCACGCCG ACCTGTACGA GACCTTCCTG
ACCAGATACG ACCCCGCGCT GCGCAAGAGC TCCGGCTCCT ACTACACGCC CGCCCCCGTA
GCGGACTTCC TCACGGAGTT CACCGACTCA GTCCTGCGAA AGCGCATGGA CCTGCCGCTC
GGTTTCGCCG ACCGCTCGGT GACCACGGTG GATCCGGCAA TGGGGAGCGG GACCTTCCTG
TCCTCGGCCA TGGACCGGGC CCGCCGCAAC CTGGAGGAGG AGTTCGGCCC CGTACACACA
CGCACGTGCC TCAAGGACCT GTACCGCGAC CGCCTCGCCG GCTTCGAACG CAGCACTGCG
GCCTTCGCGG TGTCCGAACT CAGGCTGCAC CAGCAGCTCA GCGAGCAGTA CGGCGCGGAG
GTCCCCGAGG AACACCGACG GTTCCTGTGC AACACCCTGG ACGACCCGAA CCACCATTAC
CAGTCCTTCG GACGGCGCTA CGACGACCTG GTCCACTTCC GGGATCAGGC CAACCAGGTC
AAGAACTCCA CCCCGGTGAT GGTGGTCATC GGCAACCCGC CCTACATCGA GAGCGCGAAA
CAGCGGGACC CCGCCCCCTG GCTGGAGCGG CGCCGGTCCC CGGCTGGTGA TCCGGTCACC
TCGCGCCCGT CCATGGACGA GTTCCGGGAG CTGGGACAGG GCGGTCTCGA CTACAAGCTC
TCCGCGGTCA GCCTCTACTT CTGGCGCTGG GCGACCTGGA AGGCCTTCGA CGCCCACCCC
GAACAGCCCA GCGGCGTCGT CGCCTTCGTC AGCACCTCCG CGTACCTGAC GGGGGACGCC
TTCGCCGGTA TGCGCCGCTA CCTGCGCTCG ACCGCCGATG AGGGGTGGAT CGTCGACCTC
TCCCCCGAGG GGCACCGTCC CCCGGCCAAC ACGCGCGTCT TCGGCGGGGT ACAGCAACCG
GTCTGTATCG GGATCTTCGC CCGCTACGGA CATCCCCGGC CCGACGTCCC CGCCCGGATC
TGGCACGCGA GCGTAGAGGG GCCGCAAGCC GAGAAGTTCG CGGCGCTCAA ATGCGACGGA
GGGCTGCGGC TCGACGGCGG GAACTGGAAG GAGTGTCCAG AGGGATGGAC CGACCCCTTC
CACCCGCAGC ACGGGCGGTG GGGTCTCCTC CCGGCCGTCG GTGACTTGAT GCCGTGGCGG
TCGCCGGGGG TGACCACCAA CCGGACGTGG GTCATCACCC CCGACCGTGC CACCTTCAAC
CGGCGGTGGG AGAGGCTCGC CAGCGCCCGG CCAGAGGACC AGGACCGATT GTTCAAGGCC
ACCCGTGATC GGAGCGTGCA CCGCCCGTTC CCCGGTCGGC ACACCATTGC CGAGGACCGT
GTCCCCCCGC GTACCGCCTT GATCTCACAC CGTGCCTTCG GCCACCAGCA CATGGCGGAC
GATCCCCGAT ATGTGGACTT CCGTCGCCCG GGCCTGTGGG CGGCCAACGG CGACCATCAG
ATCCACGTGG TCGAACAGCA CGCCGAGGCG ATCTCCTCAG GCCCCGGTCT CCTGTTCAGC
GCGCTCGTTC CGGATGTGCA CTACTTCAAC GGCCGCGGCG GTCGAGTGCT GCCGCTCTAC
CGCGACCCCT CCGGCAGTTC TCCCAACCTC GCGCCGGGAC TGCTGGCGCA CCTCACCGAA
CGACTGGGGC GCGGTGTCGG TCCCGAGGAC GTCGTCGCCT ACCTCGCCGC CGTCGCGGCC
CACCCGGGGT ACACCGCGGC CTTCCGGGAG GACCTGCGGA CTCCCGGAGC TCGCATCCCC
CTCACCGCTG ACCCGAAGCT GTGGGACGGG GCCGTGGAAC TCGGCCGTGA GGTCGTGTGG
CTGCACACCT ACGGCGTCCG TATGCGCGAC CCCGGGGCGG GTCGTCCAGC GCACATGCCC
AGGCTCCCTC GGCAGCGCCG CCCCCAGGTG CTCAGGGAGA TTCCCGACCG TTCCGGCCAC
CTGCCCGACC GGGTCTGGCA CGACGGCGGA CAGGGTTCCG GAGGGCCGCG CCTGCACGTG
GGTGAGGGTG TCATCGGCCC GGTGGAAACG GCCGCCTGGG AGTACGAGGT GGGCGGCATG
CACGTCATCA GGAAGTGGTT CTCCGCCAGG GAGCGCGATC CCAGGCACGT ACGCAGGGGC
TCTCCCCTGG ACGACATCCG ACCCGATCGC TGGACGCCCG AGTTCACCGA ACAGCTCCTC
CACCTGATCA CCGTGCTCAC CCGACTGGTG GATCTGGAAC CGCGTCAGCT GGACCTCTTC
GAACGGATCC GCACGGGGCC CCTGGTCACG ACGCAGGAGC TCGAGGGGGC GCGGATCTTG
CCCGTCGCGC CCGGGGTGCG CAAGGGGCCC CGAAGCAGCG GCCAAGGAGA GTTGCCCCTG
ACCTGA
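
The gene sequence above is printed in space-separated blocks of 10 bases, so it has to be joined before any computation. The following is a minimal sketch of how the blocks could be joined and the GC fraction computed; the helper names are hypothetical, and the short input string is an illustrative example rather than data from this record.

```python
def join_blocks(blocked: str) -> str:
    """Remove all whitespace (spaces, newlines) from a blocked sequence listing."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C; raises on an empty sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Illustrative input only, formatted like the listing above.
example = "GGCC AATT\nGGCC AATT"
print(gc_fraction(join_blocks(example)))  # 0.5
```

Applied to the full gene sequence, the result multiplied by 100 and rounded should be comparable to the GC content field of this record, assuming that field was computed over the same span.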
 
Protein sequence
MTALPHVSEP SRDSGRGHLR GHGVTNRLVQ VFVNDVVRVL GRGGEDEEQL RGPLKVLIES 
MGASLGLPNS VYGEVSLPDL QARPDFGVDL TGIDQGPHGR VGYIELKRPG KPIPPERLTD
RRDREQWQKF QSLPNVLYTN GTEWSLFRHG NPVLRTVRTP DLTSGRKRFP QVDPVFTDLI
REFLLWQPEA EPGLDRLITR VADLCAQLRE EISEILTDER VHGRGTPFIR LAHEWRDLLF
PHLKPDREFA DVYAQTVTFS LILAREDGVD FGGRDLDGIG ELLGKRHAFL GQAFSLLTES
RDVRTITVLP TLVRVLEAVD WLRLTRGRPR AHADLYETFL TRYDPALRKS SGSYYTPAPV
ADFLTEFTDS VLRKRMDLPL GFADRSVTTV DPAMGSGTFL SSAMDRARRN LEEEFGPVHT
RTCLKDLYRD RLAGFERSTA AFAVSELRLH QQLSEQYGAE VPEEHRRFLC NTLDDPNHHY
QSFGRRYDDL VHFRDQANQV KNSTPVMVVI GNPPYIESAK QRDPAPWLER RRSPAGDPVT
SRPSMDEFRE LGQGGLDYKL SAVSLYFWRW ATWKAFDAHP EQPSGVVAFV STSAYLTGDA
FAGMRRYLRS TADEGWIVDL SPEGHRPPAN TRVFGGVQQP VCIGIFARYG HPRPDVPARI
WHASVEGPQA EKFAALKCDG GLRLDGGNWK ECPEGWTDPF HPQHGRWGLL PAVGDLMPWR
SPGVTTNRTW VITPDRATFN RRWERLASAR PEDQDRLFKA TRDRSVHRPF PGRHTIAEDR
VPPRTALISH RAFGHQHMAD DPRYVDFRRP GLWAANGDHQ IHVVEQHAEA ISSGPGLLFS
ALVPDVHYFN GRGGRVLPLY RDPSGSSPNL APGLLAHLTE RLGRGVGPED VVAYLAAVAA
HPGYTAAFRE DLRTPGARIP LTADPKLWDG AVELGREVVW LHTYGVRMRD PGAGRPAHMP
RLPRQRRPQV LREIPDRSGH LPDRVWHDGG QGSGGPRLHV GEGVIGPVET AAWEYEVGGM
HVIRKWFSAR ERDPRHVRRG SPLDDIRPDR WTPEFTEQLL HLITVLTRLV DLEPRQLDLF
ERIRTGPLVT TQELEGARIL PVAPGVRKGP RSSGQGELPL T