Gene Ndas_1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1204 
Symbol 
ID9245054 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp1461555 
End bp1464905 
Gene Length3351 bp 
Protein Length1116 aa 
Translation table11 
GC content70% 
IMG OID 
Producthypothetical protein 
Protein accessionYP_003679149 
Protein GI297560175 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.115065 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value0.894355 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAGAAC CAGAGATCAA GTACGAAGAC CTGCCGGACA AGCCGCTGCA CGAGCTCATG 
AACATCTTCG CCGGCACCGA CGGCGACTAC AAACCGATGG AGAGCATCGA CACGGTCACC
AAGGACCCCT GGGTCGCGGA CGGGCGCTAC CACTTCCATG AGACCACCAA TCCTTTCACC
GGCTTCTCCT GGAACTACGG CGGCCACGAG ATGGACCGCT GGTGGAGTTA CAGGGAGTAC
TACACGGCGT ACACGACCGG GTACGGCGGC GGCAGCTACC CCTATGAGAC CTACGGCATC
GACTACGCCG GCTACACCAT GCAGAAGCAC CTGTGGGGCG AGAAGACCCT CAGCGACGGC
AAGGAGGACC TGCACACAGA CGACACCCCT CCCCCGCAGG ACGGGTTCCG GCAGCAGCTG
AGCAACCTGG CGCACCTGGC CGACTACGGG AACAACCAGC TGACGTCCGA GAACTGGGAC
CTCACCCGGC TGCAGAAGAC CATCGACAGC CTGGGCACGC TCTACGACAC ACTGACTCCC
TACATCGACG GTGAGGGAGG CAGTCTGAAC AAGCTGTGGC AGGAGATCGA CGTCCCCGAC
GGAGCCATGC AGGGCTCGGC CGCCGCGGCG TACAAGTACC GCCTCCAGGA CATGGCGTAC
CGGATGACCT CCGTGCACAT GGAGCTCCCC GAGTTCAAGG CGGAGCTGGA GCGGGTCAAG
ACGGTCCTGA CGTTCGCCAC AGCCGTCCTC GCCTACACGG TGGACCAGGC CCTCAACACC
AGGGGAGGGC GCGTCCAGGA GGTACTCGAC GACTGGTACT TCTACACCTC CGCGGGGACG
GAGGAGTTCA ACCCGGAGCG GGACCAGTTC CAGGTCACGA TCGCCTATCC CGAGGAGACC
GGCGGGGACA AACTAACCGG TATCCCCGGC GATCCGCCGA CCCAGGACAA CATCAACAAC
GTGCTCTACC AGAGGTTCCG CAACCACTAC GAGGGCGTCT ACGAGGCCGC CAACCAGGTG
CGCGACCTCA TGGCGCACAC CTACACGCAG GCGCACGCCA CCATACCGGT CATCGTGGAC
CCCGCTCCGC TCCCGCCGCC GACCCCCGAG ACCGGTGGCG GCGGCAACGG ACCCGGCGGC
GAGAACCCCT TCGGAGACTG GGAGATTGAC AACCCCTTCG GAGACTGGGA AATCGACAAC
CCCTTCGGAG ACTGGGACAA CGAGGGAGGC GGTCCGCCCC CGCCGCCGCC ACCGCCCCCG
CCGCCACCGC AGCTCCCGTA CAACGGCCCC GGAACCGGTG GTCAGGGCCC CGGAAACGAA
GAGTTCCTGC CCCCGCCGCC TCCGCCGAAC CTCGGGCTCA ACGGGCCCGG CACAGGCAAC
CCCGAGGGCC TGGGCGGTCC GCCACCTCCC ACCCCGGACC TGGGCCTCAA CGGGCCCGGC
TTCGATCAGC TGGGCAACAA CAGCCTCGTG CCCCCGCCGC CTCCCAACGA CCTGGGCCTC
AACGGCCCCG GCTTGGGCCA GGGGGGCAAC AACGGTTCCT TCGTGCCCCC GCCGCCTCCC
AACGACCTGG GCCTCAACGG CCCCGGCTTG GGCCAGGGGG GCAACAACGG TTCCTTCGTG
CCCCCACCGC CCCCCAACGA CCTCGGCCTC AACGGCCCCG GCTTCGGCAG CGGCGGAGGC
AACGACCCCG GCACGAACGG GCCCTTCATC CCGCCGCCTC CGCCTCCGAA CCTCGGCCTC
AACGGCCCGG GAAGCGGCCG CGGGGGCTAC AACGGCTCCC TGGAACGCGA TCCGGCGACG
GGGCTCCCGA TCAACCCCGA GACCGGCCAA CCCTTCCCCG TCGACCCCGA GACCGGACTG
CCCTACAACC CCGACACCGG GCTGCCCATC AACTACGACC CCGAGACCGG GCGGGCCCTG
CCCATCGACC CCGTCACCGG AGAACCGGTC CCCATCGGGG AGAGGGGCAG CTCCCTGGAG
TTCGACCCGG CCACAGGGCT CCCGATCAAC CCCGAGACCG GCCAACCCTT CCCCGTCGAC
CCCGAGACCG GACTGCCCTA CAACCCCGAC ACCGGGCTGC CCATCAACTA CGACCCCGAG
ACCGGGCAGG CCCTGCCCAT CGACCCCGTC ACCGGGCTGC CCATCAACTA CGACCCCCAG
ACCGGGCAGG CCCTGCCCAT CGACCCCGTC ACCGGAGAAC CGGTCCCCAT CGGGGAGAGG
GGCAGCTCCC TGGAGTTCGA CCCGGCCACA GGGCTCCCGA TCAACCCCGA GACCGGCCAA
CCCTTCCCCG TCGACCCCGA GACCGGACTG CCCTACAACC CCGAGACCGA ACTGCCCATC
AACTACGACC CCGAGACCGG GCAGGTCCTG CCCATCGACC CCGTCACCGG GCAGCCGGTC
GGTATCGACC CGGTCACCGG GCTGCCCACC GACTACGACT ACGGGTTGCA GCCGCCCCTG
CCGCCGCCCG ACCTCGGCCT CAACGGCCCA GGCACCTCCG ACGACTACGA CTACGGGCTG
CGGCCGCCGC CGCTGCCGGA GGACCTGGGC CTCAACGGCC CGGGGTCGGG CAGCGACGTA
CCGATCAACC CGATCACCGG CGAGCCCTAC GTCATCGACC CCATGACCGG TCGGCCCTAC
CCGATCAACC CCGAGACCGG GGAGCCGATC AGGAGCAACT ACACGGGCCC GGGACAGGAG
CCCCCGGACG TCCTGCCGCC GCCGCTGGCC CCCGACGACC GCGGCCTCAA CTACGGCGGT
CTCAGCCCGA GCCCCTACTC CAGCGACGGA CCCGGCGCTC CGGACAGCTC CGACCGCTCC
TCGCTGTTCG CCAACCCCGA CGGCACCCAG TACCCGGCGG GCGGCCCCCA GGGGACCTCC
GGCGGACCGG GCACTCCGGG CGGCGCCGGC GGGCTCGGCG GCGTGGGAGG GCTGGGCGGC
CAGCCCGGCG CGGGCACCGG CGGCGCGGGC GCCGGAGTCG GCGGCATGGG AGGCATGGGC
GGCATGCCGA TGATGCCCCC CATGATGGGC GGCGGCATGG GCGGTGGCGG TGGCGGCGGC
GACAACAACC GCGACCGCCA GCGCTCCACC TGGCTCTCCG AGGACGAGAA GGTCTGGGGA
ACCACCGCCG ACGCGCAGAG ATCCGCACTG GGACGCCCCG TACCCGGTCA GAGCAAGAAG
GGAGCCACCC GGCATGAGTT CGCAGATGCA ACGCCAGATG GAAGAGGAAC TGGCACAGCT
TCACAAGACG ATGGGCCTGC TGGCGGACGC AAGCGCAAAC CTGGAGGCGG CAACCGAAGA
GGTCGTGGCC AAGAACAGGC TCGTGGGCGC GAAGGTGAAC GCGACGGGTG A
 
Protein sequence
MPEPEIKYED LPDKPLHELM NIFAGTDGDY KPMESIDTVT KDPWVADGRY HFHETTNPFT 
GFSWNYGGHE MDRWWSYREY YTAYTTGYGG GSYPYETYGI DYAGYTMQKH LWGEKTLSDG
KEDLHTDDTP PPQDGFRQQL SNLAHLADYG NNQLTSENWD LTRLQKTIDS LGTLYDTLTP
YIDGEGGSLN KLWQEIDVPD GAMQGSAAAA YKYRLQDMAY RMTSVHMELP EFKAELERVK
TVLTFATAVL AYTVDQALNT RGGRVQEVLD DWYFYTSAGT EEFNPERDQF QVTIAYPEET
GGDKLTGIPG DPPTQDNINN VLYQRFRNHY EGVYEAANQV RDLMAHTYTQ AHATIPVIVD
PAPLPPPTPE TGGGGNGPGG ENPFGDWEID NPFGDWEIDN PFGDWDNEGG GPPPPPPPPP
PPPQLPYNGP GTGGQGPGNE EFLPPPPPPN LGLNGPGTGN PEGLGGPPPP TPDLGLNGPG
FDQLGNNSLV PPPPPNDLGL NGPGLGQGGN NGSFVPPPPP NDLGLNGPGL GQGGNNGSFV
PPPPPNDLGL NGPGFGSGGG NDPGTNGPFI PPPPPPNLGL NGPGSGRGGY NGSLERDPAT
GLPINPETGQ PFPVDPETGL PYNPDTGLPI NYDPETGRAL PIDPVTGEPV PIGERGSSLE
FDPATGLPIN PETGQPFPVD PETGLPYNPD TGLPINYDPE TGQALPIDPV TGLPINYDPQ
TGQALPIDPV TGEPVPIGER GSSLEFDPAT GLPINPETGQ PFPVDPETGL PYNPETELPI
NYDPETGQVL PIDPVTGQPV GIDPVTGLPT DYDYGLQPPL PPPDLGLNGP GTSDDYDYGL
RPPPLPEDLG LNGPGSGSDV PINPITGEPY VIDPMTGRPY PINPETGEPI RSNYTGPGQE
PPDVLPPPLA PDDRGLNYGG LSPSPYSSDG PGAPDSSDRS SLFANPDGTQ YPAGGPQGTS
GGPGTPGGAG GLGGVGGLGG QPGAGTGGAG AGVGGMGGMG GMPMMPPMMG GGMGGGGGGG
DNNRDRQRST WLSEDEKVWG TTADAQRSAL GRPVPGQSKK GATRHEFADA TPDGRGTGTA
SQDDGPAGGR KRKPGGGNRR GRGQEQARGR EGERDG