Gene Ndas_3333 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_3333 
Symbol 
ID9247195 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp3985382 
End bp3986941 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content69% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003681245 
Protein GI297562271 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAACC TGTCCGTGCT CCTGGAGGAC GGTGCCCGGT CGGCTCCGGA GAAGGAGTGC 
CTGGTCTTCG GCGACCTGCG CCTGAACTAC GCGATCACCA ACATGATCGC CAACCAGGTG
GCCAACCTGC TGGTCGCGCG CGGGATCAGG CCGGGGGACC GGATCGCGCT GGCCAGTCCG
AACGTGCCCT ACTTCCCCTT CGTGTACTTC GGCGCGCTCA AGGCGGGCGC GGTGATCGTG
CCGCTGAACG TGCTGCTGAC CCCGCGCGAG ATCGCCTACC ACCTGGAGGA CTCCGGGGCC
AGGGCCCTCT TCGCCTTCAC CGGCACGCCG GAGCTCCCCC TGGGCGATCG GGCCTTCGAG
GCCTTCGGCC AGGTGGACTC CTGCGAGTTC TACGTGGACC TGCCCGCCTC CCCGGGTGCC
ACGGAGTCGA CCATCGACGG CGCCGAGACC TTCTGGAAGG CGCTGGAGGG TCAGCCCGGC
GAGTTCGAGA CGGTGCAGGC CAGCTCCGAC GACACCGCGG TCATCATCTA CACCAGCGGC
ACCACCGGCA AGCCCAAGGG CGCCGAGCTG AGCCACCACA ACCTGCTGCT GAACGCGGTC
GGCTCGTCCA GGCTGGTCGA GCCCCACCCC GACGGCCGCG ACGTCGCCCT CGTGGTCCTG
CCGCTGTTCC ACATCTTCGG CCAGACGGTC ATGCTCAACG CGGCCCTGTA CCGGCACGGC
ACGATGGTCC TCATGCCGCG CTTCGACGGC GCCGAGGCGC TCAGGCTCAT GGAGAAGGAG
GGCGTCACCG CCTTCGCGGG CGTGCCCACC ATGTACTGGG GGCTGCTGGG CGCGGTCCGG
GCCGCCGAAC CGGGAACCTA CGACCTGGAG AAGATCGCGG GCAACGTGGT GGACGCGGTC
TCCGGCGGCG CCGCCCTGCC CGGCCAGCTC GCCGAGGACT TCACCAAGAC CTTCGGCGTG
GGCATCAAGG AGGGCTACGG CCTGTCCGAG ACCTCACCGG TGGTGTCGTT CAACAACCCC
AAGGTCAAGG CCAAGACCGG CTCCATCGGG CAGCCGGTGT GGGGTGTGGA GATGAAGCTC
ATCGACCCCG AGTGGAACGA GGCGGCGGAG GAGGGCGAGA TCGCGGTGCG CGGGCACTGC
GTGATGAAGG GGTACCACAA CCGCCCCGAG GTCAACGCGG AGGTGATCCG CGACGGCTGG
TTCCGCACCG GGGACATCGC CCGCCGCGAC GAGGAGGGTT TCTACTTCAT CGTCGACCGG
TCCAAGGACA TGATCATCCG CGGCGGCTAC AACGTCTACC CGCGCGAGGT CGAGGAGGTC
CTGATGACCC ACGAGGCGGT CAGCCTGGCC GCGGTCGTGG GCGTGCCCCA CGACACCCAC
GGCGAGGAGG TCAAGGCGTT CGTGATCCCC AAGGAGGGCG CCGAGGTCAC CGCCGAGGGG
CTCGTCGCGT GGGCCAAGGA GCGCCTGGCC GGGTACAAGT ACCCGCGCGA GGTGGAGTTC
CGCACCGAGC TGCCGATGAC CTCCACCGGC AAGATCCTCA AGCGCGAGCT GCGCGGCTGA
 
Protein sequence
MLNLSVLLED GARSAPEKEC LVFGDLRLNY AITNMIANQV ANLLVARGIR PGDRIALASP 
NVPYFPFVYF GALKAGAVIV PLNVLLTPRE IAYHLEDSGA RALFAFTGTP ELPLGDRAFE
AFGQVDSCEF YVDLPASPGA TESTIDGAET FWKALEGQPG EFETVQASSD DTAVIIYTSG
TTGKPKGAEL SHHNLLLNAV GSSRLVEPHP DGRDVALVVL PLFHIFGQTV MLNAALYRHG
TMVLMPRFDG AEALRLMEKE GVTAFAGVPT MYWGLLGAVR AAEPGTYDLE KIAGNVVDAV
SGGAALPGQL AEDFTKTFGV GIKEGYGLSE TSPVVSFNNP KVKAKTGSIG QPVWGVEMKL
IDPEWNEAAE EGEIAVRGHC VMKGYHNRPE VNAEVIRDGW FRTGDIARRD EEGFYFIVDR
SKDMIIRGGY NVYPREVEEV LMTHEAVSLA AVVGVPHDTH GEEVKAFVIP KEGAEVTAEG
LVAWAKERLA GYKYPREVEF RTELPMTSTG KILKRELRG