Gene Ndas_1751 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_1751 
Symbol 
ID9245601 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp2138402 
End bp2140018 
Gene Length1617 bp 
Protein Length538 aa 
Translation table11 
GC content75% 
IMG OID 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_003679685 
Protein GI297560711 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.748681 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0123673 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTCGAGG GATGCGTCCC CTGGCCGGAG GAGTTCGCCC GACGGTACCG GCGGGAGGGG 
TACTGGCTCG GCCGGTCCCT GGCCGAACTC TTCGACACCT GGTGCTCCGC GCACCCGGAG
CGCACCGCGC TGGTCTGCGG CTCCCGGCGC TGGACCTACC GGGAACTGCA CGAGCGGGTC
GCCCGCACGG CCGGGGGCCT GCGGCAGCGC GGGCTGGGCG GCGGCGACCG GGTCCTGGTG
CAGCTGCCCA ACACCGCCGA GTTCGTGACC GCCCTGTGCG CGCTGCTGAG GATCGGCGCC
ATCCCCGTCC TGGCGCTCAC CTCCCACCGC CGCGCCGAGC TGCTGGAGCT GTGCCGGGTC
TCGGAGGCGG TCGCCCACCT CGTCCCGGAC CGGCACCGCG GCCACGACCA CCGGGAGGAG
GCCGCGCGGG TACGGGCGGA CGCCGGGGGC GGCCTGGACG TCATCGTCGA CGGGGACCCG
GGCGCCTTCA CCCGCCTGGC CGACGTGACC GGCCCGCCGG CTCCGGCGGC CGCGACCGAC
CCGGGGGAGG TCGCCCTCTT CCTGCTCTCG GGCGGCACCA CCGGCCGGTC CAAGCTCATC
CCGCGGACCC ACGACGACTA CGCCTACAAC GTCCGCATCA CCTCCGACAA CGCCGGACTC
ACCCCCGACG ACGTCTACCT GTGCGTCCTG CCCGCCTCGC ACAACTACGC CCTCGGCTGC
CCCGGGGTGC TCGGCGCCCT CTCCCGCGGC GCCACGGTCG TGCTCAGCGA CAGCGCGGAC
GCCGAGGACG CCTTCGCCCT GGTCGAGGAG GAGGGCGTGA CCGTCACCGC CCTGGTGCCC
TCCCTGGCCG CGCTGTGGAC GGAGGCCGCG GACCTCACCC ACCGCGACCT CTCCACCCTG
CGGCTGGTCC AGGTCGGCGG CAGCAGGGCC TCGGCCGACG ACGTCGCCGA GACCGAACTC
GCCCTGGGGT GCCGCGTCCA GCAGTCCTTC GGCATGGGCG AGGGCATCCT CAGCCAGTCC
GGTCCCGACG ACTCCTTCGG CATGCGCACC ACCACGCAGG GCCGCCCCCT GTCACCGGCC
GACGAGGTGC GCGTGGTGGA CGAGGACGAC CGCCCCCTGC CCGCGTGCAC CACCGGACGC
CTCCAGGTCC GCGGCCCCTA CACGATCCGG GGCTACTACC GGGCCGAGGA GGCCAACGCC
GCCGCCTTCA CCCCGGACGG CTTCCTGCGC ACCGGAGACC TGGCCCGCCT GACCAGCTAC
GGCCACCTGG TCGTGGAGGG CAGGACCAAG GACGTCATCA ACCGGGCGGG CGACAAGATC
GCCGCGGCCG AGGTCGAGGA CGCCCTCACG GCCCTTCCGT CCGTCCGCTC CTGCGCGGTC
GTGTCCGTGC CCGACCCCCT CCTGGGCGAG GCCTCCTGCG CCTTCCTCGT CTGCTCGGGG
CCGCCGCCCT CCTCCGAGGA GGTCTCCCGG CACCTGCGCT CGCTGGGGCT GGCCGCCTTC
AAGGTCCCCG ACCGGATCGA GAACGTGGCG GAGCTCCCGC TGACCAGGGT CGGCAAGGTG
GACAAGGAAT GGCTGCGCCG CGGCCTGGAG GAAGCCGCGG ACCGGGGGAC CCGGTGA
 
Protein sequence
MLEGCVPWPE EFARRYRREG YWLGRSLAEL FDTWCSAHPE RTALVCGSRR WTYRELHERV 
ARTAGGLRQR GLGGGDRVLV QLPNTAEFVT ALCALLRIGA IPVLALTSHR RAELLELCRV
SEAVAHLVPD RHRGHDHREE AARVRADAGG GLDVIVDGDP GAFTRLADVT GPPAPAAATD
PGEVALFLLS GGTTGRSKLI PRTHDDYAYN VRITSDNAGL TPDDVYLCVL PASHNYALGC
PGVLGALSRG ATVVLSDSAD AEDAFALVEE EGVTVTALVP SLAALWTEAA DLTHRDLSTL
RLVQVGGSRA SADDVAETEL ALGCRVQQSF GMGEGILSQS GPDDSFGMRT TTQGRPLSPA
DEVRVVDEDD RPLPACTTGR LQVRGPYTIR GYYRAEEANA AAFTPDGFLR TGDLARLTSY
GHLVVEGRTK DVINRAGDKI AAAEVEDALT ALPSVRSCAV VSVPDPLLGE ASCAFLVCSG
PPPSSEEVSR HLRSLGLAAF KVPDRIENVA ELPLTRVGKV DKEWLRRGLE EAADRGTR