Gene Ndas_3609 details

Gene Information

Locus tag: Ndas_3609
Symbol:
ID: 9247478
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111
Kingdom: Bacteria
Replicon accession: NC_014210
Strand:
Start bp: 4325798
End bp: 4328254
Gene Length: 2457 bp
Protein Length: 818 aa
Translation table: 11
GC content: 75%
IMG OID:
Product: transcriptional regulator, LuxR family
Protein accession: YP_003681515
Protein GI: 297562541
COG category:
COG ID:
TIGRFAM ID:
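
The length fields above can be cross-checked against the coordinates. The following is a minimal sketch, not part of the original record. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon; both assumptions are consistent with the listed values but are not stated by the record. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count of a CDS, assuming the final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(4325798, 4328254)
print(length, protein_length(length))  # prints: 2457 818
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.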


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 22
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTCCCCTC CCCGCCACAA CCTGCCCCTG TCCGCCGGCG AGTTCATCAG CTTCGTCGGA 
CGCGAGCGCG ACATCGTCGA CCTGGGCCGC CTCCTGGGCA CCGCCCGGAT GGTCACGCTG
ACCGGTACCG GCGGTATCGG CAAGACGCGC CTGGCCCTGC ACCTGGCCGA ACGCGTCATG
CGCCGGTTCC CCGACGGCGT GCGCTTCGTC GACCTGAGCG AGGCCTCCAC CCACGACCAG
GCGCTGCGCG CGGTCGCGGG CAGCCTCCAG GCGGTGCAGG ACGACTCGCG CACCATGACG
GACGCCGTCA TCACCTCGCT GCGCACCCGC AACCTGCTGC TGCTCCTGGA CACCTGCGAG
CACGCCGTGG GGCCGATGGC GCAGCTGTGC CAGGCCGTGC TGCGCGACTG CCCCCGGGTG
CGCATCCTGG TGACCAGCCG CCAGCCCCTG CACGTTCCCG AGGAGAACAT CTGGCGGGTG
CCGCCGCTGT CCCTGCCAGC GCGGCCCACC CCCACCGACC CCTACGCGGC CGACCCCGCG
CCCATCCCCC GGCGCGACAC CCAGCGCTAC GAGTCGGTGC GCCTGTTCGT GACCCGCGCG
CACGCCGCGC GCGCCGGGTT CGAGATGACC AGGGAGAACT CCGGTTACAT CGCGGAGATC
TGCCGGATGC TGGACGGCAT GCCGCTGGCG ATCGAGCTGG CCGCCGCACG GGTGCGGGTG
CTGTCGGTGC AGCAGATCCT GCGCCGCCTG GACGACCGCT TCCAGCTGTT GACCAGCGAC
GGGTCGGAGG ACCTGCCGCC GCGCCAGCGG ACGCTGCGCG CGGTCCTGGA GTGGAGCCAC
GAACTGCTCA CCGAGCCGGA GCGGCTCCTG CTGCACCGGT TGTCGGTGTT CTCCACCTGG
TACCTGGAGG CCGCGGAGGA CGTCTGCTCG GGCGAGGGGG TCGATCCCGC CGACATCCTG
CCGCTGCACT TCTCCCTGCT GGACAAGTCG CTGGTGGTGA TGGACGCCGA GGTCGACGGC
ACCACCCACT ACCGGCTGAC CGAGACGGTG CGCGCCTACG CCGCCGAGCA CCTGGCCGAC
AGCGGCGGCG AGGACGACCG GTGGGAACGC TACCTGCGCT TCTGCGTCGC CCGTCTGGAG
GAGTGGGCCA AGAGCTGCTG CGCGCCGATG CCGTGGGGGG AGCGCCTGGG GCACCTGCGG
CTGCTCGACC ACCACCGGGA GAACCACGCC CGTGTGGTGG ACTGGGCGCT CTCGCGCGGA
CGCGTGGACG AGGCCCTGCG GGTGTGCGTG GCGCTGCGCT CCTACTGGAT CGTGCGCGAC
CTGGCCGCCG AGGGCAGCCG CCTCCTGGAG CGGGCGCTGT CCACCGACTC CGACGCGCAG
TCCCCCCACC TGCGGGCGCG CGCCCTGGCC CTGCACGCCG AGCTGCGGCT GGACCTGGAC
GCGGCGCCGC GCGTGTCCAC CCTGGCCCTG TGCGCCCTGG AGTCGGCGCG GGCCTGCCGT
GAGGCGGGAG CCGCCGCCTC CGCGCTGGCC ACCCTGGCGG CGCTGTGCCT GCGCACCGGC
ACCCTGGACG AGGGCCAGGA GCACGCCGAG CGGGCCGGGG TGTGGGCCTC CCAGGTCAGC
GACCCCATCA CCGAGGCGGC CACGCTGGAC GTGCTGGCCC AGCTGGCCCG CCGCCGCGGC
GAGGCCGAAC GGGCCACGGC CCTGCTGGAG CACAGCGTCG CGGTGGCCGA CAACCTCGGC
GACCGCTGGA GCGCGGCCCG CGCCCTGCAC GGGCTGGGCT CGATCGCCGC CGAGCGCGGC
GACTTCACGC GGGCGCGGGA GCTGTTCTCC GACGCCCTGT CGGTCTTCGG CGAGCTGGAG
GCCGCCCCGG CCACCGCGCA CTGCGCCCGG GAGCTGGGCC GCCTGTACCT GGCCGAGGGC
GAGACGCTGC TCGCCCGGGA GCCCCTGGCG ACGTGCCTGC GGGTGAGCTT CACCTCCGGC
CGCAGGCTCG CGGTGGCCCG GGCCCTGGAG GCGCTGGCGG AGCTGGCCCT GGCCGAGGAG
GAGGCCGAGC GGGCCGCCGC CCTGGTGGGG GTGGCCGCCG ATCTGAGGGC GGCGCTGGAC
CGCCCCTCGG CCGAGACGCT GCGGCTGCGG TCGGCGGCGG AGCGGCGCGT GGGCGCCACG
CGCACCGCCG AGGCGTGGAA CGCGTGGCGC CCCCTGCCGC TGGAGCAGGT GCTCGACCGC
GCGCTGGCCT TCCCCCGGCC GCAGTCCTCG GGGCCCTCGG GCCCGGATTC GCTGACCCCC
CGGGAGAGGG AGGTGGCCGA ACTGGCCGAA GAGGGCCTGT CCAACCGCGA GATCGCCGAG
CGGCTGACCA TCAGCCACGC GACGGCGGCC CGCCACATCG CGAACATCTT CAGAAAACTC
TCGATTTCCT CCAGGACGCA GTTGACCGGC TGGGCCGTCC CGGGTGACGG TGGCTGA
 
Protein sequence
MSPPRHNLPL SAGEFISFVG RERDIVDLGR LLGTARMVTL TGTGGIGKTR LALHLAERVM 
RRFPDGVRFV DLSEASTHDQ ALRAVAGSLQ AVQDDSRTMT DAVITSLRTR NLLLLLDTCE
HAVGPMAQLC QAVLRDCPRV RILVTSRQPL HVPEENIWRV PPLSLPARPT PTDPYAADPA
PIPRRDTQRY ESVRLFVTRA HAARAGFEMT RENSGYIAEI CRMLDGMPLA IELAAARVRV
LSVQQILRRL DDRFQLLTSD GSEDLPPRQR TLRAVLEWSH ELLTEPERLL LHRLSVFSTW
YLEAAEDVCS GEGVDPADIL PLHFSLLDKS LVVMDAEVDG TTHYRLTETV RAYAAEHLAD
SGGEDDRWER YLRFCVARLE EWAKSCCAPM PWGERLGHLR LLDHHRENHA RVVDWALSRG
RVDEALRVCV ALRSYWIVRD LAAEGSRLLE RALSTDSDAQ SPHLRARALA LHAELRLDLD
AAPRVSTLAL CALESARACR EAGAAASALA TLAALCLRTG TLDEGQEHAE RAGVWASQVS
DPITEAATLD VLAQLARRRG EAERATALLE HSVAVADNLG DRWSAARALH GLGSIAAERG
DFTRARELFS DALSVFGELE AAPATAHCAR ELGRLYLAEG ETLLAREPLA TCLRVSFTSG
RRLAVARALE ALAELALAEE EAERAAALVG VAADLRAALD RPSAETLRLR SAAERRVGAT
RTAEAWNAWR PLPLEQVLDR ALAFPRPQSS GPSGPDSLTP REREVAELAE EGLSNREIAE
RLTISHATAA RHIANIFRKL SISSRTQLTG WAVPGDGG
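
The sequence blocks above are laid out in space-separated groups of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup and of a GC-fraction calculation. The helper names `clean_sequence` and `gc_content` are hypothetical, and the record does not say how its own GC content figure was computed.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, copied as displayed in the record.
first_line = (
    "ATGTCCCCTC CCCGCCACAA CCTGCCCCTG "
    "TCCGCCGGCG AGTTCATCAG CTTCGTCGGA"
)
seq = clean_sequence(first_line)
print(len(seq), seq[:3])  # prints: 60 ATG
```

Applying `clean_sequence` to the full gene sequence block and passing the result to `gc_content` gives a value to compare against the GC content field.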