Gene Ndas_4595 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNdas_4595 
Symbol 
ID9248476 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardiopsis dassonvillei subsp. dassonvillei DSM 43111 
KingdomBacteria 
Replicon accessionNC_014210 
Strand
Start bp5447437 
End bp5450424 
Gene Length2988 bp 
Protein Length995 aa 
Translation table11 
GC content77% 
IMG OID 
Producttranscriptional regulator, LuxR family 
Protein accessionYP_003682488 
Protein GI297563514 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.131343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCCTGCC CGCGCACCCG TCCCGTGAGT ACCGCGAGTA CCCCAACCCA GGCCCGTCAC 
GCACGTCGCG TGCGACTCCG GGTGAGCATC GGTGAAACGA CCAGTGTGGT CGCCCGCACC
TGCCGCGTCG GGGGCCCGCT GTGCGAATGT GTGCACGTGA CTGGCAGGAA CACACAACCC
CCCGTCACCC GCCTCCGGGG TCGCGACACG GAGCAGCGGG CCCTCTCCGA CGCCCTGGAC
GACGCGCGGT CCCGCCGGGG CGCCTCCCTG CTCCTCTCCG GAGGCCCCGG GCGCGGCAAG
ACGGCGCTCC TGGAGCACCT GAGCGGGTCC GCGGGTGGTT TCACCGTGCT CCGCGCGGAC
GGGGTCGCCG ACGAGGCGGA CCTGCCCCTG GCCGGGCTCC AGCGCCTCCT CCACCCCCTC
GCGGAGGAGA GCGAGCGCCT TCCCGAGCCC CGGCGCGGCC TGCTCCGCGA CGCCCTGACC
CGGGGAGCGG TCGCCGACGC CGACCGGCTC GCCCTCTACA CCGGCCTGGT CGAACTGCTC
TCCCGCGCCG CGGCCGACCG CCCCCTCCTG CTGTGCGTGG ACGACGCCGA CCGGCTCGAC
GCGCCCTCGC TGGACGCCCT GGCCTTCGTC GCGCGGCGCC TCGCCGGAAC CCCCGTCGCC
GCCGTCCTCA CCGCGCGCGA GGGCCGCGGC AAGCCCGCCG GGGCCCGCGT GCCCGAGGCC
GACGCCGAGC CCCCTCCCGA CGCCCTCGTC CCCGGTGTCA CCGAACTCCC GCTCGCGCCA
CTGGAGGAGC GGGCGGTCCA CGACATCCTC ACCGACAGGG CCCCGGTCAC CCCGGCCTCC
GCCGTGCGCT CCGCCCTGGT CCGCGCCGCC CACGGCAACC CCGCCGCCGT CCTCGGCCTC
CTGCGGGGGC TGTCCCGGGC CCAGCTCCTG GGCGAGGAAC CCCTCCCCGC CCCGCCGCTC
CTGCCCGGCC GCCTGCGCGC CGGCTTCCTC GCCCCCTACC GCGATCTCCC CGAGCGGACC
CGCCGCCTGC TGCTGCTCGC CGCCCTCGGC GACGAGCCCC GCGTCCACCG GCTGCTGGAG
GCCTGCGAGG AACCCGGACC GAACACCACC GGCCCCGGAC CGACCGTCAC CGACCTCGAA
CCCGCCGAGG AGCGCGGCCT GGTGCGGGTG GAGGGCGACA CCGTCGTCTT CACCGACCCC
CTGGCGCGCG AGGCCATCGC CCAGGACGCC CCCGCCGGGC GACTGCGCGC CGCGCACCGG
GCCCTGGCCC GCGCCTGCGA CCCCGAACTC TCGCCCGCCG AGTTCGTCCG CCACACCGCC
TCGGGGGCCG ACGCGCCCGA CGCCGGGCTC GCCGAAGCCG CCACCGCCGC CGCCCGGCGC
GTCAAGCGGA TCGAGGGCCG CCTCGCCGCC TCCCACGCCT ACGAGCGCGC CGCCGACCTG
TGCCCCGACC CCGACGAGCG CGCCTGCCGA CTGAACACGG CCTCCTACGA GGCCTACATG
GCCGGGAGCT CGGCGCGCGC CACCCGGCTG CTCGCGCGGG CGCGCCCCCT GGCGGTGACG
GACCGGCGGC GGGCGACCTC CGACCTCATC GACGCCCAGA TCGCCATGCG CGGCGAGAAC
GCCATGGACG TCGCCGAACG CCTGCTCACC GTAGGCCGCG AACTCATCCC CCACGACCGC
TTCCTGGCCC TGCGCGCCCT CGTGCGCTCG GCCGACGCCG CCTCCCTCGC CGGGGACGCC
GTCCGCCACG GCCGGGCCGC CGAACTGGCC CTGCCCCTGG TCGGCCCGGA CGACCCGGCG
CCGATGCGCA TGGTCGCGTC CTTCCTGGAG GGGTGCGCGG TCTCCTTCCG CGGCGACTAC
CCCGGCTCCA CCCCGCTGCT GCGCGAGGCC ACCGGGCTGG CCGCGATCGC CAAGCCCTCC
GAACTGGTCT GGGCGGGCAT CAGCGGCCTG CGCCTGGGCG ACGCGCCGTT CGTGCGCTCG
GTGACCTCGC GCGCCGTCGA GGTCGGCCGC CTGCGCGGCG AACGGGCCAC CCTGCCCGCC
GCGCTGGGCT TCCTGGTCTT CTCCGAGTTC TGGAGCGGGC GCTTCCCCTC GGCCGCGGGG
ACCGCCCTGA CCGGCCTGCG GGTCTCCCGC GAGACCGGCC AGACCGTGTG GGCCACCCAG
CACCTGGCGT CCCTGGCGAT GATCGCCGCC ATCCAGGGAG ACGTGGACAC CTGCCGCATC
CGGGCGCGCG CGGTCGCCGC CCAGGCGGGG GAGAACAGCC TCGGCCTCGC CGCCGCCCTG
TCGGCGTGGG CACTGGCCGT CCTGGAGCTG TCCCGGGGCA ACGCCGCGGA GGCGTTCTTC
CGGCTGCGCG CCCTGGTCCA CGCCGCCCCA GGGCACGGCC ACCCCACCAT GCGGCTGCTC
ACCGCCCCGC ACTTCGTGGA GGCGGCGACC CGCATGGGCG AGACCGAGTG GGCGCGCACC
TCCCTGGCCG GGTACCGGCG CTGGGCCGAG TCGGTGGGCA GCCCGAGCAC GCTGGCGCTG
GCCGCCCGGG GCTCGGGCCT GCTGGCCGCG GGCGACGAGG CATGCGACCA CTTCGAGAAC
GCGCTGGCCC TGCACCGGGC CTGCGGCGAC GACGACGTCG AGCACGCGCG CACCCAGCTG
CTGTTCGGCG CCCACCTGCG CCGGGCCCGC CTGCCCGGCC GGGCCCGCGA GCACCTGTAC
AACGCGCTGG AGTCCTTCGA ACGCTTCGGG GCGCGGCTGT GGGTGCGCCA GACCCGCGCC
GAGCTGCGCG CGATCGGGAC CGCCGAACGC GGCCCCGACC CCGTCTCCAC CAGCGAGCTG
ACCGCGCAAC AGCAGCAGAT CGCCCGGCTG GTCGCCGAGG GGGCCACCAA CCGCGAGGTG
GCCGCCCACA TGTTCATCAG CCCGCGCACG GTCGAGCACC ACCTGCGCGG CATCTTCCGC
AAGCTCAACA TCAGGTCCCG CGTGGACCTG GCCCGCCTGT TCAACTGA
 
Protein sequence
MPCPRTRPVS TASTPTQARH ARRVRLRVSI GETTSVVART CRVGGPLCEC VHVTGRNTQP 
PVTRLRGRDT EQRALSDALD DARSRRGASL LLSGGPGRGK TALLEHLSGS AGGFTVLRAD
GVADEADLPL AGLQRLLHPL AEESERLPEP RRGLLRDALT RGAVADADRL ALYTGLVELL
SRAAADRPLL LCVDDADRLD APSLDALAFV ARRLAGTPVA AVLTAREGRG KPAGARVPEA
DAEPPPDALV PGVTELPLAP LEERAVHDIL TDRAPVTPAS AVRSALVRAA HGNPAAVLGL
LRGLSRAQLL GEEPLPAPPL LPGRLRAGFL APYRDLPERT RRLLLLAALG DEPRVHRLLE
ACEEPGPNTT GPGPTVTDLE PAEERGLVRV EGDTVVFTDP LAREAIAQDA PAGRLRAAHR
ALARACDPEL SPAEFVRHTA SGADAPDAGL AEAATAAARR VKRIEGRLAA SHAYERAADL
CPDPDERACR LNTASYEAYM AGSSARATRL LARARPLAVT DRRRATSDLI DAQIAMRGEN
AMDVAERLLT VGRELIPHDR FLALRALVRS ADAASLAGDA VRHGRAAELA LPLVGPDDPA
PMRMVASFLE GCAVSFRGDY PGSTPLLREA TGLAAIAKPS ELVWAGISGL RLGDAPFVRS
VTSRAVEVGR LRGERATLPA ALGFLVFSEF WSGRFPSAAG TALTGLRVSR ETGQTVWATQ
HLASLAMIAA IQGDVDTCRI RARAVAAQAG ENSLGLAAAL SAWALAVLEL SRGNAAEAFF
RLRALVHAAP GHGHPTMRLL TAPHFVEAAT RMGETEWART SLAGYRRWAE SVGSPSTLAL
AARGSGLLAA GDEACDHFEN ALALHRACGD DDVEHARTQL LFGAHLRRAR LPGRAREHLY
NALESFERFG ARLWVRQTRA ELRAIGTAER GPDPVSTSEL TAQQQQIARL VAEGATNREV
AAHMFISPRT VEHHLRGIFR KLNIRSRVDL ARLFN