Gene Information |
Locus tag | Ndas_4595 |
Symbol | |
ID | 9248476 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 5447437 |
End bp | 5450424 |
Gene Length | 2988 bp |
Protein Length | 995 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | transcriptional regulator, LuxR family |
Protein accession | YP_003682488 |
Protein GI | 297563514 |
COG category | |
COG ID | |
TIGRFAM ID | |
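The length fields above follow from the coordinates. A minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein (the variable names are illustrative and not part of the record):

```python
# Values copied from the Gene Information table.
start_bp = 5447437
end_bp = 5450424

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length rows.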
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.131343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGCCCTGCC CGCGCACCCG TCCCGTGAGT ACCGCGAGTA CCCCAACCCA GGCCCGTCAC GCACGTCGCG TGCGACTCCG GGTGAGCATC GGTGAAACGA CCAGTGTGGT CGCCCGCACC TGCCGCGTCG GGGGCCCGCT GTGCGAATGT GTGCACGTGA CTGGCAGGAA CACACAACCC CCCGTCACCC GCCTCCGGGG TCGCGACACG GAGCAGCGGG CCCTCTCCGA CGCCCTGGAC GACGCGCGGT CCCGCCGGGG CGCCTCCCTG CTCCTCTCCG GAGGCCCCGG GCGCGGCAAG ACGGCGCTCC TGGAGCACCT GAGCGGGTCC GCGGGTGGTT TCACCGTGCT CCGCGCGGAC GGGGTCGCCG ACGAGGCGGA CCTGCCCCTG GCCGGGCTCC AGCGCCTCCT CCACCCCCTC GCGGAGGAGA GCGAGCGCCT TCCCGAGCCC CGGCGCGGCC TGCTCCGCGA CGCCCTGACC CGGGGAGCGG TCGCCGACGC CGACCGGCTC GCCCTCTACA CCGGCCTGGT CGAACTGCTC TCCCGCGCCG CGGCCGACCG CCCCCTCCTG CTGTGCGTGG ACGACGCCGA CCGGCTCGAC GCGCCCTCGC TGGACGCCCT GGCCTTCGTC GCGCGGCGCC TCGCCGGAAC CCCCGTCGCC GCCGTCCTCA CCGCGCGCGA GGGCCGCGGC AAGCCCGCCG GGGCCCGCGT GCCCGAGGCC GACGCCGAGC CCCCTCCCGA CGCCCTCGTC CCCGGTGTCA CCGAACTCCC GCTCGCGCCA CTGGAGGAGC GGGCGGTCCA CGACATCCTC ACCGACAGGG CCCCGGTCAC CCCGGCCTCC GCCGTGCGCT CCGCCCTGGT CCGCGCCGCC CACGGCAACC CCGCCGCCGT CCTCGGCCTC CTGCGGGGGC TGTCCCGGGC CCAGCTCCTG GGCGAGGAAC CCCTCCCCGC CCCGCCGCTC CTGCCCGGCC GCCTGCGCGC CGGCTTCCTC GCCCCCTACC GCGATCTCCC CGAGCGGACC CGCCGCCTGC TGCTGCTCGC CGCCCTCGGC GACGAGCCCC GCGTCCACCG GCTGCTGGAG GCCTGCGAGG AACCCGGACC GAACACCACC GGCCCCGGAC CGACCGTCAC CGACCTCGAA CCCGCCGAGG AGCGCGGCCT GGTGCGGGTG GAGGGCGACA CCGTCGTCTT CACCGACCCC CTGGCGCGCG AGGCCATCGC CCAGGACGCC CCCGCCGGGC GACTGCGCGC CGCGCACCGG GCCCTGGCCC GCGCCTGCGA CCCCGAACTC TCGCCCGCCG AGTTCGTCCG CCACACCGCC TCGGGGGCCG ACGCGCCCGA CGCCGGGCTC GCCGAAGCCG CCACCGCCGC CGCCCGGCGC GTCAAGCGGA TCGAGGGCCG CCTCGCCGCC TCCCACGCCT ACGAGCGCGC CGCCGACCTG TGCCCCGACC CCGACGAGCG CGCCTGCCGA CTGAACACGG CCTCCTACGA GGCCTACATG GCCGGGAGCT CGGCGCGCGC CACCCGGCTG CTCGCGCGGG CGCGCCCCCT GGCGGTGACG GACCGGCGGC GGGCGACCTC CGACCTCATC GACGCCCAGA TCGCCATGCG CGGCGAGAAC GCCATGGACG TCGCCGAACG CCTGCTCACC GTAGGCCGCG AACTCATCCC CCACGACCGC TTCCTGGCCC TGCGCGCCCT CGTGCGCTCG GCCGACGCCG CCTCCCTCGC CGGGGACGCC GTCCGCCACG GCCGGGCCGC CGAACTGGCC CTGCCCCTGG TCGGCCCGGA CGACCCGGCG 
CCGATGCGCA TGGTCGCGTC CTTCCTGGAG GGGTGCGCGG TCTCCTTCCG CGGCGACTAC CCCGGCTCCA CCCCGCTGCT GCGCGAGGCC ACCGGGCTGG CCGCGATCGC CAAGCCCTCC GAACTGGTCT GGGCGGGCAT CAGCGGCCTG CGCCTGGGCG ACGCGCCGTT CGTGCGCTCG GTGACCTCGC GCGCCGTCGA GGTCGGCCGC CTGCGCGGCG AACGGGCCAC CCTGCCCGCC GCGCTGGGCT TCCTGGTCTT CTCCGAGTTC TGGAGCGGGC GCTTCCCCTC GGCCGCGGGG ACCGCCCTGA CCGGCCTGCG GGTCTCCCGC GAGACCGGCC AGACCGTGTG GGCCACCCAG CACCTGGCGT CCCTGGCGAT GATCGCCGCC ATCCAGGGAG ACGTGGACAC CTGCCGCATC CGGGCGCGCG CGGTCGCCGC CCAGGCGGGG GAGAACAGCC TCGGCCTCGC CGCCGCCCTG TCGGCGTGGG CACTGGCCGT CCTGGAGCTG TCCCGGGGCA ACGCCGCGGA GGCGTTCTTC CGGCTGCGCG CCCTGGTCCA CGCCGCCCCA GGGCACGGCC ACCCCACCAT GCGGCTGCTC ACCGCCCCGC ACTTCGTGGA GGCGGCGACC CGCATGGGCG AGACCGAGTG GGCGCGCACC TCCCTGGCCG GGTACCGGCG CTGGGCCGAG TCGGTGGGCA GCCCGAGCAC GCTGGCGCTG GCCGCCCGGG GCTCGGGCCT GCTGGCCGCG GGCGACGAGG CATGCGACCA CTTCGAGAAC GCGCTGGCCC TGCACCGGGC CTGCGGCGAC GACGACGTCG AGCACGCGCG CACCCAGCTG CTGTTCGGCG CCCACCTGCG CCGGGCCCGC CTGCCCGGCC GGGCCCGCGA GCACCTGTAC AACGCGCTGG AGTCCTTCGA ACGCTTCGGG GCGCGGCTGT GGGTGCGCCA GACCCGCGCC GAGCTGCGCG CGATCGGGAC CGCCGAACGC GGCCCCGACC CCGTCTCCAC CAGCGAGCTG ACCGCGCAAC AGCAGCAGAT CGCCCGGCTG GTCGCCGAGG GGGCCACCAA CCGCGAGGTG GCCGCCCACA TGTTCATCAG CCCGCGCACG GTCGAGCACC ACCTGCGCGG CATCTTCCGC AAGCTCAACA TCAGGTCCCG CGTGGACCTG GCCCGCCTGT TCAACTGA
|
Protein sequence | MPCPRTRPVS TASTPTQARH ARRVRLRVSI GETTSVVART CRVGGPLCEC VHVTGRNTQP PVTRLRGRDT EQRALSDALD DARSRRGASL LLSGGPGRGK TALLEHLSGS AGGFTVLRAD GVADEADLPL AGLQRLLHPL AEESERLPEP RRGLLRDALT RGAVADADRL ALYTGLVELL SRAAADRPLL LCVDDADRLD APSLDALAFV ARRLAGTPVA AVLTAREGRG KPAGARVPEA DAEPPPDALV PGVTELPLAP LEERAVHDIL TDRAPVTPAS AVRSALVRAA HGNPAAVLGL LRGLSRAQLL GEEPLPAPPL LPGRLRAGFL APYRDLPERT RRLLLLAALG DEPRVHRLLE ACEEPGPNTT GPGPTVTDLE PAEERGLVRV EGDTVVFTDP LAREAIAQDA PAGRLRAAHR ALARACDPEL SPAEFVRHTA SGADAPDAGL AEAATAAARR VKRIEGRLAA SHAYERAADL CPDPDERACR LNTASYEAYM AGSSARATRL LARARPLAVT DRRRATSDLI DAQIAMRGEN AMDVAERLLT VGRELIPHDR FLALRALVRS ADAASLAGDA VRHGRAAELA LPLVGPDDPA PMRMVASFLE GCAVSFRGDY PGSTPLLREA TGLAAIAKPS ELVWAGISGL RLGDAPFVRS VTSRAVEVGR LRGERATLPA ALGFLVFSEF WSGRFPSAAG TALTGLRVSR ETGQTVWATQ HLASLAMIAA IQGDVDTCRI RARAVAAQAG ENSLGLAAAL SAWALAVLEL SRGNAAEAFF RLRALVHAAP GHGHPTMRLL TAPHFVEAAT RMGETEWART SLAGYRRWAE SVGSPSTLAL AARGSGLLAA GDEACDHFEN ALALHRACGD DDVEHARTQL LFGAHLRRAR LPGRAREHLY NALESFERFG ARLWVRQTRA ELRAIGTAER GPDPVSTSEL TAQQQQIARL VAEGATNREV AAHMFISPRT VEHHLRGIFR KLNIRSRVDL ARLFN
|
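The gene sequence is printed in space-separated blocks of ten bases, so whitespace has to be stripped before any base counting. A small sketch of how a GC fraction could be computed from such text follows. The function name `gc_content` is hypothetical, and this is not necessarily how the GC content row in the record was derived.

```python
def gc_content(sequence: str) -> float:
    """Return the fraction of G and C bases in a DNA sequence.

    Whitespace (spaces, newlines) between blocks is ignored.
    """
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return gc / len(bases)


# Illustration on the first two 10-base blocks of the gene sequence only.
first_blocks = "ATGCCCTGCC CGCGCACCCG"
print(f"{gc_content(first_blocks):.2f}")
```

Passing the full Gene sequence field to the same function gives a value that can be compared with the reported GC content.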