Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2421 |
Symbol | |
ID | 9246271 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2870399 |
End bp | 2873218 |
Gene Length | 2820 bp |
Protein Length | 939 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | transcriptional regulator, LuxR family |
Protein accession | YP_003680347 |
Protein GI | 297561373 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.924633 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.0586657 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGAGC CCCGGACGCG GCGCGGCGGG ATCGTGGGCA GGACCGAACC GCTCACGAGG CTGCTGTCGG CCCGTCAACA CAGCCGCCGT ACGGGGCTCA CCTGTCACGT GGTGACGGGA GAGCCCAGGG TGGGGCGGAC CACCCTGCTC TCCCTGGTGT GCCGCCCCAC ACCGTCCCGC GCCGGTCCCG CGGTGCACAT CGCGTGCTCC CGGCACGTGA ACCGCCTGGT CACCGCGCTC ACGGACGCCC TGGCCCCGAC CGCCGACGAA CACCCCGCCG AGGAGGGGGA GGGGGGTTCC ACCGTGTCCC CGCGCGCCCG CGCCGGGACC GCGCACGCCC GGGGAGCGGG AGGCGGACCG GCCGGCCCCC GGGAGCTGGT GGAGGTGCTG ACCCGGGACG CCCCCCTGGT GATCGCGCTG GACGACGTCG ACCAGGCGGG GCCCGAGTCC CTGGTCCGGT TGCGGGGCGT CCTGCGGGAC GTCGCCCACC TTCCCGTGAC GCTCGTCGCC TCGATCCGGA GCGGAGAACC GGCCGCGGCT CCGACCGAAC TCGCCGACCT GCTGTCCGGC GCCCGCACCA TCACCCTGCG CGGGCTGTCG GAGGAGGAGA CCGGCGCGCT GATGCACGAG CGGCTGGGCC ACCGGCTCGA CGCGGACCTC GTCACGGCGT CCCACGAGAC CACCGCCGGC AACCCGTTCC TGACGCGGGC CCTGTGCGAC TGGATCCGGG CCCGCGAATC CCCGGTGCGC TCACCGGCCG AGCTGCGAAG CGCCGTGCTC CCCTCCGTGG CCGACGCCAT GATCGGCCGG GCGAACCGCT TCGACCCGCG GGCCCGGGCG GTCGCCGAGG CGGTCGTCGT CGCCTCGGCC TCCGGGGAGG CCGACCCCGC TCTGGTCGCG CACCTCAGCG GAACCCGGCT CGCGGAAACG CTGGCCGCGC TCGACCTCCT GGCTCGGATG CGCCTGGTCA CCGACGACCA CGCGGTGACG CTGCGCCACC CGTTGCTCCA CACCGCGCTG CTCGCCTCCA TGACCGTGAT GAGCCGCAAC GCGGCGCACC TGGCCGCCGC CGCCTTCCTG CACCGCAGAC CCGGCGCGGA GCGGCGGGTG GCCCGTCACC TCGCGGAGTC GACGGTGCCC CTGGACGCCC CGTGGTCGTC CACCGCCCTG ATCACGGCCG CACGGCTCCC CGACACGGCC GCGCGGGACC GCGTGCGCTA CCTGGAGCAG GCCGTCCAGG CCGGGGGAAC GGGCGCGTGG CCCGGTGTCG CCCCCGAACT GGCGGCCGCC CGGATCGCAC TCGACCGACA GGGCGGCCTG CGCGCCGCGG TGGAGGCACT CGGTCGCACC ACCGACGTCG CGGTCCGTCG CCGCCTGCTC GGCCTGATCG GCGCGACGCT CTGCGAGGGC GAACGCGGCG ACGACGCGTC CGCCGTCCTG CGGACGGTGC GGGAGGCCGT CGCCGGAACG GAGCGGGAGG ACTGGCCGCG GACCTTCCTC ACCCACGGGC GGTCCCTGAC GCCCGAGCCG GCGGCCGCCC GTCCGGTCGA AGGGGCTCCC GTCCCCGACG ACGTGCCGCG CCCTCCCGCC GTCACGGCCA TCGACGCCTT CTCCTCCTAC CTCCTGGGCG GGCCGCCGTC GGTCGCGCTC GCGGACGTGA GGCAGGCCCT GGACCACGAC CTCGACGACC TCCTGCTCCA ACCGCCCGCC CTGCCCGCCG CCCTGTCGGT GCTGGTCGGT TGTGGTCACC AGGCGGAGGC GTCCACGCGT CGGCGCTCCC TCGTCGCCGA TCCGGACCGG CTGCCCCGCT GGGTGGGCAC GGCGGTCCGG CTCACCGAGG CGACGGGATC CTACGCCTCG GGCGACCTGT CCGCGGCCCG GCACACGCTC ACGGAGCAGC TCTCGGAACT GCCCTCCCGC GGTGGTCACG GCTACAGCGG TCTCCGGACC CGCCTGGTCG GCCTCCTGGC CAACGTCCAC CTGGACCTGG GCGACCCGGA CGCGGCGGAG GCGCTGTTGC GCCGACACCA CCACGACGGC CACCCGCAGA CCGCGTGGTA CGACGCCGAC GTGCCCCTCG CCCGAGCGCG GCTGAGGATC AGGGCGGGCG CCCTGTCCCG CGGCGTCGAG GACCTGTTGG AGGTCATCCG ACGCCGCGAC GCGGCGGGGG TCCGGGGGCC GGGCACGCTC TGCTGGCGGA GCGAGGGCGC CCTCCTCCTG GCCAGGGCGG GCGCCCGTGA CGAGGCGGTG CGCGACGCCC GCCGACAGAT GGAGTTCGCG GAGGCGACCG GTTCGCCGCA GGAGCGGGCC CGCGCGCTGC GGGTGTGGGG CGCGCTCGCC GAGGAGCCCG CTTCGGCGGA GGCGCTGAGC GCGGCGGTCG ACCTGCTCCG GGGCACCGGG CACGACCTCG AAACCGCACG GACCACGGCG GAGCTGGGCA CGGTGCTGGC GCGGATGGGC CGCCACGGGG AGGCCGTGGC GGCCCTGAGC CGTTCGGCCG GCCTGGCCGC CAGCCGGGGC GCGCGTGACC TCGCCGACCG GGTACGGCTC CAGCTGGTGG CCTTGGACGC CTGTCGTGCC TCACACGACG TCTCCGTGCG GGGCATCCTC GCGCTGACCC CGCGCGAACG GCAGATCCTC ATCGACGCGC TGCTGGGCCA GGCCAACAAG ACGATCGCCG GACGTCGGCA CATCACCCGT CGCACGGTGG AACTGCACCT GTCGAGCGCC TACCGCAAGC TGGGCATCTC CGGGAGGGGA GAGTTCGGCA AGATCCTCGG CAGCCCCGGA AGGTGGGAGA TCCTCGTCGG CGGGGAGTGA
|
Protein sequence | MPEPRTRRGG IVGRTEPLTR LLSARQHSRR TGLTCHVVTG EPRVGRTTLL SLVCRPTPSR AGPAVHIACS RHVNRLVTAL TDALAPTADE HPAEEGEGGS TVSPRARAGT AHARGAGGGP AGPRELVEVL TRDAPLVIAL DDVDQAGPES LVRLRGVLRD VAHLPVTLVA SIRSGEPAAA PTELADLLSG ARTITLRGLS EEETGALMHE RLGHRLDADL VTASHETTAG NPFLTRALCD WIRARESPVR SPAELRSAVL PSVADAMIGR ANRFDPRARA VAEAVVVASA SGEADPALVA HLSGTRLAET LAALDLLARM RLVTDDHAVT LRHPLLHTAL LASMTVMSRN AAHLAAAAFL HRRPGAERRV ARHLAESTVP LDAPWSSTAL ITAARLPDTA ARDRVRYLEQ AVQAGGTGAW PGVAPELAAA RIALDRQGGL RAAVEALGRT TDVAVRRRLL GLIGATLCEG ERGDDASAVL RTVREAVAGT EREDWPRTFL THGRSLTPEP AAARPVEGAP VPDDVPRPPA VTAIDAFSSY LLGGPPSVAL ADVRQALDHD LDDLLLQPPA LPAALSVLVG CGHQAEASTR RRSLVADPDR LPRWVGTAVR LTEATGSYAS GDLSAARHTL TEQLSELPSR GGHGYSGLRT RLVGLLANVH LDLGDPDAAE ALLRRHHHDG HPQTAWYDAD VPLARARLRI RAGALSRGVE DLLEVIRRRD AAGVRGPGTL CWRSEGALLL ARAGARDEAV RDARRQMEFA EATGSPQERA RALRVWGALA EEPASAEALS AAVDLLRGTG HDLETARTTA ELGTVLARMG RHGEAVAALS RSAGLAASRG ARDLADRVRL QLVALDACRA SHDVSVRGIL ALTPRERQIL IDALLGQANK TIAGRRHITR RTVELHLSSA YRKLGISGRG EFGKILGSPG RWEILVGGE
|
| |