Gene Information |
Locus tag | Ndas_0926 |
Symbol | |
ID | 9244771 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 1135613 |
End bp | 1138612 |
Gene Length | 3000 bp |
Protein Length | 999 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | |
Product | transcriptional regulator, LuxR family |
Protein accession | YP_003678876 |
Protein GI | 297559902 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
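The coordinate and length fields above are related by simple arithmetic. The sketch below shows one way to cross-check them. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length_bp(1135613, 1138612)  # Start bp / End bp from the record
    print(length, protein_length_aa(length))
```

With the values in this record, the check reproduces the listed Gene Length (3000 bp) and Protein Length (999 aa).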
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.0223661 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCCGTTC TTGCCGAGAG TCGTACAGTT TTCGTGGGCC GCGAGCGCGA ACTCCGCCTC CTCCGGGACC ACGCCAGACG CTCCCACACC GAGGCGTCCG GAACGGTACT GGTCAGCGGC GACGCGGGCG TGGGCAAGAG CCGCCTGGTC GGCGAGTTCG TCTCGGCCCT GCCCCAGGGA ACGGTCTTCG TCGGCGGCTG CCTCCAGCTC GGCGTGGACG GACTGTCCTA CGCCCCCTTC ACCGCCGTTC TGCGCCAACT CCTGCGCGAG CGCGGACGCG CGGCGTTCGA GGCCGCGGCG CCCGGCGGCA CCGGCGAGTT CGCCCGGCTC CTGCCCGAGC TGGGCGAGGT GCCCGTGCTG CGCCCGGAGA ACCGGGGCAT CCTCTTCGAA CAGGTCCTGC GCCTGTTCAC CCAGGCCGCC GAGGACGGCG GCGTCACCGT GGTGCTGGAG GACCTGCACT GGGCCGACGG CGCCACCCGC GACCTGCTCG TCTTCCTCGT CCGCAACCTC GACCTGCCTG GCGTGCAGCT CGTGGCCACC TACCGCAGCG ACGACCTGCA CCGCACCCAT CCGCTCCGCC GCCTGCTGCC CGAACTGCGG CGCGCACCCG GGGTCGAGCC GCTGGAGCTG GCGCCGTTCA GCCGCGAGGA GGCCGGTGTC CAGGCCGCCG CGATACGGGG CGCCGACCTC ACCGGCCACG AACTGGACCA GCTGTACCGG CGCACCGAGG GCGTCCCGCT GTTCGTGGAG TCGCTGGCCT CCGCGGTCGG CGACCCCTCC GTCGGCGGCC ACGACGTGCC CGACCAGTTC CGCGACCTGC TGCTGGAACC GCTGCACCGC TTCGACGACA CCGCCCTGTC GGTGTTGCGC GTGGCCTCGG TCGGCGCGGT CTCGGGCAGC ATCGAGCACG AGATGCTCTA CCACGCGGCC GGGCTGCCCG AACGCGAACT GGAGACCGCG CTGCACACCC TGGTCGACGC CAACACCCTG CGCGCCGACC GAACCGGGTA CCGCTTCCGG CACGCCCTGC TGCGCGACGC CGTGCACAGC GACCTGCTGC CCGGCGCCCA CGCGCGGCTG CACATGCGCT TCGCCCAGCT CATCGACGAG TACCCCGACT CCGTTCCCTT CGACCGCAGG GCCGCCGAAC AGGCCCACCA CTACAACGCC GCACAGGAGC TGCCCAGCGC CCTCCAGGCC GCCTGGTGGG CCGCGGTGCG CGCGGGCGAC ACCCTGGCCT ACGGCGAGGA ACTGGACATG CTGGAGCGGG TCCTGGCCCT GTGGGACCGC GTCCCCGACG CACGGGAGCG GGTCCAGGGC CGGACCTGGG CCGAGGTGGC CAGCCTCGCG GCGGGGGCCG CCGTGGAGGC GGGCCGCGCC AGGCGCGCCC TCGAACTGGC CGACGAGGCC CTGGCCGCCC TCCCCGAGGA CGGGGTCGAC GACCACACGC TGACCGTGCG GGCGGGGCTG CTGCGCCGCC GCGGGTTGGC GCGCGCCGCG GACTCCTGCG GCAGCGGGAT CACCGACCTG GTCAAGGCCT TGGAGCTGCA CCCGCCGCAC ATGCCGGGGT ACGGCCTGCT GCTGTCCATC CTGGCCCGGG AGAGCATGGT GCACCGCGCC GACCGGCGCC ACACGCCGGA ACAGGAGCGG CTGCGGGAAC TGGAGAGGTC CGGCAGGTCG GCGCGCGCGC TGGCGGAGGA GGCCATCGCC CTCGCCGACC CCGCCGAGCA GAGCGGCATG TGCGCGGCCG CCGACGCCCG CATCACCCTG GGCGGCCTGC ACATGGACGC GGGCGACCTG 
GAGGGGGGAC GCCCCCTCAT CGAGGCCGCC ATCCGCTACG CCGCCGAGAT CCGCGACCCC GCCCTGGAGG CGCGCGGGGC GGGCAACCTG GGCCACTTCC TACGTGAGCT GGGGCACCAC GAGGAGGGCC TGGCCGTCCT GGAGGAGTCC CTGGCCCGGC ACGAGGCGAT GGGGTGGGCG GCCGTCCACA AGACGTTCAA CCACCAGAAC CGCGCGGAGA TCCACTTCGA GCTGGGGGAC CTGGCCAAGG CCCGCGGGAT CCTCGAAACG GTCCTGCGCT CCCACCCCTC CAGCAAGCAC CGCTTCTACG TCGACGCGGT GCTGGCCCGC ACGGCGGCGG CGCAGGGGGA CCCGGAGGCG GCGCGCCGGG CCATGAGGGT GTCGGGGCGG GCGGACGCGC TGGCCTCGCA CCGGATGAAC ATCGTGCAGC TGTCCCTGCT GGCGCTGCTG GAGGCGGACC TGGTCGCGGG GGACGTGGAC GACGCGCTCG TCCTGGCCGA GCGGACGCTG GAGCGGCTGG TCCTGGAGTC CGCGCACGGG TACACGTGGC CGATGGCGGA GGCGATGGCC GAGGCCGCCC GGAGGGGCGC GGCGCCGGAC CGCCCGGAGG GGACGGCGGA GCGGGCGCGC CGGGTGCGGG ACCTGGTGGC CGCGCTGGTG GCGCCGATGC CCGCCCACGG GACGGCGCAG CACGCCTACC GGGTGTCCGT CGGGGCGCAC CTCTCCGAGG CGGACGGCGC GGGACCGGAC GCGCTGCTGG AGCGGTGGCG CGAGTCGGTG GCGGCCTGGG AGGCCACGCC GATGCGGCTG CACCTGGCCC GGGCCCGGCT GCGGGCCGCC GAGGCGGCCG TGGCGGTCGG GCGGCGGGAG CGGGCCGTGA CGTGGGTGCG CCAGGCGCAC GCCACGGCAC GGGAGTGCGG AGCGGCGCCG CTGGCCGGCG CCGCCGCGGA CCTGGCGCGC AGGATGGGCT CCGGTCTGGA GGAGGACGCG GCCCCGCCCG CCGTCCCCGC CGGGCTGACG GCGCGCGAGA CGGAGGTGGC GCGGCTGCTG GTGGTGGGCA GCACCAACGC CCAGATCGCC GAGCGGCTGT TCATCACGGC CAAGACCGCG AGCGTGCACG TGTCCAACAT CCTGGCCAAG CTCGACGTGC CCAACCGGGC CGCGGCCGGG GTGCGGCTGC GGGAGCTGGG CCTGGCCTAG
|
Protein sequence | MPVLAESRTV FVGRERELRL LRDHARRSHT EASGTVLVSG DAGVGKSRLV GEFVSALPQG TVFVGGCLQL GVDGLSYAPF TAVLRQLLRE RGRAAFEAAA PGGTGEFARL LPELGEVPVL RPENRGILFE QVLRLFTQAA EDGGVTVVLE DLHWADGATR DLLVFLVRNL DLPGVQLVAT YRSDDLHRTH PLRRLLPELR RAPGVEPLEL APFSREEAGV QAAAIRGADL TGHELDQLYR RTEGVPLFVE SLASAVGDPS VGGHDVPDQF RDLLLEPLHR FDDTALSVLR VASVGAVSGS IEHEMLYHAA GLPERELETA LHTLVDANTL RADRTGYRFR HALLRDAVHS DLLPGAHARL HMRFAQLIDE YPDSVPFDRR AAEQAHHYNA AQELPSALQA AWWAAVRAGD TLAYGEELDM LERVLALWDR VPDARERVQG RTWAEVASLA AGAAVEAGRA RRALELADEA LAALPEDGVD DHTLTVRAGL LRRRGLARAA DSCGSGITDL VKALELHPPH MPGYGLLLSI LARESMVHRA DRRHTPEQER LRELERSGRS ARALAEEAIA LADPAEQSGM CAAADARITL GGLHMDAGDL EGGRPLIEAA IRYAAEIRDP ALEARGAGNL GHFLRELGHH EEGLAVLEES LARHEAMGWA AVHKTFNHQN RAEIHFELGD LAKARGILET VLRSHPSSKH RFYVDAVLAR TAAAQGDPEA ARRAMRVSGR ADALASHRMN IVQLSLLALL EADLVAGDVD DALVLAERTL ERLVLESAHG YTWPMAEAMA EAARRGAAPD RPEGTAERAR RVRDLVAALV APMPAHGTAQ HAYRVSVGAH LSEADGAGPD ALLERWRESV AAWEATPMRL HLARARLRAA EAAVAVGRRE RAVTWVRQAH ATARECGAAP LAGAAADLAR RMGSGLEEDA APPAVPAGLT ARETEVARLL VVGSTNAQIA ERLFITAKTA SVHVSNILAK LDVPNRAAAG VRLRELGLA
|
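The sequences above are displayed in space-separated blocks of ten characters. As a minimal sketch of working with them, the code below strips that formatting, computes GC content, and checks that a gene sequence has a whole number of codons and ends in a stop codon. The helper names are hypothetical. The stop codon set (TAA, TAG, TGA) is the one used by translation table 11.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


def ends_with_stop(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


if __name__ == "__main__":
    # First two display blocks of the gene sequence, used only as a small demo.
    demo = clean_sequence("ATGCCCGTTC TTGCCGAGAG")
    print(demo, gc_percent(demo))
```

To check the full record, paste the complete Gene sequence value into `clean_sequence` and compare the result of `gc_percent` with the listed GC content.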