Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_2515 |
Symbol | |
ID | 9246365 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2982820 |
End bp | 2983893 |
Gene Length | 1074 bp |
Protein Length | 357 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003680440 |
Protein GI | 297561466 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 0.185479 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 7 |
Fosmid unclonability p-value | 0.0043298 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | GTGACGCACG TCAGGAGGGC GGGGCCGCGA CGCCCGCGCC AGTCCGACAT CGCGCGGGTG GCCGGGGTCT CCCAGGCCAC GGTCTCGCTG GTGCTGCGGG GCACTCCGAC CGGGATGTCC CTGGCCCGCG AGACCCGCCA GAGGGTGCTC GACGCGGCCG AGGAGCTGGG CTACGTTCCC GACCCGGTCG CCACCCGGTT GGCCTCGGCC AGCAACGCGA TGCTGGGGCT GTACACCTTC AGCGCCACCT TCCCCACCGA CGTGGCGCAC TCCTACTACC CGGTCCTGGT CGGGGTGGAG GAGGAGGCCG CGGCCCAGGG GCAGGACCTC ATCCTGTTCA CCGGGTCCGC GCGGGCCGCC GCCAGCGCCG ACGACCCGGC CGCGGTGCGG CGGGTGCGCG TGGCCGACGG CTGCCTGTTC TTCGGCCGCC ACGTGCCCGA CGAGCCCATC CGGCGTCTGG TCGAGGACGG CTTCCCGTTC GTCTACATCG GCCGCCGCGA CGAGCCCGGC GTGCCCTACG TGGGCGCCGA CTACGTCTCG GCCTCGGCGC AGGTGGTGGC GCGTCTGGCC GGTCTGGGCC ACCGCGAGCT GCGCTACCTG CGCGAGCACG ACCAGGCCCC GGCCTCGACC GACCGCGAGC GGGGTGTGCT GCGGGGCGCC CGCGAGGCCG GGATCGACAC CGCGCGCCTG GTGGTGCGCA CCGACGGCTC CGACCTGGAC GGACTGCTGC GGCGCTGGCT GGAGGAGGGC GTGACCGCCG TCGTGGTCGA GCAGACCGAC ACCGGCGCCG CCCTGGACGG CCTGGCCGCC GCCGTGGACC GGGCCGGTCT GCGCTGCCCG GAGGACATCT CCCTGGCCCT GCTGGGGGCG CCGACCGGGA CCCGGGGCCG CGCTCCGGGC GGCGGCGCGG CCTACGGAGG GTTCGACGCG CCCCTGCGCG CGATGGGCCG CGACGCGGTC CGTCTGCTGC TGGAACTGAT CGCGGGCGCC CACGGGCAGA CGCCCCGGCG CCTGCTGCCC TGCGAGCCGG TGGAGGGCGG CACCCTCGCC GCGCCCCGGA CACGACACAT CTGA
|
Protein sequence | MTHVRRAGPR RPRQSDIARV AGVSQATVSL VLRGTPTGMS LARETRQRVL DAAEELGYVP DPVATRLASA SNAMLGLYTF SATFPTDVAH SYYPVLVGVE EEAAAQGQDL ILFTGSARAA ASADDPAAVR RVRVADGCLF FGRHVPDEPI RRLVEDGFPF VYIGRRDEPG VPYVGADYVS ASAQVVARLA GLGHRELRYL REHDQAPAST DRERGVLRGA REAGIDTARL VVRTDGSDLD GLLRRWLEEG VTAVVVEQTD TGAALDGLAA AVDRAGLRCP EDISLALLGA PTGTRGRAPG GGAAYGGFDA PLRAMGRDAV RLLLELIAGA HGQTPRRLLP CEPVEGGTLA APRTRHI
|
| |