Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Ndas_1742 |
Symbol | |
ID | 9245592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | - |
Start bp | 2119994 |
End bp | 2121286 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | putative transcriptional regulator, GntR family |
Protein accession | YP_003679676 |
Protein GI | 297560702 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.554811 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGACCTGG CGCTCTCCGA CATGCACGCA TCACTTGACT CACCGACCAC GGAGTCGATG AATTTCCTGA ACGAGATCGG GAACGTCTAC CCCGACGCCA TCTCGTTCGC CGCGGGGCAG CCGTTCGAGG GCTTCTTCGA CCTCGACGCC GTCCACCACT ACCTGGACGC GTTCCGCGCC CACCTCGCCG AGGAGCGGGG GCAGAGCGGG GAGCAGGTCC GGCGCACGCT GCTCCAGTAC GGCAGCGCCA GGGGCATCGT CAACGACCTC ATCTGCCGCA ACCTGGAGAC GGACGAGGGT ATCCGGGTGG ACCCCCGTGC CGTCGTCGTC ACCTCCGGCT GCCAGGAGGC CCTCTTCCTG GTGCTGCGCG CCCTGCGCGG GGGCCCGTCG GACGTGGTGC TCGCGGTGCG GCCGAACTAC TCGGGGCTCG ACGCGGCGGC GCGCCTGGTG GAGATGGGGG TCCACCCGGT CCGGGAGCCG GCTTCGGGAA TCGACGGTGA GAGCCTCACC GAAGCCGCGG AGCAGGCACG CCGGGAAGGG CTCAACCCCC GCGCCTGCTA CGTGATCCCG GACTTCGCCA ACCCCACCGG CCGCAGCCTG TCGGTGGCCG CCCGGCGGAG CCTGCTGGAG AGCGCCGAGG AACAGGGGAT CCTTCTCATC GAGGACAACC CCTACGGCAT CTTCGGCCCC GAGGAGAGCG GCACCCCCAC CCTGAAGTCC CTGGACGCGT CGCGGTCCGT GGTCTACCTC GGTTCCTTCG CCAAGTCCGG TATCCCGGGA GCGAGGGTCG GCTACGTCGT CGCCGACCAG CGTGTGTCGG CGCACGAATC ATCGGACACC CTTTTCGCCG ACCACCTGGC CAAGGCCAAG GGAATGCTCA ACATCAACAC CTCCCCGATC ACTCAGGCGG TGATGGGGGG AAAGCTGATC ATGAACGGGT TCAGCCTCCG TTCGGCGAAC ACCCGGGAGA GGAACGTCTA CCAGGGCAAC CTGTCCCGCC TCCTCCAGGA GATGTCCCGG AGGTTTCCCG AAGGCGAGGG CCACGGCGTC AGTTGGAACA CCCCGTCGGG CGGATTCTTC CTGACCCTGA AGGTCCCCTT CCCAGCGAGT GACGAGGCAC TTGGCGTCTG TGCGCGGAAG CACGGTGTGC TGTGGACCCC CATGCACCAC TTCCACGGCG ACGGAATTCC ACGGAACGAG ATCAGGCTCT CCTTCAGCCA TCTCACCCAG GACAGGATCG CGCTCGGTGT CGAACGTTTC GCCTCGTTCG TCACCGACCA CGCCGGCAGC TGA
|
Protein sequence | MDLALSDMHA SLDSPTTESM NFLNEIGNVY PDAISFAAGQ PFEGFFDLDA VHHYLDAFRA HLAEERGQSG EQVRRTLLQY GSARGIVNDL ICRNLETDEG IRVDPRAVVV TSGCQEALFL VLRALRGGPS DVVLAVRPNY SGLDAAARLV EMGVHPVREP ASGIDGESLT EAAEQARREG LNPRACYVIP DFANPTGRSL SVAARRSLLE SAEEQGILLI EDNPYGIFGP EESGTPTLKS LDASRSVVYL GSFAKSGIPG ARVGYVVADQ RVSAHESSDT LFADHLAKAK GMLNINTSPI TQAVMGGKLI MNGFSLRSAN TRERNVYQGN LSRLLQEMSR RFPEGEGHGV SWNTPSGGFF LTLKVPFPAS DEALGVCARK HGVLWTPMHH FHGDGIPRNE IRLSFSHLTQ DRIALGVERF ASFVTDHAGS
|
| |