Gene Information |
Locus tag | Ndas_4294 |
Symbol | |
ID | 9248168 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nocardiopsis dassonvillei subsp. dassonvillei DSM 43111 |
Kingdom | Bacteria |
Replicon accession | NC_014210 |
Strand | + |
Start bp | 5110296 |
End bp | 5111327 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | |
Product | transcriptional regulator, GntR family |
Protein accession | YP_003682189 |
Protein GI | 297563215 |
COG category | |
COG ID | |
TIGRFAM ID | |
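The coordinate and length fields above agree with one another. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a CDS whose final codon is a stop codon; neither convention is stated in the record itself, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Start bp and End bp as listed in the record.
length = gene_length(5110296, 5111327)
print(length, protein_length(length))  # prints "1032 343"
```

Under these assumptions the result matches the listed Gene Length (1032 bp) and Protein Length (343 aa).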
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 0.806766 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGGTGTCCA CGACCGGAAA AAAGGAACTG CCGCCCTACG CGCGGGTCGT CACCGACATC CGCGCGCGCA TCGGCTCCGG GGAGTTGCGA CCGGGCGAAC GGGTGCCCTC CACCCGGGAG ATCATGCGCG AGTGGGGGGT GGCCATGGCC ACCGCCACCA AGGCGCTGGC CGCCCTGCGC CAGGAGGGCC TGGTCGAGGC GGTGCGCGGA GTGGGCACCC TCGTGCGCGG TGCACCCGCC TCCGCACCGG AGCCGCAGGG ACCACAGCGC CAGCGGGAAC GCCCGCGTCC GGCGCGGACG GAAGACCCGG GAAGCCACCG TCCGGCCGCC GAGACCGGCG GCCTCGCCCG GGAGGCCATC GTCCGGGCCG CGATCACCAT CGCCGACGCC GAGGGGATCG ACGGCCTGTC CATGCGCAGG GTCGCCACCC AGCTGGGGGT GAGCACCATG GCCCTGTACC GCCACGTCGC GAACAAGGAC GCGCTGGTGA CGGCGATGAT CGACCAGGTC TACACCGAGC ACGCCCTGCC CGACCCGCCG CCCGCCGACT GGCGCGAGGC GCTCGAACTG GCCCTGCTGA CGGAGTGGGG CATCTACCGG GCGCACCCCT GGGCCGTCCA GCTCACTCCG CTCGCCGGAG CGGTTCAGTC GCCCGGGCTG GTGCAGAACG CCGAGTGGAT GATGCGGGTG ATCACCGGCC AGGGGCGCTC GCCGGACGAG GCCATGGCGA TCCTCACCTT CGTGTCCGCC TACACCTCCG GCATGGCCCT CCAGGGCACG CGCGCGGTGG TGGAGGGGTA CGAGGCCGGG ATGGACGCCG AGCACTGGTG GAGGTCCCGG GGCGAGGAGT TCCTGCGGAT CGCCGAGCAG GGCAGGTTCC CCCTGACGTT CAGCGTCTCG GGGCCGACCG ACGTGCACGC GATCTTCGGC CTCGGCATGA AACACCTGCT GGACGGGCTC GCGCCGCTGA TCGAACCGGG AGGCCGACCC GTGGACGGGG GCCTCGCGGG CCCCACGGAC ACAACCCGGT GA
|
Protein sequence | MVSTTGKKEL PPYARVVTDI RARIGSGELR PGERVPSTRE IMREWGVAMA TATKALAALR QEGLVEAVRG VGTLVRGAPA SAPEPQGPQR QRERPRPART EDPGSHRPAA ETGGLAREAI VRAAITIADA EGIDGLSMRR VATQLGVSTM ALYRHVANKD ALVTAMIDQV YTEHALPDPP PADWREALEL ALLTEWGIYR AHPWAVQLTP LAGAVQSPGL VQNAEWMMRV ITGQGRSPDE AMAILTFVSA YTSGMALQGT RAVVEGYEAG MDAEHWWRSR GEEFLRIAEQ GRFPLTFSVS GPTDVHAIFG LGMKHLLDGL APLIEPGGRP VDGGLAGPTD TTR
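The sequences above are displayed in space-separated blocks of ten characters, so the spaces need to be removed before computing a length or base composition. The sketch below shows one way to do that; the helper names are hypothetical, and the input is only the first two blocks of the gene sequence, not the full record.

```python
def clean_sequence(field: str) -> str:
    """Remove the display whitespace from a sequence field and upper-case it."""
    return "".join(field.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two ten-base blocks of the gene sequence, for illustration only.
first_blocks = "GTGGTGTCCA CGACCGGAAA"
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 2))  # prints "20 0.6"
```

Applied to the full gene sequence, the same two helpers could be used to check the listed Gene Length and GC content. The gene begins with GTG, which translation table 11 accepts as a start codon read as methionine, in line with the leading M of the protein sequence.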