Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_4680 |
Symbol | |
ID | 8450310 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 5201445 |
End bp | 5202440 |
Gene Length | 996 bp |
Protein Length | 331 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645043721 |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003203946 |
Protein GI | 258654790 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 59 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 37 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGCTCC GCGACGTGGC CGAACGGGCC GGCGTCTCCC GGACCACCGC GTCATTCGTG CTCAGCGGCC GGCGGGACAT GCGCATCTCG GCCGACGCCG AACGCCGGGT CCGGCAGGCG GCCCGCGAGC TGAACTACCG GCCGAGCCTG CTGGCCCGCA GCCTGCGCAC CAACCTGTCG CAAACCATCG GCCTGCTGTC GGACCAGATC GCCAGCGAGA TGTTCGCCGG CGAGGTCGTC CGCGCCGCCC TGGCCACCGC GGTCCGCCAC GAGCACCTGC TGTTCGTCGG GGAGACCGGC GGCGATCCGG AGGTCGAGCG CAACATCATC CAGGGCATGA TCGACCGCGG CGTCGGGGGC TTCGTCTACG CGTCGATGTA CACCCAGGTG GTCCGGGTCC CGACCATCCT GCGCAAGCTG CCGGTGGTGC TGCTGAACTG CTCCACCCGC GGCAAGCGGT TGACCAACGT GGTGCCGGAC GAGGTCGAGG CCGGCCGCCT GGTCGCCGGC GAACTGCTGG CTCACGGGCA CCGCGACCGG ATCGTGCTGG TCGGCGAGCG CACGACCCAG GCGGTGGCCG GGCACGACCG GCTGACCGGG ATCCGCGAGG CGTTGGCGGC GGCCGGCGCC GAGCTGGCCG GCCTGGTCGA CTGCCTCTGG TGGCCCGACC ACGCCTACCG GGCGGTGACC GAGTACCTGG CCCGGGGCCA CCGGCCCTCG GCGTTCATCT GCCTCAACGA CCGCATCGCC TTCGGCGCCT ACCAGGCGAT CAGCGACTTC GGGTGGTCCG TGCCGCAGGA CGTCTCGGTG ATCTCGTTCG ACGACTCCGA CCTGGCCAGC TGGCTGCGGC CGGCCCTGAC CAGCGCGGCC ATCCCCTATT TCGAGATGGG GCGGCGCTCG GTGGAGCTGC TGATGGCCCC GCCGGCCGAC CCCGAGGTGC ACCGGGTCCC GATGGAACTG ACCCGACGGG TCTCGGTGGC CCCGCCGGCC GGGTGA
|
Protein sequence | MTLRDVAERA GVSRTTASFV LSGRRDMRIS ADAERRVRQA ARELNYRPSL LARSLRTNLS QTIGLLSDQI ASEMFAGEVV RAALATAVRH EHLLFVGETG GDPEVERNII QGMIDRGVGG FVYASMYTQV VRVPTILRKL PVVLLNCSTR GKRLTNVVPD EVEAGRLVAG ELLAHGHRDR IVLVGERTTQ AVAGHDRLTG IREALAAAGA ELAGLVDCLW WPDHAYRAVT EYLARGHRPS AFICLNDRIA FGAYQAISDF GWSVPQDVSV ISFDDSDLAS WLRPALTSAA IPYFEMGRRS VELLMAPPAD PEVHRVPMEL TRRVSVAPPA G
|
| |