Gene Information |
Locus tag | Namu_3975 |
Symbol | |
ID | 8449594 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 4388138 |
End bp | 4389127 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 645043020 |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003203256 |
Protein GI | 258654100 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.00653549 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.0330479 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACGA TGCGGGAGGT TGCCGCGCGC GCGGGAGTCA GCGTCAAGAC GGTTTCCCGG GTGTTCAATG ACGATCCGCA TGTGCTGCCG GACACGCGGG AACGGGTGCA GGCCGCGCTG CGGGAGCTGA GCTACGTACC CAACGACCTG CCGTCCACGT TGCGGGCCGG CCGCGCGCCG GTGATCGGCA TTGCCGTGCC GGACATCGTC GATCCGTTCT TCGCGTCGAT CACCCACTCG ATCCAGTCGC TGGCCTTCGA GCGGGGCATG TTCACGTTGG TGACCAGCCT GGGGGTGGAC CCGGAGCTGG AGAAGCCGAT GGTCGAGGGG CTGCTCAAGC GCGGGTTGAG CGGGCTGGTG CTGGCGCCGA TCACGGCCGA CCAGTCGTAC CTGTCGGTGT GGTCCACCCG CCTGCCGGTG GTGTTCGTGG ACCGGCAACC GGCCAAGTTG GCCGCGGATT CCTTCACCGA GGACGACGAG GGCGGCGCGG ATGCGGCGAC CACCCATCTG ATCGAGCACG GGCACCGGCG GATTGCGTTC CTGGGGGAGC AGCCGATCCT GCGCACCGCG GCCGGGCGGC TGGCCGGGTA CCGGGCCGCG CTGGAGCGGG CCGGTCTGCC CGAGAGCCCG GAGCTGGTGA TCCTGGGGGT GGGCGACCGT CCGGGGGCTC GCGATGCCCT GGCGGCGTTG CGCGACCTGC CGGAGCCGCC GACCGCCGTC TTCTCCTCCA ACGCCCGCTG CACGATGGCG CTGTTGCCGG CGTTGCGGCC GCACGAGTTC GCCCTGGTCG GGTTCGGCGA CTTCCCGATG GCCGACGTGG TGACGCCCGC GGTCTCGGTG ATCGACCAGG ACCCCTACCG GCTCGGTCGA CTGGCCGCCG AACGAATCTT CGACCGGCTC GACGCCCCCG ACCGGCGCTA CCGGCGGCGC ACCGTCGTCC CGGTGCGCCT GATCGAACGG GCGTCCTGCT CGGCGGCCCC GGCCGGCTGA
|
Protein sequence | MSTMREVAAR AGVSVKTVSR VFNDDPHVLP DTRERVQAAL RELSYVPNDL PSTLRAGRAP VIGIAVPDIV DPFFASITHS IQSLAFERGM FTLVTSLGVD PELEKPMVEG LLKRGLSGLV LAPITADQSY LSVWSTRLPV VFVDRQPAKL AADSFTEDDE GGADAATTHL IEHGHRRIAF LGEQPILRTA AGRLAGYRAA LERAGLPESP ELVILGVGDR PGARDALAAL RDLPEPPTAV FSSNARCTMA LLPALRPHEF ALVGFGDFPM ADVVTPAVSV IDQDPYRLGR LAAERIFDRL DAPDRRYRRR TVVPVRLIER ASCSAAPAG
|
| |
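The coordinate and length fields in this record are mutually consistent, and the relationship can be checked with a little arithmetic. The sketch below is illustrative only: it assumes 1-based inclusive coordinates and that the gene length includes the stop codon (both are assumptions, though they agree with the values listed here). The function names `gene_length`, `protein_length`, and `gc_fraction` are hypothetical helpers and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS: number of codons minus the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases; spaces between 10-base blocks are ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(b in "GC" for b in bases) / len(bases)


if __name__ == "__main__":
    # Start bp and End bp as listed in the record above.
    length = gene_length(4388138, 4389127)
    print(length, protein_length(length))  # prints: 990 329
```

Passing the full gene sequence string to `gc_fraction` would give a value to compare against the listed GC content. The strand field does not affect these calculations, since the sequence is already shown in coding orientation (it begins with ATG and ends with TGA).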