Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Namu_2031 |
Symbol | |
ID | 8447640 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | - |
Start bp | 2242101 |
End bp | 2243102 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 645041157 |
Product | transcriptional regulator, LysR family |
Protein accession | YP_003201403 |
Protein GI | 258652247 |
COG category | [K] Transcription |
COG ID | [COG0583] Transcriptional regulator |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.0205829 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.00632018 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGCGTAAGA TGCACTGCAT GATTGATGTC CGCAAGCTGG AGATCCTGCG GGAGCTCGAT CGGTGCGGCA CCATCGCCGC CACCGCCGCC GCCGTCCACC TCACCCCGTC CGCGGTGTCC CAACAGCTGG CCGCGTTGTC CAAGGAGGCC GGCACGGCCA TGCTGGAACC GGACGGGCGC CGGGTCCGGC TTACCGAGGC CGCCCAACTG CTGCTGCAGC ACGCGCACCA GATCTTCACC CACCTGGAAC ACGCCGAGTC GGACCTGGCC GCCTTCCGGC GGGGGGACGC CGGCACGGTC CGGGTGGGCA CGTTCAGCTC CGCGGTGAAA GCCCTGGCGG TGCCGCTGGT GTCCGACCTG TCCACCCGGA CCCGAATCCG GGTGGAACTG CGCGAGGTGC AGCCGGAGGA TGCGCTGGAC GCCCTGCTGG GGCGGCGGGT GGACATCTGC ATGAACCTGG CCACCACCGA ATTGTTGCCC GGCTCGGACG ACAAACGGGT CCACTCCGAG CACCTGCTCG ACGACGTGAT GGACGTCGCC CTGCCGTTCG ATCACCCGCT GGCCGATCGC GCCGAGATCG AGCTGGCCGA TTTGGCCGAC GAGGACTGGA TCCTGGCCAA CCCCGGGGTG CCGTGCTGGC AGCTGAGCCG GGACGCCTGC GAACGGGCCG GGTTCTCCCC GCGCGCCCGC CACTACGCCG ACGAATTCGT CGGCGTGGTC GGGCTGGTCG CGGCCGGCCA CGGGGTCAGC CTGCTCCCCC GGCTCGCCCA ACCCGAGGCC GTGCACGAAC CGATCGTGCT GCGACCCGTC GCCGGGGTCA GCCCGGTTCG CCGGATCAGC GTGCAGACCC GGGCCGGCAC CGCCGACCAG CCGCACATCG CGCCCGCCCT GGAGTCCCTG CGCCGGGTCG CCGCCGGCGT GGCCCGGGGC CCGCTGGCCT GTCGCAGCGT GACCCGGGGC CCGGCCCCGA TCGCCGATCC GGCCCCGGCG TTGGTGAGCT GA
|
Protein sequence | MRKMHCMIDV RKLEILRELD RCGTIAATAA AVHLTPSAVS QQLAALSKEA GTAMLEPDGR RVRLTEAAQL LLQHAHQIFT HLEHAESDLA AFRRGDAGTV RVGTFSSAVK ALAVPLVSDL STRTRIRVEL REVQPEDALD ALLGRRVDIC MNLATTELLP GSDDKRVHSE HLLDDVMDVA LPFDHPLADR AEIELADLAD EDWILANPGV PCWQLSRDAC ERAGFSPRAR HYADEFVGVV GLVAAGHGVS LLPRLAQPEA VHEPIVLRPV AGVSPVRRIS VQTRAGTADQ PHIAPALESL RRVAAGVARG PLACRSVTRG PAPIADPAPA LVS
|
| |