Gene Information |
Locus tag | Namu_3846 |
Symbol | |
ID | 8449465 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nakamurella multipartita DSM 44233 |
Kingdom | Bacteria |
Replicon accession | NC_013235 |
Strand | + |
Start bp | 4217138 |
End bp | 4218184 |
Gene Length | 1047 bp |
Protein Length | 348 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 645042895 |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003203131 |
Protein GI | 258653975 |
COG category | [K] Transcription |
COG ID | [COG1609] Transcriptional regulators |
TIGRFAM ID | |
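The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the gene length counts the terminal stop codon. The record does not state either convention explicitly. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature from 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # The stop codon (assumed to be counted in the gene length) encodes no residue.
    return codons - 1 if includes_stop_codon else codons


# Values taken from the record for Namu_3846.
length = gene_length_bp(4217138, 4218184)
print(length)                     # 1047
print(protein_length_aa(length))  # 348
```

Under these assumptions the computed values agree with the listed Gene Length (1047 bp) and Protein Length (348 aa).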
| |
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.00500845 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 0.20128 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGCAAGC CCACTCTCCG CGACGTCGCG GACCGTTCCG GCTTCTCCAT CACCACCGTC TCCCAGGTTC TCAACGATGT GCCGGGCAAG CGCATCCCGG ACGCCACCCG CGACCGGGTC CGGGCCGCCG CGACCGAGCT GGCCTACCGG CCGAACCGGT TGGCCCAGGG CCTGCGGCTG CAGCGGTCCA ACACCCTGGG GTTCGTCAGC GACAAGATCG CGACCACGCC CTACGCCGGT GAGGTGATCC TGGGCGCGCA GGACGCGGCC GCCGAGCACG GCGACCTGTT GTTGCTGATG AACTCCAACA GCGACCCGGG TCTGGAGGAA CGCGAGATCC GCGCCCTGCA GGAGCGCCAG GTGGACGGCA TCATCTTCGC TTCGGAATAC CACCGGGTGA TCACGCCGCC GGACGCGTTG CAGGGCACCC CGGCGGTGCT GCTGGACGCC CGCTCGGTCC GTGGCGATGT CAGCTCGGTC GTCCCCGACG AGGTCGGCGG CACGTTGGCC GCGGTGCGCG AACTGATCGC CGCCGGCCAC CAGCGCATCG CCTTTCTCAA CAACGTCGAC GACATTCCCG CCACAGCCCT GCGTTTGCAG GGATTCCGGC AGGGTCTGAA CGAGGCCGGC CGGCGCCTGC GCGCGGGCAT GGTGGTGACC GCCGCCTCGA CCCCGGGCGC CGGTTATGAC GCGGCCCGGC AGTTGCTCGA CCAACCCCGG GCCGGCCGAC CGACCGCGAT CTTCTGCTTC AACGACCGGA TGGCGATGGG CACCTACCAG GCGGCCGCCG AACTGGGCCT GCGCATCCCG GACGACCTGT CGGTCGTCGG CTTCGACAAC CAGGAACTGA TCGCGGCGAA CCTGCGCCCC GGTCTGACCA CGGTGGCCCT GCCCCATTAC GCGATGGGCC AGTGGGCGGT CGCCGCCCTG CTGGACCTGA TCGACGCCCA GGCCGACCCA TCCCAGAAGC GACAGCCGAT CCGCGAGGAG AAGCTGCCCT GCCCACTGGT GCGCCGTGCA TCGGTGGGCC CGCCGCCCCG GTCGTGA
|
Protein sequence | MGKPTLRDVA DRSGFSITTV SQVLNDVPGK RIPDATRDRV RAAATELAYR PNRLAQGLRL QRSNTLGFVS DKIATTPYAG EVILGAQDAA AEHGDLLLLM NSNSDPGLEE REIRALQERQ VDGIIFASEY HRVITPPDAL QGTPAVLLDA RSVRGDVSSV VPDEVGGTLA AVRELIAAGH QRIAFLNNVD DIPATALRLQ GFRQGLNEAG RRLRAGMVVT AASTPGAGYD AARQLLDQPR AGRPTAIFCF NDRMAMGTYQ AAAELGLRIP DDLSVVGFDN QELIAANLRP GLTTVALPHY AMGQWAVAAL LDLIDAQADP SQKRQPIREE KLPCPLVRRA SVGPPPRS
|
| |
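The sequences above are printed in space-separated blocks of 10 characters. The sketch below shows one way to join such blocks, estimate GC content, and translate the start of the coding sequence. The helper names are hypothetical. The codon table is the standard assignment written out by hand, on the assumption that translation table 11 uses the same amino acid assignments as the standard code and differs only in its permitted start codons. Only the first three blocks of the gene sequence are used here, so the GC value shown is not the whole-gene figure.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def join_blocks(blocked: str) -> str:
    """Remove the whitespace between 10-character blocks and upper-case the result."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence from the record.
dna = join_blocks("ATGGGCAAGC CCACTCTCCG CGACGTCGCG")
print(translate(dna))  # MGKPTLRDVA
print(gc_fraction(dna))
```

The translated fragment matches the first block of the listed protein sequence. Running `gc_fraction` on the full joined gene sequence would give a figure to compare against the GC content field.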