Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3558 |
Symbol | |
ID | 9157737 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3666077 |
End bp | 3667087 |
Gene Length | 1011 bp |
Protein Length | 336 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | periplasmic binding protein/LacI transcriptional regulator |
Protein accession | YP_003648476 |
Protein GI | 296141233 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.617929 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCAGCC GCAAGAGCGT CCGCGTGACA GCGATGTGCT GCGCGCTCAC CGTCGGTCTC ACCTCGGTCG CCGCCTGCAA CCGGGACGGC GGCTCGTCGG GCGGTGACGG GGTGAAGGTC GGCCTGATCA CCAAGACCGA CACCAATCCG TTCTTCGTGA AGATCCGTGA GGCCGCGAAG GCGCAGGCCG ATGCGAAGGG GGCGAATCTG GTCGCCCTCG CGGGCAAGTT CGACGGGGAT AACGAGGGCC AGGTCGCGGC GATCGAGAAC CTGGTGAGCC AGGGCGTCAA GGGCATCCTG ATCACCCCCA ATTCGTCGAC CGGCATCCTC GCCGCGATCA AGAAGGCGCG CGATGCCGGC GTCGTGGTCA TCGCGCTCGA CACCGCCACC GAACCCGCCG ATGCCGTGGA CGCGACCTTC GCCACCGACA ACACCGAGGC CGGCCGCCTG CAGGGCGCGT GGGTCAAGGG GGCGCTCGCG GGGGCCGCAC CGAAACTCCT CATGCTCGAC GGCACCGCCG GCAGCACCGT CTCCGACTTC CGGCACAAGG GCTTCCTCCA GGGCATCGGG CTCACCGAGA ACGCCCCCGA GATCGCGGGC ACGGAGAATA CCAACGGCGA TCAGACCAAG GCGCAGACCG CGATGGAGAA CCTGCTGCAG CGCGTCCCGG ATGCCAACGC GCTGTACACG ATCAACGAGC CGGCCGCGGC CGGCGGCTAC CAGGCCGCGA AGAAGGCGGG CAAGGCGGCG CAGCTGACCA TCGGCTCGAT CGACGGCAGC TGCGCCGGTG TGGACAACGT CAAGAAGGGC ATGATCGGCG CTACCGTTCT GCAGTTCCCG GCGAAGATGG CGCAGCAGGG CGTCGACGCG GTGGTGGCCT TCGCCAAGGA TGGCACCAAG CCGAGCGGTT TCGTCAACAC CGGCTCGCAG CTGGTGACCG ACAAGCCCGT GGCCGGAGTC GATTCCAAGG ACACCGCGTG GGGTGCGCAG AACTGCTGGG GCCAGGCGTG A
|
Protein sequence | MPSRKSVRVT AMCCALTVGL TSVAACNRDG GSSGGDGVKV GLITKTDTNP FFVKIREAAK AQADAKGANL VALAGKFDGD NEGQVAAIEN LVSQGVKGIL ITPNSSTGIL AAIKKARDAG VVVIALDTAT EPADAVDATF ATDNTEAGRL QGAWVKGALA GAAPKLLMLD GTAGSTVSDF RHKGFLQGIG LTENAPEIAG TENTNGDQTK AQTAMENLLQ RVPDANALYT INEPAAAGGY QAAKKAGKAA QLTIGSIDGS CAGVDNVKKG MIGATVLQFP AKMAQQGVDA VVAFAKDGTK PSGFVNTGSQ LVTDKPVAGV DSKDTAWGAQ NCWGQA
|
| |