Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3559 |
Symbol | |
ID | 9157738 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3667223 |
End bp | 3668242 |
Gene Length | 1020 bp |
Protein Length | 339 aa |
Translation table | 11 |
GC content | 70% |
IMG OID | |
Product | transcriptional regulator, LacI family |
Protein accession | YP_003648477 |
Protein GI | 296141234 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.156963 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGACCA TGGCCGATGT CGCCGCCGCC GCCGGCGTCT CGATCAGCAC GGTCTCGCAC GTGCTCAACG AAACCCGGCG CGTGGATCCC CGCACCCGCG ACGCGGTGCT CGCCGCGATC GAATCCACGG GCTATCGCCG CAACGCCCTC GCGACCGCGC TCGCCACGTC CCGCTCGGGC GTTCTGGCAC TGAGCATCTC GGCGGGCCGC AACCCGTATT TCGGACCGCT GATGCGCGCC ATCGAATCGC GGGCCAGCGA ACTCGGATAC ACACTCATGA TGGGCGATTC GCACGACGAT CCGGAGATCG AGACCCGGCT CGTCGGATCG CTGCTCGACC GCCGCGTGGA CGGGATGATC CTGGCCCCGG CGCCCCATTC GGAGGCCGGG ACGATCCCGA CGGTGCGCCG AGCGGGCACG CCGCTGGTAC TCATCGATCG GCTCTCCCCG GCCGATGTCG ACCAGGTCGC CTCCGAGGGC GCGGAGCCCG TGGCCCGGCT CACCGCACAT CTCGCGGAGC TCGGACACCG GCGCATCGGG GTGCTCACCG GACACCCCGG TATCCAGTCC ACGATCGAGC GGATCCAGGG CTTCACCGGC GCGATGACCG CGGCCGGACT GCGCGCCGCA CCGCGCCACA TCCGCTGCGG CGACTCCCGC GCCGACGAGG CCCGCAGGCA GACGCTCGCG ATGTTCCGGG CGCGGGCACC GCGACCGACC GCGCTGGTGG TGCTCAACAA CGAGATGACC GTGGGCACGA TGCGGGCGCT GCGTGAGCTG CAACTGCGCG TGCCCGATGA CGTGGCACTG GTGGCCTACG ACGACTTCGA ATGGTCCGAC CTGTTCTCAC CCGGACTCAC CGCGGCGGCA CAGAACGTGG ATGCGATCGG GCGCAGGGCC GTGGACCTGC TGGTCGAACG GATCGGCGGC TTCGACGGAC CGCGGCGCGT GGAGCGTGTT CCCACGGCGT TCCATCACCG TGACTCATGC GGATGTGAAC GCCTCGGCGG TGGGGTTTAG
|
Protein sequence | MGTMADVAAA AGVSISTVSH VLNETRRVDP RTRDAVLAAI ESTGYRRNAL ATALATSRSG VLALSISAGR NPYFGPLMRA IESRASELGY TLMMGDSHDD PEIETRLVGS LLDRRVDGMI LAPAPHSEAG TIPTVRRAGT PLVLIDRLSP ADVDQVASEG AEPVARLTAH LAELGHRRIG VLTGHPGIQS TIERIQGFTG AMTAAGLRAA PRHIRCGDSR ADEARRQTLA MFRARAPRPT ALVVLNNEMT VGTMRALREL QLRVPDDVAL VAYDDFEWSD LFSPGLTAAA QNVDAIGRRA VDLLVERIGG FDGPRRVERV PTAFHHRDSC GCERLGGGV
|
| |