Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_3754 |
Symbol | |
ID | 9157934 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 3870765 |
End bp | 3871745 |
Gene Length | 981 bp |
Protein Length | 326 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | |
Product | extracellular solute-binding protein family 3 |
Protein accession | YP_003648671 |
Protein GI | 296141428 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.807307 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATGCGCC GAAGCCTGAG CGGGATCGCC CTCGCGCTCA GCGTCGCCCT CACCACCTCG GCCTGCGTGG TCTCCGCCCC CGAGGCACCG GGTGACGCGG TGACCTCCTT CCCCCAGCCG ATGCCCGGGT CCGGCACCGT CGTCCCGGCG GACACACCGC TGCCGACGGA GGTGTCGATG TGCCCGGGCA CCCTGTCGCC GTCGTCCCTG CCGCCGGCGG GCCAGATGGC CCCCGGCACC ACGATGGCGG AGATCTTCGA CCGCGGCCGG CTCGTGGTGG GTATCGACAT CGGCTCGAAC CTGCTGTCCT TCCGCGACCC GATCACCGGT GAGATCAAGG GTTTCGACGC CGATATCGCA CATCAGATCG CCGCCGCGAT CTTCGGCGAC CCGAACCGTG TGCACTTCCG CATCCTCTCC ACCGCCGAAC GGCAGCAGGC GCTGATCGAC GGCACCGTCG ATGTGGTGGT CAAGACGATG AGCGCCACCT GCGACCGGGC CAAGCAGGTG GACTTCTCGG CCACCTATTT CATCGCGCAA CAGCGCATCC TGGCGCCGCG CGGCGCACCG ATCAACACCG TCGGCGACCT GTCCGGCCGC CGGGTGTGCG AGGTTCGCGG CACCACCTCG CTCGACCGGG TGCGGCGGCT CGTACCCAGT GCCACCGTGA TCGCCGCCGA CTCCTGGTCG GATTGTCTGG TGATGCTCCA GCAGCAGCAG GTGGACGCCG TCTCGACCGA CGACGCGATC CTCGCGGGGC TCGAAGACCA GGATCCGAAC CTGACGGTGA CCGGGCCGTC GATGGGACTG GAGTACTACG CGGTCGGTAT CGGGAAGAAT CGGCCCGACC TGGTGCGTTT CATCAATGGG GTGCTGCTGC GCATGGCCTT CGACGGAACC TGGATGCGGC TGTACAACCA GTGGTTCGCG CCTACCCTCG GGCAGGTGTG GTCGGCACCC GCGGCGCAGT ACACGGGGTA G
|
Protein sequence | MMRRSLSGIA LALSVALTTS ACVVSAPEAP GDAVTSFPQP MPGSGTVVPA DTPLPTEVSM CPGTLSPSSL PPAGQMAPGT TMAEIFDRGR LVVGIDIGSN LLSFRDPITG EIKGFDADIA HQIAAAIFGD PNRVHFRILS TAERQQALID GTVDVVVKTM SATCDRAKQV DFSATYFIAQ QRILAPRGAP INTVGDLSGR RVCEVRGTTS LDRVRRLVPS ATVIAADSWS DCLVMLQQQQ VDAVSTDDAI LAGLEDQDPN LTVTGPSMGL EYYAVGIGKN RPDLVRFING VLLRMAFDGT WMRLYNQWFA PTLGQVWSAP AAQYTG
|
| |