Gene Information |
Locus tag | Tpau_3527 |
Symbol | |
ID | 9157706 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 3639335 |
End bp | 3640645 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003648445 |
Protein GI | 296141202 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
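The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon (both assumptions agree with the values in this record). The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the stop codon


length = gene_length(3639335, 3640645)
print(length)                           # 1311
print(expected_protein_length(length))  # 436
```

Both results match the Gene Length (1311 bp) and Protein Length (436 aa) fields of the record.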
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGCCGACC TCAGCAGACG CGGATTCCTC GCGACCAGCC TCGCCGCCAC GGCGGCCTTC GCCGCCGCCT GTGCGGGTTC CGGCGGGTCG GGCGGTGGCG GCAACTCGGG TGGCAGCGGC GGCGGTATCA AGCTGCAGTT CCTCACCAAC CATCCGGGCA GCTCGAAGGC GATCGAGCAG AGCATCATTG ACGAGTTCCA GAAGGCCAAC TCCGGCATCA CCGTCGAGCT GCTCGACGGC GGCAAGGACT ACGAGGAGGT GGCGCAGAAG TTCCAGACCT CGCTGACCGG CGGCACGAAG CCCGACATCA TCGTGGTCTC CGACGTCACC TGGTTCAACT TCGCGCTGAA CAAGCAGATC GAGCCGCTCG ACGGCCTGTT CGCCGGTGCC GGCCTCAACC CTGCCGACTA CGTTGACTCC CTGCTGGCCG ACGGCAAGTT CGACGGCAAG TACTACACCA TCCCGTTCGC CCGCTCGACG CCGCTGTTCT ACTACAACAA GGACGTGTGG AAGAAGGCCG GCCTCGAGGA CCGCGGCCCG AAGGACTGGG ACGAGTTCGT GGCCTGGGCG CCGCGCATCC AGGAGGCCAT CGGCGGCGAT AAGAAGGCCA TCGTGCTGGC CGACTCGGCG AACTACATCG ACTGGGTCTT CGAGGGCTGG AACTGGTCCA AGGGCGGTGC CTACTCGGAC GGCTGGGACC TGAAGTTCAC CACGCCCGAA TCCGTTGCTG CCGCGCAGCA GCTCAAGGAC GTGATCGGCA AGTGGGGCCG GCTGACCAGC AAGCCGGAGA ACGACTTCGG TGCCGGCCTC GCCGGCGTCA CCCTGCAGTC GACGGGCTCC CTGAAGACGA TCACCACCAC CGCGAAGTTC GAGGTGGGCA CCGCCTTCCT GCCCGGCCCG CAGGGCAAGT CCTGCCCGAC GGGCGGCGCC GGCGTGGCGA TCGCCGCGGG CATCTCCGAC GACCGCAAGG CCGCCGCGAT GAAGTTCATC GAGTTCCTCA CCAATGCCAA GAACTCGTCC ACCTTCTCGC AGGGCACCGG CTACATGCCG GTGCGCAAGT CCGCGGTCGA CGATCCGTCG ATGAAGGAGT TCATCGCGAA GAACCCGAAT TTCGGGACCG CGGTCAAGCA GCTCCCGTTC ACCCGCAGCC AGGACAACGC CCGGGTGTTC GTGCCGGGCG GTGGTCGCGA CATCGGGCAG GCGTTGCAGC AGATCGCGAC CGGAGGTGAC CCGGCGGCGG TGCTCGGCGC GCTGCAGAGC ACGATCCAGG GCAAGATCGA CTCGCAGATC ACGCCGAAGC TGCCGAAGTA G
|
Protein sequence | MADLSRRGFL ATSLAATAAF AAACAGSGGS GGGGNSGGSG GGIKLQFLTN HPGSSKAIEQ SIIDEFQKAN SGITVELLDG GKDYEEVAQK FQTSLTGGTK PDIIVVSDVT WFNFALNKQI EPLDGLFAGA GLNPADYVDS LLADGKFDGK YYTIPFARST PLFYYNKDVW KKAGLEDRGP KDWDEFVAWA PRIQEAIGGD KKAIVLADSA NYIDWVFEGW NWSKGGAYSD GWDLKFTTPE SVAAAQQLKD VIGKWGRLTS KPENDFGAGL AGVTLQSTGS LKTITTTAKF EVGTAFLPGP QGKSCPTGGA GVAIAAGISD DRKAAAMKFI EFLTNAKNSS TFSQGTGYMP VRKSAVDDPS MKEFIAKNPN FGTAVKQLPF TRSQDNARVF VPGGGRDIGQ ALQQIATGGD PAAVLGALQS TIQGKIDSQI TPKLPK
|
| |
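The Sequence fields can be checked against the Translation table and GC content fields with a short script. This is a sketch under stated assumptions: it uses only the Python standard library, it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it does not handle the alternative start codons of table 11 (this gene starts with ATG, so that does not matter here). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical. Whitespace is stripped first because the record prints sequences in blocks of ten.

```python
import re

# Codon-to-residue assignments in TCAG order; table 11 shares these
# assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing used in the record and upper-case the sequence."""
    return re.sub(r"\s+", "", seq).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]  # KeyError on non-ACGT bases
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence from this record.
prefix = "ATGGCCGACC TCAGCAGACG CGGATTCCTC"
print(translate(prefix))              # MADLSRRGFL
print(round(gc_fraction(prefix), 3))  # 0.633
```

Passing the full Gene sequence value to `translate` and `gc_fraction` allows a comparison with the Protein sequence and GC content fields. The sequence is listed as the coding strand (it begins with ATG), so no reverse complement is needed even though the gene lies on the minus strand.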