Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_2099 |
Symbol | |
ID | 9156254 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | - |
Start bp | 2187472 |
End bp | 2188503 |
Gene Length | 1032 bp |
Protein Length | 343 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003647050 |
Protein GI | 296139807 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.603993 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCATCA ACCGTTCTCT CACCGCGGCG GTGGCCGTCG CCACCGTGCT TGCGGCCACC GCCTGCGGGG GCACCGGCGG CGGTTCCGGA TCCGGTGATT CCGTCACCGT GTACAGCGCC GACGGTCTCG GCGACTGGTA CAAGGCCGAA TTCGCCGACT TCACCAAGCG AACCGGCATC ACGGTCGCTC TGGTCGAAGC CGGTTCGGGA GAGGTGATCT CGCGCGCGCA GAAGGAGAAG TCGAATCCGC AGGTCGATGT GCTCGTCACT CTCCCGCCGT TCATCCAGCA GGCGCACCAG CAGGGCTCAC TCGCGCCCTC CGGCGTCGAT ACGGGTTCCG TTCCGGCCGA ACTGAAGTCG AATGACGGTA CCTATGTGGC CGTGGTCAAC AACTACTTCG CGATGATCCG TGGCACCGCT GCGCTTCCCA AGCCTGCCGA CTGGGCGGAG TTGACCGGCC CGGCGTACAA GCAGAAGATC CAGTACTCCA CCCCGGGAAA GGCGGGCGAT GGCACCGCGC TCCTCCTTCT GCTCCAGCAG ATCATGGGCA AGGAGGGCGC GCTCGCGTAC CTCGCCGAGC TGCAGACGAA CAACGTGGGA CCGTCGACAT CCACGGGCAA GCTCGGACCG AAGGTCTCGA AGGGCGAACT GAGCGTGGCC AACTCGGATG TGCAGATGGC GCTCGCCGCC ATCGCCGCCG ACAAGGTGGC CTACGAAGTG TTCTACCCGG CCTTCGGGGG CAAACGCAGC ACGCTCCCTC TGCCGTACTT CATGGGGCTG GCGGCTCAGG CACCGCATGC CGCGAACGGG ACCAAGCTCA TGGAGCACCT GCTGTCGAAG CAGACCCAGC AGAACATCCC CGCCCGCGCC TGGGGTGCCC CGGTGCGCAC CGATGTCACC CCGTCGGGCC AGCAGTGGGA CGCTTTCCGC GCCGCTATCG ACGGCGTCGA GATCTGGTAC CCGAACTGGG ATCAGGTGCT CAGCGGTCTC GACGCCGATA TCAAGGCCTA CGAGAAGGCG ACCGGCCAGT GA
|
Protein sequence | MRINRSLTAA VAVATVLAAT ACGGTGGGSG SGDSVTVYSA DGLGDWYKAE FADFTKRTGI TVALVEAGSG EVISRAQKEK SNPQVDVLVT LPPFIQQAHQ QGSLAPSGVD TGSVPAELKS NDGTYVAVVN NYFAMIRGTA ALPKPADWAE LTGPAYKQKI QYSTPGKAGD GTALLLLLQQ IMGKEGALAY LAELQTNNVG PSTSTGKLGP KVSKGELSVA NSDVQMALAA IAADKVAYEV FYPAFGGKRS TLPLPYFMGL AAQAPHAANG TKLMEHLLSK QTQQNIPARA WGAPVRTDVT PSGQQWDAFR AAIDGVEIWY PNWDQVLSGL DADIKAYEKA TGQ
|
| |