Gene Information |
Locus tag | Tpau_2070 |
Symbol | |
ID | 9156225 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 2158134 |
End bp | 2159171 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003647021 |
Protein GI | 296139778 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
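The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive, and that the reported gene length includes the stop codon; the function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2158134, 2159171)
print(length_bp, protein_length(length_bp))  # 1038 345
```

Under these assumptions the computed values agree with the Gene Length (1038 bp) and Protein Length (345 aa) fields of this record.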
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.620132 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGTGGTT TTCGACGGTG GGCGGCGCTC GGAGGAGCAG CGGTCCTCGC GGTGGGTTCG CTCACCGCGT GCGGTAGCGG ATCGACTCCG GACGCGGGTA CCTCCAGCGC TTCGTCGGCG GGCCCCACGA TCACGCTGTA CTCGGGCCGG TCGAAGGAAC TCGTAGGTCC GCTGATCGAT CAGATCAAGG GTGCCGTGTC CGTGAACGTC GCCTACGACA AGAAGGCCAA CCAGATTCAG GAGGAAGGCA GCCGCAGCCC CGCGGACGTC TACTTCGGTC AGGACGCGGG CGAACTCGGT GCGCTCGCGA AGGCGGGGCT CCTCGAACCG TTGCCCAGTG ACATCGTGGA CAAGATCCCG GCGCAGTACC GCTCCGGCGA CGGCACCTGG GCGGCGACCT CCGCGCGATC GCGCGTGCTG GTGTACAACC CCGCACAGCT GCCCGAAGCC GACCTGCCGA CCGGGATCGA CGGGCTCCTC GACCCGAAGT TCAAGGGCAA GGTCGGCTAT GCGCCCACCA ATGCGTCGTT CAAATCGTTC GTGACCGCGC TGCGGGTGCT CCGCGGCGAA GACGGCGCGA AGGCGTGGCT CCAGGGATTC CTGGCGAATG AGCCGAAGAA GTTCGAGGGC AACGGCAAGA TCCTCGACGC GGTGAACTCG GGCACCGTGG CGTCCGGCCT GATCAACCAC TACTACTGGG CGCAGGCCGT GCAGGAGAAG GGCGAGGCGG GCGTGCCGTC GAAGCTCACG TTCTTCGATA ACGATCCGGG CGGTCTCGTG AACCTCGCCG GCGCCGCGGT GCTCAAGAGC TCCACGAAGA AGCCGCAGGC CATCGAGTTC GTGCGCCAGC TGTTGACGAA CTCCTCGCAG GAGTACTTCG CCACCAAGAC GGGGGAGTAC CCGGTGGTCT CGGGTGTCCC GTTCTCGGTG CCCGGCCTGC CGCCACTGTC GGCCCTCACG CCGCCGAAGG TCGATTACGC GCAGCTCGCC GACACCAAGG GCACCGAGCG ACTGTTTACC GAGGTGGGTG CGCTCTAA
|
Protein sequence | MGGFRRWAAL GGAAVLAVGS LTACGSGSTP DAGTSSASSA GPTITLYSGR SKELVGPLID QIKGAVSVNV AYDKKANQIQ EEGSRSPADV YFGQDAGELG ALAKAGLLEP LPSDIVDKIP AQYRSGDGTW AATSARSRVL VYNPAQLPEA DLPTGIDGLL DPKFKGKVGY APTNASFKSF VTALRVLRGE DGAKAWLQGF LANEPKKFEG NGKILDAVNS GTVASGLINH YYWAQAVQEK GEAGVPSKLT FFDNDPGGLV NLAGAAVLKS STKKPQAIEF VRQLLTNSSQ EYFATKTGEY PVVSGVPFSV PGLPPLSALT PPKVDYAQLA DTKGTERLFT EVGAL
|
| |
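The gene sequence above is printed in space-separated blocks of ten bases. The sketch below shows one way to strip that grouping, estimate GC content, and translate the reading frame. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (the tables differ in their permitted start codons, which this sketch does not model). The helper names `clean`, `gc_content`, and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove whitespace between blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for empty input)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate frame 1 until the first stop codon; unknown codons become 'X'."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed in this record.
head = "ATGGGTGGTT TTCGACGGTG GGCGGCGCTC"
print(translate(head))  # MGGFRRWAAL
```

The translated prefix matches the first ten residues of the protein sequence listed in this record. Passing the full gene sequence to `gc_content` and `translate` would allow the GC content and Protein sequence fields to be checked in the same way.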