Gene Information |
Locus tag | Tpau_1269 |
Symbol | |
ID | 9155413 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 1307906 |
End bp | 1309144 |
Gene Length | 1239 bp |
Protein Length | 412 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | |
Product | extracellular solute-binding protein family 1 |
Protein accession | YP_003646239 |
Protein GI | 296138996 |
COG category | |
COG ID | |
TIGRFAM ID | |
| |
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGAC GCGCCGCGCT TGCGGGCTTC GGCGCGTTCG CGGCGTCCGG AGTGTTGGCG GCGTGCGGGT CGAACACCGG ACGCGGTTCG TCGGGTGACC TGACGGTCTG GTTCCACGAG TACGGCGAGA AGGGAACCGG GACCGCGCTC CGGCGGTATG CGTCGGGGTT CCCGGGCGGC GGTGTCTCCA CGCTCACCTT CCCCGGTGAC TACGATCAGA AGGTCGCATC GGCGCTGATC GGAGACAGTG GGCCGGACGT CTTCGAGTTC GCGAACGGCC CCACGATCGA CATGATCACC GCGGGTCAGG TCGCCGATCT CACCGATCTG CTCGGCGATG CCCGTTCCGA TTTCAATCCA GCTCTCCTGG ACCGGATGAC CTATCGCGGG AAGGTCTACG GTATCCCGCA GGTGATGGAT GTCCAGGTGC TGGTGTACCG CAAGTCGATG TTGCAGAAGG CGGGCGTGCG TCCGCCGCGC ACCGTCGATG AGCTGGTGTC GGCGGCCGAG GAGCTGAATC AGGGCACCAC CAAGGGGCTC TTCCTCGGTA ACAACGGTGG CGCCGATGTC CTCGGCGGGG TCCTGTTGTG GTCGTCGGGG CACGACTACC TGACCCCGGA CCGCAAGCCC GGTTTCGTCA ATGAGGATGT GGCGTCCGGC CTGCGCACGA TGCGCAAGCT GTTCACCTCG GGAAACCTGT TGCTCGGCGC GCCGCAGGAC TGGTCGGATC CGGGAGCCTT CATCGCGGGC CTGACCGCCA TGCAATGGAC CGGACTCTGG GCGTTGCCGG CGATCACGGA GGCCCTGGGC GACGATGTCG GCGTGCGGGC GTTCCCCGCG ATCGGCACCA ACGGTGGCGA CAGTGTGCCG TTCGGCGCGT ACGGCTCCGC GGTGAGCACG AAGGGGGATC TCGCGAAGGC GAAGAAGTTC GTGCAGTGGC TGTGGGTGGA ACGCTCCGAC CTGCAATTGG ACTGGGCGCA GGGCTACGGG TTGCATGTGC CCGCCCGGCG TTCGCTGGTT CCACAGGCGA ACAAACTCTC GACCGGTGTC TACAACGAGG CGTCCAAGCT CGTCTACTCG GTGGGCCGCC CGCAGACCCC GCTGCTGTGG ACCCCGCGGA GCTCGACGGC CTACCGGGAC GCGTTGGACC GAGTGATCCG CGGCGGTGCC GACCCGATGA CGGAGTTGCG CGGCGTGGCG GACGTGGTGG AACGAGAACT GGACCGGTTC CCCGCCTGA
|
Protein sequence | MSRRAALAGF GAFAASGVLA ACGSNTGRGS SGDLTVWFHE YGEKGTGTAL RRYASGFPGG GVSTLTFPGD YDQKVASALI GDSGPDVFEF ANGPTIDMIT AGQVADLTDL LGDARSDFNP ALLDRMTYRG KVYGIPQVMD VQVLVYRKSM LQKAGVRPPR TVDELVSAAE ELNQGTTKGL FLGNNGGADV LGGVLLWSSG HDYLTPDRKP GFVNEDVASG LRTMRKLFTS GNLLLGAPQD WSDPGAFIAG LTAMQWTGLW ALPAITEALG DDVGVRAFPA IGTNGGDSVP FGAYGSAVST KGDLAKAKKF VQWLWVERSD LQLDWAQGYG LHVPARRSLV PQANKLSTGV YNEASKLVYS VGRPQTPLLW TPRSSTAYRD ALDRVIRGGA DPMTELRGVA DVVERELDRF PA
|
| |
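The numeric fields in this record can be cross-checked against each other. The following is a minimal sketch, not part of the source database, that assumes the Start bp and End bp values are 1-based inclusive coordinates and that the CDS ends in a single stop codon (the gene sequence above ends in TGA). The function names are hypothetical and chosen only for illustration.

```python
def clean_sequence(grouped: str) -> str:
    """Strip the whitespace used to group residues into blocks of 10."""
    return "".join(grouped.split()).upper()


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_len_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(seq)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # Values copied from the record for Tpau_1269.
    length = gene_length(1307906, 1309144)
    print(length)                           # 1239, matches "Gene Length"
    print(expected_protein_length(length))  # 412, matches "Protein Length"
```

Passing the full Gene sequence field to `gc_percent` should give a value close to the 67% listed under GC content, provided that figure was computed over the same span.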