Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tpau_4216 |
Symbol | |
ID | 9158404 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Tsukamurella paurometabola DSM 20162 |
Kingdom | Bacteria |
Replicon accession | NC_014158 |
Strand | + |
Start bp | 4348999 |
End bp | 4349988 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | |
Product | Bile acid:sodium symporter |
Protein accession | YP_003649123 |
Protein GI | 296141880 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGTGCGC TGCTGAACAA GCTGTCGATC GACGGCTTCA TCCTCGCGAT CTTCGCCGCC GTCGCGGTGG CCGCCGTCTT CCCCGCGCAG GGCGAGGCCG TACCGGTGGT CGACGGTGCG GTCACCGTTG CCATCGCCGT GCTGTTCTTC CTCTACGGTG CGCGGATCCA CCCGCAGGAG GCGCTCGCCG GGCTCAAGCA CTGGCGGCTG CACGCCGTGA TCCTCGGCTT CACCTTCGTG GTCTTCCCGA TCCTCGGGGT GCTGCTGAAG TTCCTGCCAC CCGCCCTGCT CACCCCCAGC CTGTACGCGG GCGTGCTGTT CCTGACGCTG CTGCCTTCGA CGGTGCAGTC GTCGATCGCC TTCACCTCGA TCGCGGGCGG GAACGTACCC GGCGCCATCG TGAGCGCCTC GCTGTCGAAC CTGCTGGGCA TCTTCATCAC ACCGTTGTTG GTGCTGGGGC TCATGGCCAC CACCGGTGAG GTGCAGTTCC GCAGCAGTTC GATCATCGAC CTGTGCCTGC AACTTCTGCT GCCGTTCATC CTCGGTCAGC TCTCGCGACG CTGGGTGGCG GATTTCGTCA AGGATCACGC GGCGGCCCTG AAGTACGTGG ACCGCGGCTC GATCGTGCTG GTGGTGTACG CGGCGTTCTC CGCCGGTATG CGCGAGCACA TCTGGAGCCA GGTGTCGTGG GTCGGCGTCC TGCAGCTCAT CGTGCTGTCG GTATTACTGG TGCTGCTTCT GCTGTGGCTC ACCCGCTTCA CCGCGGAGAA GCTGGGCTTC GATCGCGGCG ACATGATCGC GATCCAGTTC TGCGGCACCA AGAAGTCGAT GGCCACCGGT CTCCCGATGG CCGCCGTCCT GTTCGCCGGA CAGCCCGTGG GCCTGATCGT GCTACCGCTG ATGATCTTCC ATCAGATCCA GCTGATGATG TGCGCCTGGC TCGCGGCACG GTACGGCCGC AAGCTGACCC CGGATCCCGC CGACGCCTGA
|
Protein sequence | MRALLNKLSI DGFILAIFAA VAVAAVFPAQ GEAVPVVDGA VTVAIAVLFF LYGARIHPQE ALAGLKHWRL HAVILGFTFV VFPILGVLLK FLPPALLTPS LYAGVLFLTL LPSTVQSSIA FTSIAGGNVP GAIVSASLSN LLGIFITPLL VLGLMATTGE VQFRSSSIID LCLQLLLPFI LGQLSRRWVA DFVKDHAAAL KYVDRGSIVL VVYAAFSAGM REHIWSQVSW VGVLQLIVLS VLLVLLLLWL TRFTAEKLGF DRGDMIAIQF CGTKKSMATG LPMAAVLFAG QPVGLIVLPL MIFHQIQLMM CAWLAARYGR KLTPDPADA
|
| |