Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tmz1t_0178 |
Symbol | |
ID | 7085275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Thauera sp. MZ1T |
Kingdom | Bacteria |
Replicon accession | NC_011662 |
Strand | + |
Start bp | 208891 |
End bp | 209838 |
Gene Length | 948 bp |
Protein Length | 315 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 643697220 |
Product | Bile acid:sodium symporter |
Protein accession | YP_002353869 |
Protein GI | 217968635 |
COG category | [R] General function prediction only |
COG ID | [COG0385] Predicted Na+-dependent transporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCATCG ACGTCCTGCT CCCCGCCGCC CTCGCCTTCA TCATGTTCTC GGTCGGGCTG GCGCTGCAGG GGGCGGATTT CCACCGCGTG TTCGCACGCC CGCGCGCGCT GCTGATCGGC CTCCTCGGCC AGCTCGTGCT GGTGCCGCTG GCCGCGCTCG CGGTGGTGCT GGCCTTCGAC CTGCCGGTGC TCCTCGGCAT GGGCCTGATG GTGCTGGCGG CCTGCCCGGG CGGGGCGAGC TCGGGCTTCC TGACCCACCT CGCGCGCGCC AACGCGGCGC TGTCCTTGAG CCTTACCGTG ATCAGCAGCC TGGCCGCGCT GCTGACCTTC CCGCTGCTGG TCAAGGCGGT GCTCGCCGCA TTCGGCGAGG GCGTCTTCGG TGCGGACGGC AGCGTGCTGG CCGCGCTGCC GGTCGGCCGC CTGATCGGCA GCGTGCTCGT GGTGACGACA GTGCCGATCG TCGCCGGGAT GCTGCTGCGG CGCCGGGCGC CCGCGCTCAC GGCGCGCGCC GAGCCGCTCG TCGCCCGCGT CGCCACGCTG TTCTTCGCCG CCATCGTGGT TGCCACCTTC GTGTCGCATC GGCACACCAT CCTCGCCAAC CTGCTGTCGG TCGGACCGGC GACCCTGGTG CTCAACTTCG TCGTCATGGG CCTGGGCTAC GGCCTGGTGA TGCTGGCGGG CGCCGAGCGT CGGGACGCGG TCGCGGTGGC GATGGAGTGC GGGCTGCAGA ACGCGGGGCT GGCGATCTTC GTCGCCATCG TGCTGTTGCG TCAGCCCGAG CTCGCCGTCG GCGCGGTGGT GTATGCGCTG ACGATGAACT TCGGCGCGCT CGGCTTGGTG TTCGTGGCGC GCCGGCGCGA GGGCGCGCTC GCGCACGACG CCAAACGAGT CCTGCGGCTG CCCCCGGGGC CTCCGACGGG CTCTCGCGCC ACGCCGCCGC GGCGTTGA
|
Protein sequence | MSIDVLLPAA LAFIMFSVGL ALQGADFHRV FARPRALLIG LLGQLVLVPL AALAVVLAFD LPVLLGMGLM VLAACPGGAS SGFLTHLARA NAALSLSLTV ISSLAALLTF PLLVKAVLAA FGEGVFGADG SVLAALPVGR LIGSVLVVTT VPIVAGMLLR RRAPALTARA EPLVARVATL FFAAIVVATF VSHRHTILAN LLSVGPATLV LNFVVMGLGY GLVMLAGAER RDAVAVAMEC GLQNAGLAIF VAIVLLRQPE LAVGAVVYAL TMNFGALGLV FVARRREGAL AHDAKRVLRL PPGPPTGSRA TPPRR
|
| |