Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0394 |
Symbol | tauA |
ID | 6145011 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 407608 |
End bp | 408570 |
Gene Length | 963 bp |
Protein Length | 320 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 641615290 |
Product | taurine transporter substrate binding subunit |
Protein accession | YP_001742497 |
Protein GI | 170684058 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG4521] ABC-type taurine transport system, periplasmic component |
TIGRFAM ID | [TIGR01729] taurine ABC transporter, periplasmic binding protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.560594 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAATTT CATCGCGTAA CACACTTCTT GCCGCACTGG CATTCATCGC TTTTCAGGCG CAGGCGGTGA ACGTCACCGT GGCGTATCAA ACCTCCGCCG AACCGGCGAA AGTGGCTCAG GCCGACAACA CCTTTGCTAA AGAAAGCGGA GCAACCGTGG ACTGGCGTAA GTTTGACAGC GGAGCCAGCA TCGTGCGGGC GCTGGCTTCA GGCGACGTGC AAATCGGCAA CCTCGGTTCC AGCCCGTTAG CGGTTGCAAC CAGCCAACAG GTGCCGATTG AAGTCTTCTT GCTGGCGTCA AAACTGGGTA ACTCCGAAGC GCTGGTGGTA AAGAAAACTA TCAGCAAACC GGAAGATCTG ATTGGCAAGC GCATCGCCGT ACCGTTTATC TCCACCACCC ACTACAGCCT GCTGGCGGCG CTGAAACACT GGGGTATTAA ACCCGGGCAA GTGGAGATTG TGAACCTGCA GCCGCCCGCG ATTATCGCTG CATGGCAGCG GGGAGACATT GATGGTGCTT ATGTCTGGGC ACCGGCGGTT AACGCCCTGG AAAAAGACGG CAAGGTGCTG ACCGATTCTG AACAGGTCGG GCAGTGGGGT GCGCCGACGC TGGATGTCTG GGTAGTGCGC AAAGATTTTG CCGAGAAACA TCCTGAGGTC GTGAAAGCGT TCGCTAAAAG CGCCATCGAT GCTCAGCAAC CGTACATTGC TAACCCAGAC GCGTGGCTGA AACAGCCGGA AAACATCAGC AAACTGGCGC GGTTAAGCGG CGTGCCTGAA GGTGACGTTC CGGGGCTGGT GAAGGGGAAT ACCTATCTGA CGCCGCAGCA ACAAACGGCA GAACTGACCG GACCGGTGAA TAAAGCGATC ATCGATACCG CGCAGTTTTT GAAAGATCAG GGTAAAGTCC CTGCCGTGGC GAATGATTAC AGCCAGTACG TGACCTCGCG CTTCGTGCAA TAA
|
Protein sequence | MAISSRNTLL AALAFIAFQA QAVNVTVAYQ TSAEPAKVAQ ADNTFAKESG ATVDWRKFDS GASIVRALAS GDVQIGNLGS SPLAVATSQQ VPIEVFLLAS KLGNSEALVV KKTISKPEDL IGKRIAVPFI STTHYSLLAA LKHWGIKPGQ VEIVNLQPPA IIAAWQRGDI DGAYVWAPAV NALEKDGKVL TDSEQVGQWG APTLDVWVVR KDFAEKHPEV VKAFAKSAID AQQPYIANPD AWLKQPENIS KLARLSGVPE GDVPGLVKGN TYLTPQQQTA ELTGPVNKAI IDTAQFLKDQ GKVPAVANDY SQYVTSRFVQ
|
| |