Gene EcSMS35_0394 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0394 
SymboltauA 
ID6145011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp407608 
End bp408570 
Gene Length963 bp 
Protein Length320 aa 
Translation table11 
GC content56% 
IMG OID641615290 
Producttaurine transporter substrate binding subunit 
Protein accessionYP_001742497 
Protein GI170684058 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG4521] ABC-type taurine transport system, periplasmic component 
TIGRFAM ID[TIGR01729] taurine ABC transporter, periplasmic binding protein 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.560594 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCAATTT CATCGCGTAA CACACTTCTT GCCGCACTGG CATTCATCGC TTTTCAGGCG 
CAGGCGGTGA ACGTCACCGT GGCGTATCAA ACCTCCGCCG AACCGGCGAA AGTGGCTCAG
GCCGACAACA CCTTTGCTAA AGAAAGCGGA GCAACCGTGG ACTGGCGTAA GTTTGACAGC
GGAGCCAGCA TCGTGCGGGC GCTGGCTTCA GGCGACGTGC AAATCGGCAA CCTCGGTTCC
AGCCCGTTAG CGGTTGCAAC CAGCCAACAG GTGCCGATTG AAGTCTTCTT GCTGGCGTCA
AAACTGGGTA ACTCCGAAGC GCTGGTGGTA AAGAAAACTA TCAGCAAACC GGAAGATCTG
ATTGGCAAGC GCATCGCCGT ACCGTTTATC TCCACCACCC ACTACAGCCT GCTGGCGGCG
CTGAAACACT GGGGTATTAA ACCCGGGCAA GTGGAGATTG TGAACCTGCA GCCGCCCGCG
ATTATCGCTG CATGGCAGCG GGGAGACATT GATGGTGCTT ATGTCTGGGC ACCGGCGGTT
AACGCCCTGG AAAAAGACGG CAAGGTGCTG ACCGATTCTG AACAGGTCGG GCAGTGGGGT
GCGCCGACGC TGGATGTCTG GGTAGTGCGC AAAGATTTTG CCGAGAAACA TCCTGAGGTC
GTGAAAGCGT TCGCTAAAAG CGCCATCGAT GCTCAGCAAC CGTACATTGC TAACCCAGAC
GCGTGGCTGA AACAGCCGGA AAACATCAGC AAACTGGCGC GGTTAAGCGG CGTGCCTGAA
GGTGACGTTC CGGGGCTGGT GAAGGGGAAT ACCTATCTGA CGCCGCAGCA ACAAACGGCA
GAACTGACCG GACCGGTGAA TAAAGCGATC ATCGATACCG CGCAGTTTTT GAAAGATCAG
GGTAAAGTCC CTGCCGTGGC GAATGATTAC AGCCAGTACG TGACCTCGCG CTTCGTGCAA
TAA
 
Protein sequence
MAISSRNTLL AALAFIAFQA QAVNVTVAYQ TSAEPAKVAQ ADNTFAKESG ATVDWRKFDS 
GASIVRALAS GDVQIGNLGS SPLAVATSQQ VPIEVFLLAS KLGNSEALVV KKTISKPEDL
IGKRIAVPFI STTHYSLLAA LKHWGIKPGQ VEIVNLQPPA IIAAWQRGDI DGAYVWAPAV
NALEKDGKVL TDSEQVGQWG APTLDVWVVR KDFAEKHPEV VKAFAKSAID AQQPYIANPD
AWLKQPENIS KLARLSGVPE GDVPGLVKGN TYLTPQQQTA ELTGPVNKAI IDTAQFLKDQ
GKVPAVANDY SQYVTSRFVQ