Gene EcSMS35_A0153 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_A0153 
SymboltetA 
ID6106627 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010488 
Strand
Start bp120487 
End bp121761 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content63% 
IMG OID641614892 
Producttetracycline resistance protein, class A 
Protein accessionYP_001740033 
Protein GI170650807 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones33 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones42 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCACCA ACTTATCAGT GATAAAGAAT CCGCGCGTTC AATCGGACCA GCGGAGGCTG 
GTCCGGAGGC CAGACGTGAA ACCCAACAGA CCCCTGATCG TAATTCTGAG CACTGTCGCG
CTCGACGCTG TCGGCATCGG CCTGATTATG CCGGTGCTGC CGGGCCTCCT GCGCGATCTG
GTTCACTCGA ACGACGTCAC CGCCCACTAT GGCATTCTGC TGGCGCTGTA TGCGTTGATG
CAATTTGCCT GCGCACCTGT GCTGGGCGCG CTGTCGGATC GTTTCGGGCG GCGGCCGGTC
TTGCTCGTCT CGCTGGCCGG CGCTGCTGTC GACTACGCCA TCATGGCGAC GGCGCCTTTC
CTTTGGGTTC TCTATATCGG GCGGATCGTG GCCGGCATCA CCGGGGCGAC TGGGGCGGTA
GCCGGCGCTT ATATTGCCGA TATCACTGAT GGCGATGAGC GCGCGCGGCA CTTCGGCTTC
ATGAGCGCCT GTTTCGGGTT CGGGATGGTC GCGGGACCTG TGCTCGGTGG GCTGATGGGC
GGTTTCTCCC CCCACGCTCC GTTCTTCGCC GCGGCAGCCT TGAACGGCCT CAATTTCCTG
ACGGGCTGTT TCCTTTTGCC GGAGTCGCAC AAAGGCGAAC GCCGGCCGTT ACGCCGGGAG
GCTCTCAACC CGCTCGCTTC GTTCCGGTGG GCCCGGGGCA TGACCGTCGT CGCCGCCCTG
ATGGCGGTCT TCTTCATCAT GCAACTTGTC GGACAGGTGC CGGCCGCGCT TTGGGTCATT
TTCGGCGAGG ATCGCTTTCA CTGGGACGCG ACCACGATCG GCATTTCGCT TGCCGCATTT
GGCATTCTGC ATTCACTCGC CCAGGCAATG ATCACCGGCC CTGTAGCCGC CCGGCTCGGC
GAAAGGCGGG CACTCATGCT CGGAATGATT GCCGACGGCA CAGGCTACAT CCTGCTTGCC
TTCGCGACAC GGGGATGGAT GGCGTTCCCG ATCATGGTCC TGCTTGCTTC GGGTGGCATC
GGAATGCCGG CGCTGCAAGC AATGTTGTCC AGGCAGGTGG ATGAGGAACG TCAGGGGCAG
CTGCAAGGCT CACTGGCGGC GCTCACCAGC CTGACCTCGA TCGTCGGACC CCTCCTCTTC
ACGGCGATCT ATGCGGCTTC TATAACAACG TGGAACGGGT GGGCATGGAT TGCAGGCGCT
GCCCTCTACT TGCTCTGCCT GCCGGCGCTG CGTCGCGGGC TTTGGAGCGG CGCAGGGCAA
CGAGCCGATC GCTGA
 
Protein sequence
MSTNLSVIKN PRVQSDQRRL VRRPDVKPNR PLIVILSTVA LDAVGIGLIM PVLPGLLRDL 
VHSNDVTAHY GILLALYALM QFACAPVLGA LSDRFGRRPV LLVSLAGAAV DYAIMATAPF
LWVLYIGRIV AGITGATGAV AGAYIADITD GDERARHFGF MSACFGFGMV AGPVLGGLMG
GFSPHAPFFA AAALNGLNFL TGCFLLPESH KGERRPLRRE ALNPLASFRW ARGMTVVAAL
MAVFFIMQLV GQVPAALWVI FGEDRFHWDA TTIGISLAAF GILHSLAQAM ITGPVAARLG
ERRALMLGMI ADGTGYILLA FATRGWMAFP IMVLLASGGI GMPALQAMLS RQVDEERQGQ
LQGSLAALTS LTSIVGPLLF TAIYAASITT WNGWAWIAGA ALYLLCLPAL RRGLWSGAGQ
RADR