Gene EcSMS35_0814 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_0814 
Symbol 
ID6146348 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp816698 
End bp817804 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content52% 
IMG OID641615703 
ProductABC-2 type transporter, permease protein 
Protein accessionYP_001742895 
Protein GI170683835 
COG category[V] Defense mechanisms 
COG ID[COG0842] ABC-type multidrug transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.547864 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones48 
Fosmid unclonability p-value0.91145 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTCATC GCTTATGGAC GTTAATCCGC AAAGAGTTGC AGTCGCTGCT GCGCGAACCG 
CAAACCCGCG CAATTCTGAT TTTACCCGTG CTGATTCAGG TGATCCTGTT CCCGTTCGCC
GCCACGCTGG AAGTGACCAA CGCCACCATC GCCATCTACG ATGAAGATAA CGGCGAGCAT
TCGGTGGAAC TTACTCAACG TTTTGCCCGC GCCAGCGCCT TTACCCATGT GCTGCTGCTG
AAAAGCCCGC AGGAGATCCG CCCGACCATC GACACACAAA AGGCGTTACT CCTGGTGCGT
TTCCCGGCTG ACTTCTCGCG CAAACTGGAT ACCTTCCAGA CCGCGCCGTT ACAGTTGATC
CTAGACGGGC GTAACTCCAA CAGTGCGCAA ATTGCCGCCA ACTACCTGCA ACAGATCGTC
AAAAATTATC AGCAGGAGCT GCTGGAAGGA AAACCGAAAC CCAACAACAG CGAGCTGGTG
GTACGCAACT GGTATAACCC GAATCTCGAC TACAAATGGT TTGTGGTGCC GTCGCTGATC
GCCATGATCA CCACTATCGG CGTAATGATC GTCACGTCAC TTTCCGTCGC CCGCGAACGT
GAACAAGGTA CGCTCGATCA GTTACTGGTT TCGCCGCTCA CCACCTGGCA GATATTTATC
GGCAAAGCCG TACCTGCGTT AATTGTCGCC ACGTTTCAGG CCACCATTGT GCTGGCGATT
GGTATCTGGG CGTATCAAAT CCCCTTCGCC GGATCGCTGG CGCTGTTCTA CTTTACGATG
GTGATTTACG GTTTATCGCT GGTGGGATTC GGTCTGTTGA TTTCATCACT CTGTTCAACA
CAACAACAGG CGTTTATCGG CGTGTTTGTC TTTATGATGC CCGCCATTCT TCTTTCCGGT
TACGTTTCGC CGGTGGAAAA CATGCCAGTA TGGCTGCAAA ACCTGACGTG GATTAACCCT
ATTCGCCACT TTACGGACAT TACCAAGCAG ATTTATTTGA AGGATGCGAG TCTGGATATT
GTGTGGAATA GTTTGTGGCC GCTACTGGTG ATAACGGCCA CGACAGGGTC AGCGGCGTAC
GCGATGTTTA GACGCAAGGT GATGTAA
 
Protein sequence
MFHRLWTLIR KELQSLLREP QTRAILILPV LIQVILFPFA ATLEVTNATI AIYDEDNGEH 
SVELTQRFAR ASAFTHVLLL KSPQEIRPTI DTQKALLLVR FPADFSRKLD TFQTAPLQLI
LDGRNSNSAQ IAANYLQQIV KNYQQELLEG KPKPNNSELV VRNWYNPNLD YKWFVVPSLI
AMITTIGVMI VTSLSVARER EQGTLDQLLV SPLTTWQIFI GKAVPALIVA TFQATIVLAI
GIWAYQIPFA GSLALFYFTM VIYGLSLVGF GLLISSLCST QQQAFIGVFV FMMPAILLSG
YVSPVENMPV WLQNLTWINP IRHFTDITKQ IYLKDASLDI VWNSLWPLLV ITATTGSAAY
AMFRRKVM