Gene EcSMS35_2503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2503 
Symbol 
ID6143177 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2550004 
End bp2551344 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content51% 
IMG OID641617375 
Productlong-chain fatty acid outer membrane transporter 
Protein accessionYP_001744547 
Protein GI170683067 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA AAACCCTGTT TACAAAGTCT GCTCTCGCAG TCGCAGTGGC ACTTATCTCC 
ACCCAGGCCT GGTCGGCAGG CTTTCAGTTA AACGAATTTT CTTCCTCTGG CCTGGGCCGG
GCTTATTCAG GGGAAGGCGC AATTGCCGAT GATGCAGGTA ACGTTAGCCG TAACCCCGCA
TTGATTACCA TGTTTGACCG CCCGACATTT TCTGCGGGTG CGGTTTATAT TGATCCGGAT
GTAAATATCA GCGGAACGTC TCCATCTGGC CGTAGCCTGA AAGCCGATAA CATCGCACCT
ACGGCATGGG TTCCGAACAT GCACTTTGTT GCACCGATTA ACGACCAATT TGGTTGGGGC
GCTTCTATTA CCTCTAACTA TGGCCTGGCA ACAGAGTTTA ACGATACTTA TGCAGGCGGC
TCTGTCGGGG GTACAACCGA CCTTGAAACC ATGAACCTGA ACTTAAGCGG TGCGTATCGC
TTAAATAATG CATGGAGCTT TGGTCTTGGT TTCAACGCCG TCTACGCTCG CGCGAAAATT
GAACGTTTCG CAGGCGATCT GGGGCAGCTG GTTGCTGGTC AGATTATGCA ATCTCCTGCC
AGGCAGACTC CTCAGGGGCA AGCATTAGCA GCTACCGCCA ACGGCATCGA CAGTAATACC
AAAATCGCTC ACCTGAACGG TAACCAGTGG GGCTTTGGCT GGAACGCCGG GATCCTGTAC
GAACTGGATA AAAACAACCG CTACGCATTG ACCTACCGCT CTGAAGTGAA AATTGACTTC
AAGGGTAACT ACAGCAGCGA TCTTAACCCG GCTTTTAATA ACTACGGTTT GCCAATTCCT
ACCGCGACAG GTGGCGCAAC GCAGTCGGGT TATCTGACGC TGAACCTGCC TGAAATGTGG
GAAGTGTCGG GTTATAACCG TGTTGATCCG CAGTGGGCAA TTCACTATAG CCTGGCTTAC
ACCAGCTGGA GTCAGTTCCA GCAGCTGAAA GCGACCTCAA CGAGTGGCGA CACGCTGTTC
CAGAAACATG AAGGCTTTAA AGATGCTTAC CGCATCGCGT TGGGAACCAC TTATTACTAC
GATGATAACT GGACCTTCCG TACCGGTATC GCCTTTGATG ACAGCCCAGT TCCGGCACAG
AATCGTTCTA TCTCCATTCC GGACCAGGAC CGTTTCTGGC TGAGTGCAGG TACGACTTAC
GCATTTAATA AAGATGCTTC AGTCGATGTT GGTGTTTCTT ATATGCACGG TCAGAGCGTG
AAAATTAACG AAGGCCCATA CCAGTTCGAG TCTGAAGGTA AAGCCTGGCT GTTCGGTACT
AACTTTAACT ACGCGTTCTG A
 
Protein sequence
MSQKTLFTKS ALAVAVALIS TQAWSAGFQL NEFSSSGLGR AYSGEGAIAD DAGNVSRNPA 
LITMFDRPTF SAGAVYIDPD VNISGTSPSG RSLKADNIAP TAWVPNMHFV APINDQFGWG
ASITSNYGLA TEFNDTYAGG SVGGTTDLET MNLNLSGAYR LNNAWSFGLG FNAVYARAKI
ERFAGDLGQL VAGQIMQSPA RQTPQGQALA ATANGIDSNT KIAHLNGNQW GFGWNAGILY
ELDKNNRYAL TYRSEVKIDF KGNYSSDLNP AFNNYGLPIP TATGGATQSG YLTLNLPEMW
EVSGYNRVDP QWAIHYSLAY TSWSQFQQLK ATSTSGDTLF QKHEGFKDAY RIALGTTYYY
DDNWTFRTGI AFDDSPVPAQ NRSISIPDQD RFWLSAGTTY AFNKDASVDV GVSYMHGQSV
KINEGPYQFE SEGKAWLFGT NFNYAF