Gene ECH74115_3487 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_3487 
Symbol 
ID6966853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp3229578 
End bp3230918 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content50% 
IMG OID643387293 
Productlong-chain fatty acid outer membrane transporter 
Protein accessionYP_002271756 
Protein GI209397641 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones60 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGA AAACCCTGTT TACAAAGTCT GCTCTCGCAG TCGCAGTGGC ACTTATCTCC 
ACCCAGGCCT GGTCGGCAGG CTTTCAGTTA AACGAATTTT CTTCCTCTGG CCTGGGCCGG
GCTTATTCAG GGGAAGGCGC AATTGCCGAT GATGCAGGTA ACGTCAGCCG TAACCCCGCA
TTGATTACCA TGTTTGACCG CCCGACATTT TCTGCGGGTG CGGTTTATAT TGACCCGGAT
GTAAATATCA GCGGAACGTC TCCATCTGGT CGTAGCCTGA AAGCCGATAA CATCGCGCCT
ACGGCATGGG TTCCGAACAT GCACTTTGTT GCACCGATTA ACGACCAATT TGGTTGGGGC
GCTTCTATTA CCTCTAACTA TGGCCTGGCA ACAGAGTTTA ACGATACTTA TGCAGGCGGC
TCTGTCGGGG GTACAACCGA CCTTGAAACC ATGAACCTGA ACTTAAGCGG TGCGTATCGC
TTAAATAATG CATGGAGCTT TGGTCTTGGT TTCAACGCCG TCTACGCTCG CGCGAAAATT
GAACGTTTCG CAGGCGATCT GGGGCAGCTG GTTGCTGGTC AGATTATGCA ATCTCCTGCC
GGGAAGACTC CTCAAGGGCA AGCATTGGCA GCTACCGCCA ACGGTATCGA CAGTAATACC
AAAATCGCTC ATCTGAACGG CAACCAGTGG GGCTTTGGAT GGAACGCCGG TATCCTGTAT
GAACTGGATA AAAATAACCG CTATGCACTG ACCTACCGTT CTGAAGTGAA AATTGACTTC
AAAGGTAACT ACAGCAGCGA TCTTAATCGT GTGTTTAATA ACTACGGTTT GCCAATTCCT
ACCGCCACAG GTGGCGCAAC GCAATCGGGT TATCTGACGC TGAACCTGCC TGAAATGTGG
GAAGTGTCGG GTTATAACCG TGTTGATCCG CAGTGGGCGA TTCACTATAG CCTGGCTTAC
ACCAGCTGGA GTCAGTTCCA GCAGCTGAAA GCGACCTCAA CCAGTGGCGA CACGCTGTTC
CAGAAACATG AAGGCTTTAA AGATGCTTAC CGCATCGCGT TGGGTACCAC TTATTACTAC
GATGATAACT GGACCTTCCG TACCGGTATC GCCTTTGATG ACAGCCCAGT TCCGGCACAG
AATCGTTCTA TCTCCATTCC GGACCAGGAC CGTTTCTGGC TGAGTGCAGG TACGACTTAC
GCGTTTAATA AAGATGCTTC AGTCGACGTT GGTGTTTCTT ATATGCACGG TCAGAGCGTG
AAAATTAACG AAGGCCCATA CCAGTTCGAG TCTGAAGGTA AAGCCTGGCT GTTCGGTACT
AACTTTAACT ACGCGTTCTG A
 
Protein sequence
MSQKTLFTKS ALAVAVALIS TQAWSAGFQL NEFSSSGLGR AYSGEGAIAD DAGNVSRNPA 
LITMFDRPTF SAGAVYIDPD VNISGTSPSG RSLKADNIAP TAWVPNMHFV APINDQFGWG
ASITSNYGLA TEFNDTYAGG SVGGTTDLET MNLNLSGAYR LNNAWSFGLG FNAVYARAKI
ERFAGDLGQL VAGQIMQSPA GKTPQGQALA ATANGIDSNT KIAHLNGNQW GFGWNAGILY
ELDKNNRYAL TYRSEVKIDF KGNYSSDLNR VFNNYGLPIP TATGGATQSG YLTLNLPEMW
EVSGYNRVDP QWAIHYSLAY TSWSQFQQLK ATSTSGDTLF QKHEGFKDAY RIALGTTYYY
DDNWTFRTGI AFDDSPVPAQ NRSISIPDQD RFWLSAGTTY AFNKDASVDV GVSYMHGQSV
KINEGPYQFE SEGKAWLFGT NFNYAF