Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2503 |
Symbol | |
ID | 6143177 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 2550004 |
End bp | 2551344 |
Gene Length | 1341 bp |
Protein Length | 446 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617375 |
Product | long-chain fatty acid outer membrane transporter |
Protein accession | YP_001744547 |
Protein GI | 170683067 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG2067] Long-chain fatty acid transport protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCAGA AAACCCTGTT TACAAAGTCT GCTCTCGCAG TCGCAGTGGC ACTTATCTCC ACCCAGGCCT GGTCGGCAGG CTTTCAGTTA AACGAATTTT CTTCCTCTGG CCTGGGCCGG GCTTATTCAG GGGAAGGCGC AATTGCCGAT GATGCAGGTA ACGTTAGCCG TAACCCCGCA TTGATTACCA TGTTTGACCG CCCGACATTT TCTGCGGGTG CGGTTTATAT TGATCCGGAT GTAAATATCA GCGGAACGTC TCCATCTGGC CGTAGCCTGA AAGCCGATAA CATCGCACCT ACGGCATGGG TTCCGAACAT GCACTTTGTT GCACCGATTA ACGACCAATT TGGTTGGGGC GCTTCTATTA CCTCTAACTA TGGCCTGGCA ACAGAGTTTA ACGATACTTA TGCAGGCGGC TCTGTCGGGG GTACAACCGA CCTTGAAACC ATGAACCTGA ACTTAAGCGG TGCGTATCGC TTAAATAATG CATGGAGCTT TGGTCTTGGT TTCAACGCCG TCTACGCTCG CGCGAAAATT GAACGTTTCG CAGGCGATCT GGGGCAGCTG GTTGCTGGTC AGATTATGCA ATCTCCTGCC AGGCAGACTC CTCAGGGGCA AGCATTAGCA GCTACCGCCA ACGGCATCGA CAGTAATACC AAAATCGCTC ACCTGAACGG TAACCAGTGG GGCTTTGGCT GGAACGCCGG GATCCTGTAC GAACTGGATA AAAACAACCG CTACGCATTG ACCTACCGCT CTGAAGTGAA AATTGACTTC AAGGGTAACT ACAGCAGCGA TCTTAACCCG GCTTTTAATA ACTACGGTTT GCCAATTCCT ACCGCGACAG GTGGCGCAAC GCAGTCGGGT TATCTGACGC TGAACCTGCC TGAAATGTGG GAAGTGTCGG GTTATAACCG TGTTGATCCG CAGTGGGCAA TTCACTATAG CCTGGCTTAC ACCAGCTGGA GTCAGTTCCA GCAGCTGAAA GCGACCTCAA CGAGTGGCGA CACGCTGTTC CAGAAACATG AAGGCTTTAA AGATGCTTAC CGCATCGCGT TGGGAACCAC TTATTACTAC GATGATAACT GGACCTTCCG TACCGGTATC GCCTTTGATG ACAGCCCAGT TCCGGCACAG AATCGTTCTA TCTCCATTCC GGACCAGGAC CGTTTCTGGC TGAGTGCAGG TACGACTTAC GCATTTAATA AAGATGCTTC AGTCGATGTT GGTGTTTCTT ATATGCACGG TCAGAGCGTG AAAATTAACG AAGGCCCATA CCAGTTCGAG TCTGAAGGTA AAGCCTGGCT GTTCGGTACT AACTTTAACT ACGCGTTCTG A
|
Protein sequence | MSQKTLFTKS ALAVAVALIS TQAWSAGFQL NEFSSSGLGR AYSGEGAIAD DAGNVSRNPA LITMFDRPTF SAGAVYIDPD VNISGTSPSG RSLKADNIAP TAWVPNMHFV APINDQFGWG ASITSNYGLA TEFNDTYAGG SVGGTTDLET MNLNLSGAYR LNNAWSFGLG FNAVYARAKI ERFAGDLGQL VAGQIMQSPA RQTPQGQALA ATANGIDSNT KIAHLNGNQW GFGWNAGILY ELDKNNRYAL TYRSEVKIDF KGNYSSDLNP AFNNYGLPIP TATGGATQSG YLTLNLPEMW EVSGYNRVDP QWAIHYSLAY TSWSQFQQLK ATSTSGDTLF QKHEGFKDAY RIALGTTYYY DDNWTFRTGI AFDDSPVPAQ NRSISIPDQD RFWLSAGTTY AFNKDASVDV GVSYMHGQSV KINEGPYQFE SEGKAWLFGT NFNYAF
|
| |