Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0449 |
Symbol | smbA |
ID | 6968402 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 460003 |
End bp | 461223 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 643384499 |
Product | transport protein |
Protein accession | YP_002269013 |
Protein GI | 209395719 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1133] ABC-type long-chain fatty acid transport system, fused permease and ATPase components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAGT CTTTTTTCCC AAAGCCGGGA ACGTTTTTTC TCTCGGCCTT TGTTTGGGCA TTGATTGCCG TTATCTTCTG GCAAGCCGGT GGGGGGGACT GGGTGGCGCG TATCACCGGC GCTTCCGGGC AGATCCCGAT TAGCGCCGCG CGTTTCTGGT CGTTGGATTT CCTGATTTTT TACGCTTACT ACATTGTTTG CGTAGGACTT TTTGCATTGT TCTGGTTTAT CTACAGCCCG CACCGTTGGC AATACTGGTC AATACTCGGT ACTGCACTGA TCATCTTCGT CACCTGGTTT TTGGTGGAAG TTGGGGTTGC CGTCAACGCC TGGTATGCAC CGTTCTATGA TCTGATTCAA ACCGCGCTAA GTTCGCCGCA TAAAGTCACT ATCGAACAAT TTTACCGCGA AGTGGGCGTC TTTCTGGGGA TTGCGCTGAT CGCGGTAGTG ATCAGTGTGC TGAATAACTT CTTTGTCAGT CACTACGTGT TCCGCTGGCG TACGGCGATG AACGAATATT ACATGGCGAA CTGGCAACAA CTGCGTCATA TCGAAGGCGC CGCACAGCGT GTGCAGGAAG ACACCATGCG TTTTGCTTCA ACGCTGGAGA ATATGGGTGT CAGTTTTATC AACGCCATCA TGACGTTGAT CGCCTTCCTG CCGGTGTTGG TAACGCTCTC CGCGCATGTG CCGGAGCTGC CGATTATCGG GCACATTCCG TATGGTCTGG TGATTGCCGC TATCGTCTGG TCGCTGATGG GGACCGGATT GCTGGCAGTG GTAGGGATCA AACTGCCGGG GCTGGAGTTT AAAAACCAGC GTGTAGAGGC TGCCTACCGT AAAGAGCTGG TTTATGGTGA AGACGATGCC ACGCGCGCGA CGCCGCCAAC GGTACGCGAG CTGTTTAGCG CCGTACGGAA AAACTATTTC CGCCTCTATT TTCACTATAT GTATTTCAAC ATCGCTCGCA TTCTCTATTT GCAGGTCGAT AACGTTTTCG GTTTGTTCTT GCTGTTTCCG TCAATTGTTG CCGGTACGAT TACGCTCGGC CTGATGACGC AGATTACCAA CGTTTTTGGT CAGGTTCGCG GTGCTTTCCA GTACCTGATT AACTCATGGA CCACACTGGT TGAGTTGATG TCTATCTACA AACGTCTGCG CAGCTTTGAA CATGAGCTGG ATGGTGACAA AATTCAGGAA GTAACCCATA CCTTGAGCTA A
|
Protein sequence | MFKSFFPKPG TFFLSAFVWA LIAVIFWQAG GGDWVARITG ASGQIPISAA RFWSLDFLIF YAYYIVCVGL FALFWFIYSP HRWQYWSILG TALIIFVTWF LVEVGVAVNA WYAPFYDLIQ TALSSPHKVT IEQFYREVGV FLGIALIAVV ISVLNNFFVS HYVFRWRTAM NEYYMANWQQ LRHIEGAAQR VQEDTMRFAS TLENMGVSFI NAIMTLIAFL PVLVTLSAHV PELPIIGHIP YGLVIAAIVW SLMGTGLLAV VGIKLPGLEF KNQRVEAAYR KELVYGEDDA TRATPPTVRE LFSAVRKNYF RLYFHYMYFN IARILYLQVD NVFGLFLLFP SIVAGTITLG LMTQITNVFG QVRGAFQYLI NSWTTLVELM SIYKRLRSFE HELDGDKIQE VTHTLS
|
| |