Gene Information |
Locus tag | EcSMS35_0406 |
Symbol | smbA |
ID | 6145306 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 419027 |
End bp | 420247 |
Gene Length | 1221 bp |
Protein Length | 406 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641615302 |
Product | transport protein |
Protein accession | YP_001742509 |
Protein GI | 170681620 |
COG category | [I] Lipid transport and metabolism |
COG ID | [COG1133] ABC-type long-chain fatty acid transport system, fused permease and ATPase components |
TIGRFAM ID | |
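
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, excluding the terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values taken from the record: Start bp 419027, End bp 420247.
length = gene_length(419027, 420247)
print(length)                  # 1221, matches "Gene Length"
print(protein_length(length))  # 406, matches "Protein Length"
```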
|
|
Plasmid Coverage information |
Num covering plasmid clones | 27 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTTAAGT CTTTTTTCCC AAAGCCGGGA CCGTTTTTTC TCTCGGCCTT TGTTTGGGCA TTGATTGCCG TTATCTTCTG GCAAGCCGGT GGGGGGGATT GGGTGGCGCG TATCACCGGA GCTTCCGGGC AGATCCCGAT TAGCGCCGCG CGTTTCTGGT CGTTGGATTT CTTGATTTTT TACGCTTACT ACATTGTTTG CGTAGGACTT TTTGCATTGT TCTGGTTTAT CTACAGCCCG CATCGTTGGC AATACTGGTC AATACTCGGT ACTGCACTGA TCATCTTCGT CACCTGGTTT TTGGTGGAAG TCGGGGTCGC CGTCAACGCC TGGTATGCGC CGTTCTATGA TCTGATTCAA ACCGCGCTAA GTTCGCCGCA TAAAGTCACC ATCGAACAAT TTTACCGCGA AGTGGGCGTC TTTCTGGGGA TTGCGCTGAT CGCGGTGGTG ATCAGTGTGC TGAACAACTT CTTTGTCAGT CACTACGTGT TCCGCTGGCG TACGGCGATG AACGAATATT ACATGGCGAA CTGGCAACAA CTGCGTCATA TCGAAGGTGC CGCACAGCGT GTGCAGGAAG ACACCATGCG TTTTGCTTCA ACGCTGGAGA ATATGGGCGT CAGCTTTATC AACGCTATCA TGACGTTGAT CGCCTTCCTG CCGGTGCTGG TAACGCTCTC CGCGCATGTG CCGGAGCTGC CGATTGTCGG GCACATTCCG TATGGTCTGG TGATTGCCGC TATCGTCTGG TCGCTGATGG GGACCGGATT ACTGGCAGTG GTAGGGATCA AACTGCCGGG GCTGGAGTTT AAAAACCAGC GTGTAGAGGC TGCCTACCGT AAAGAGCTGG TTTATGGTGA AGACGATGCC ACGCGCGCGA CGCCGCCTAC GGTACGCGAG CTGTTTAGCG CCGTACGGAA AAACTATTTC CGCCTCTATT TTCACTATAT GTATTTCAAC ATCGCCCGCA TTCTCTATTT GCAGGTCGAT AACGTTTTCG GTTTGTTCTT GCTGTTTCCG TCAATTGTTG CCGGTACGAT TACGCTCGGC CTGATGACGC AGATTACCAA CGTTTTTGGT CAGGTTCGCG GAGCTTTCCA GTACCTGATT AACTCATGGA CCACACTGGT TGAGTTGATG TCTATCTACA AACGTCTGCG CAGCTTTGAA CATGAGCTGG ATGGTGACAA AATTCAGGAA GTAACCCATA CCTTGAGCTA A
|
Protein sequence | MFKSFFPKPG PFFLSAFVWA LIAVIFWQAG GGDWVARITG ASGQIPISAA RFWSLDFLIF YAYYIVCVGL FALFWFIYSP HRWQYWSILG TALIIFVTWF LVEVGVAVNA WYAPFYDLIQ TALSSPHKVT IEQFYREVGV FLGIALIAVV ISVLNNFFVS HYVFRWRTAM NEYYMANWQQ LRHIEGAAQR VQEDTMRFAS TLENMGVSFI NAIMTLIAFL PVLVTLSAHV PELPIVGHIP YGLVIAAIVW SLMGTGLLAV VGIKLPGLEF KNQRVEAAYR KELVYGEDDA TRATPPTVRE LFSAVRKNYF RLYFHYMYFN IARILYLQVD NVFGLFLLFP SIVAGTITLG LMTQITNVFG QVRGAFQYLI NSWTTLVELM SIYKRLRSFE HELDGDKIQE VTHTLS
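
The sequences above are printed in space-separated blocks of 10 characters. As a minimal sketch of how the gene sequence relates to the protein sequence, the Python below strips the block spacing, computes GC content, and translates codons until the first stop. It assumes the sense-codon assignments of translation table 11 match the standard genetic code (table 11 differs mainly in its permitted start codons, which does not matter here because this gene starts with ATG). The helper names are hypothetical.

```python
from itertools import product

# Standard codon assignments in TCAG order (assumed to apply to table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the whitespace between 10-character blocks and uppercase."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
first_blocks = "ATGTTTAAGT CTTTTTTCCC AAAGCCGGGA"
print(translate(first_blocks))  # MFKSFFPKPG, the first block of the protein
```

Passing the full gene sequence to `translate` and `gc_percent` should allow the Protein sequence and GC content fields to be compared against the record in the same way.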