Gene EcSMS35_0406 details

Gene Information

Locus tag: EcSMS35_0406
Symbol: smbA
ID: 6145306
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 419027
End bp: 420247
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 51%
IMG OID: 641615302
Product: transport protein
Protein accession: YP_001742509
Protein GI: 170681620
COG category: [I] Lipid transport and metabolism
COG ID: [COG1133] ABC-type long-chain fatty acid transport system, fused permease and ATPase components
TIGRFAM ID:
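
The coordinate and length fields in this record agree with each other, and the arithmetic can be sketched as follows. This is a minimal Python sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon. The function names are illustrative and are not part of the source database.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Amino acid count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(419027, 420247)
print(length_bp)                  # 1221
print(protein_length(length_bp))  # 406
```

Both values match the Gene Length and Protein Length fields above.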


Plasmid Coverage information

Num covering plasmid clones: 27
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTTAAGT CTTTTTTCCC AAAGCCGGGA CCGTTTTTTC TCTCGGCCTT TGTTTGGGCA 
TTGATTGCCG TTATCTTCTG GCAAGCCGGT GGGGGGGATT GGGTGGCGCG TATCACCGGA
GCTTCCGGGC AGATCCCGAT TAGCGCCGCG CGTTTCTGGT CGTTGGATTT CTTGATTTTT
TACGCTTACT ACATTGTTTG CGTAGGACTT TTTGCATTGT TCTGGTTTAT CTACAGCCCG
CATCGTTGGC AATACTGGTC AATACTCGGT ACTGCACTGA TCATCTTCGT CACCTGGTTT
TTGGTGGAAG TCGGGGTCGC CGTCAACGCC TGGTATGCGC CGTTCTATGA TCTGATTCAA
ACCGCGCTAA GTTCGCCGCA TAAAGTCACC ATCGAACAAT TTTACCGCGA AGTGGGCGTC
TTTCTGGGGA TTGCGCTGAT CGCGGTGGTG ATCAGTGTGC TGAACAACTT CTTTGTCAGT
CACTACGTGT TCCGCTGGCG TACGGCGATG AACGAATATT ACATGGCGAA CTGGCAACAA
CTGCGTCATA TCGAAGGTGC CGCACAGCGT GTGCAGGAAG ACACCATGCG TTTTGCTTCA
ACGCTGGAGA ATATGGGCGT CAGCTTTATC AACGCTATCA TGACGTTGAT CGCCTTCCTG
CCGGTGCTGG TAACGCTCTC CGCGCATGTG CCGGAGCTGC CGATTGTCGG GCACATTCCG
TATGGTCTGG TGATTGCCGC TATCGTCTGG TCGCTGATGG GGACCGGATT ACTGGCAGTG
GTAGGGATCA AACTGCCGGG GCTGGAGTTT AAAAACCAGC GTGTAGAGGC TGCCTACCGT
AAAGAGCTGG TTTATGGTGA AGACGATGCC ACGCGCGCGA CGCCGCCTAC GGTACGCGAG
CTGTTTAGCG CCGTACGGAA AAACTATTTC CGCCTCTATT TTCACTATAT GTATTTCAAC
ATCGCCCGCA TTCTCTATTT GCAGGTCGAT AACGTTTTCG GTTTGTTCTT GCTGTTTCCG
TCAATTGTTG CCGGTACGAT TACGCTCGGC CTGATGACGC AGATTACCAA CGTTTTTGGT
CAGGTTCGCG GAGCTTTCCA GTACCTGATT AACTCATGGA CCACACTGGT TGAGTTGATG
TCTATCTACA AACGTCTGCG CAGCTTTGAA CATGAGCTGG ATGGTGACAA AATTCAGGAA
GTAACCCATA CCTTGAGCTA A
 
Protein sequence
MFKSFFPKPG PFFLSAFVWA LIAVIFWQAG GGDWVARITG ASGQIPISAA RFWSLDFLIF 
YAYYIVCVGL FALFWFIYSP HRWQYWSILG TALIIFVTWF LVEVGVAVNA WYAPFYDLIQ
TALSSPHKVT IEQFYREVGV FLGIALIAVV ISVLNNFFVS HYVFRWRTAM NEYYMANWQQ
LRHIEGAAQR VQEDTMRFAS TLENMGVSFI NAIMTLIAFL PVLVTLSAHV PELPIVGHIP
YGLVIAAIVW SLMGTGLLAV VGIKLPGLEF KNQRVEAAYR KELVYGEDDA TRATPPTVRE
LFSAVRKNYF RLYFHYMYFN IARILYLQVD NVFGLFLLFP SIVAGTITLG LMTQITNVFG
QVRGAFQYLI NSWTTLVELM SIYKRLRSFE HELDGDKIQE VTHTLS
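
The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing the GC fraction. The following is a minimal Python sketch, assuming the 10-base blocks are separated only by whitespace and that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two tables differ in their permitted start codons). The function names are illustrative and are not part of the source database.

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def clean(seq):
    """Remove the whitespace between sequence blocks and upper-case the result."""
    return "".join(seq.split()).upper()


def gc_content(seq):
    """Fraction of bases that are G or C."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq):
    """Translate in frame from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
prefix = "ATGTTTAAGT CTTTTTTCCC AAAGCCGGGA"
print(translate(prefix))   # MFKSFFPKPG
print(gc_content(prefix))  # 0.4
```

The translated prefix matches the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_content` would allow the same comparison against the complete protein sequence and the listed GC content.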