Gene EcSMS35_2008 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2008 
SymbollolE 
ID6146219 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2027946 
End bp2029190 
Gene Length1245 bp 
Protein Length414 aa 
Translation table11 
GC content52% 
IMG OID641616884 
Productouter membrane-specific lipoprotein transporter subunit LolE 
Protein accessionYP_001744060 
Protein GI170682380 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG4591] ABC-type transport system, involved in lipoprotein release, permease component 
TIGRFAM ID[TIGR02212] lipoprotein releasing system, transmembrane protein, LolC/E family
[TIGR02213] lipoprotein releasing system, transmembrane protein LolE 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones37 
Fosmid unclonability p-value0.0630458 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCGATGC CTTTATCGTT ATTAATTGGC CTGCGTTTTA GTCGCGGACG GCGGCGCGGC 
GGCATGGTGT CGCTGATCTC CGTCATTTCT ACCATCGGCA TTGCCCTCGG CGTGGCGGTA
TTGATCGTCG GCTTAAGCGC GATGAACGGC TTTGAACGCG AACTGAATAA CCGCATTCTG
GCGGTGGTGC CGCATGGTGA AATCGAAGCG GTGGATCAGC CGTGGACTAA CTGGCAGGAA
GCACTGGATA ACGTGCAGAA AGTGCCGGGT ATTGCCGCCG CTGCGCCGTA TATCAATTTC
ACCGGGCTGG TGGAAAGTGG CGCGAATCTG CGCGCAATCC AGGTGAAGGG CGTTAACCCG
CAACAGGAAC AGCGTCTGAG CGCATTACCC TCGTTTGTTC AGGGGGATGC CTGGCGCAAT
TTTAAAGCGG GCGAACAGCA AATTATCATC GGCAAAGGCG TGGCGGATGC GCTGAAAGTG
AAGCAGGGCG ATTGGGTGTC GATTATGATC CCCAACTCGA ATCCTGAGCA TAAACTGATG
CAGCCAAAAC GTGTGCGTTT GCACGTTGCC GGTATTTTGC AGTTGAGTGG TCAACTCGAT
CACAGTTTTG CCATGATCCC GCTGGCGGAT GCCCAACAAT ATCTTGATAT GGGTTCCAGC
GTGTCAGGTA TTGCCCTTAA AATGACGGAT GTTTTCAACG CCAATAAGCT GGTACGCGAT
GCGGGTGAAG TGACCAACAG CTATGTTTAT ATTAAAAGCT GGATTGGTAC TTACGGCTAT
ATGTATCGCG ATATCCAGAT GATCCGCGCC ATTATGTATC TGGCGATGGT ACTGGTGATT
GGCGTGGCCT GTTTCAACAT CGTCTCCACC TTAGTGATGG CGGTGAAAGA CAAGAGTGGC
GATATCGCAG TATTAAGAAC GCTGGGGGCG AAAGATGGTT TAATTCGCGC CATCTTTGTC
TGGTATGGAT TGCTGGCAGG GCTGTTCGGC AGCCTGTGTG GGGTGATTAT CGGCGTAGTG
GTTTCACTGC AACTTACCCC GATTATTGAG TGGATTGAAA AGCTGATCGG TCATCAGTTC
CTCTCCAGCG ATATCTATTT TATTGACTTC TTGCCATCGG AATTGCACTG GCTGGACGTC
TTCTACGTAC TGGTCACAGC ATTGTTGCTG AGTCTTTTGG CAAGTTGGTA TCCGGCGCGG
CGCGCCAGTA ATATTGACCC TGCGCGAGTC CTTAGCGGCC AGTAA
 
Protein sequence
MAMPLSLLIG LRFSRGRRRG GMVSLISVIS TIGIALGVAV LIVGLSAMNG FERELNNRIL 
AVVPHGEIEA VDQPWTNWQE ALDNVQKVPG IAAAAPYINF TGLVESGANL RAIQVKGVNP
QQEQRLSALP SFVQGDAWRN FKAGEQQIII GKGVADALKV KQGDWVSIMI PNSNPEHKLM
QPKRVRLHVA GILQLSGQLD HSFAMIPLAD AQQYLDMGSS VSGIALKMTD VFNANKLVRD
AGEVTNSYVY IKSWIGTYGY MYRDIQMIRA IMYLAMVLVI GVACFNIVST LVMAVKDKSG
DIAVLRTLGA KDGLIRAIFV WYGLLAGLFG SLCGVIIGVV VSLQLTPIIE WIEKLIGHQF
LSSDIYFIDF LPSELHWLDV FYVLVTALLL SLLASWYPAR RASNIDPARV LSGQ