Gene EcSMS35_2163 details

Gene Information

Locus tag: EcSMS35_2163
Symbol: ompA
ID: 6146246
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2167505
End bp: 2168581
Gene Length: 1077 bp
Protein Length: 358 aa
Translation table: 11
GC content: 53%
IMG OID: 641617039
Product: outer membrane protein A
Protein accession: YP_001744213
Protein GI: 170681860
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins; [COG3637] Opacity protein and related surface antigens
TIGRFAM ID:
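
The start and end coordinates, gene length, and protein length in this record can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which matches the stated length) and that the final codon of the CDS is a stop codon. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(2167505, 2168581)
print(length_bp, protein_length(length_bp))  # 1077 358
```

Both values agree with the Gene Length and Protein Length fields above.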


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000195857
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 41
Fosmid unclonability p-value: 0.269234
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGGATGATA ACGAGGCGCA AAAAATGAAA AAGACAGCTA TCGCGATTGC AGTGGCACTG 
GCTGGTTTCG CTACCGTAGC GCAGGCCGCT CCGAAAGATA ACACCTGGTA CACTGGTGCT
AAACTGGGCT GGTCCCAGTA CCATGACACT GGTTTTATTC CTAACAATGG TCCGACCCAC
GAAAACCAAC TGGGTGCAGG TGCTTTTGGT GGTTACCAGG TTAACCCGTA TGTTGGCTTT
GAAATGGGTT ACGACTGGTT AGGTCGTATG CCGTACAAAG GCGACAACAT CAACGGCGCA
TACAAAGCTC AGGGCGTTCA GCTGACCGCT AAACTGGGTT ACCCAATCAC TGACGATCTG
GACGTTTACA CTCGTCTGGG TGGTATGGTA TGGCGTGCAG ACACCAAGTC TAACGTACCT
GGTGGCGCAT CCACTAAAGA CCACGACACC GGCGTTTCTC CGGTCTTCGC TGGCGGTGTT
GAGTACGCGA TCACTCCTGA AATCGCTACC CGTCTGGAAT ACCAGTGGAC CAACAACATC
GGTGACGCAC ACACCATCGG TACTCGTCCG GACAACGGCA TGCTGAGCCT GGGTGTTTCC
TACCGTTTCG GTCAGGGCGA AGCAGCTCCA GTAGTTGCTC CGGCTCCAGC TCCGGCACCG
GAAGTACAGA CCAAGCACTT CACTCTGAAG TCTGACGTTC TGTTCAACTT CAACAAAGCA
ACCCTGAAAC CGGAAGGTCA GGCTGCTCTG GATCAGCTGT ACAGCCAGCT GAGCAACCTG
GATCCGAAAG ACGGTTCCGT AGTTGTTCTG GGTTACACCG ACCGCATCGG TTCTGACGCT
TACAACCAGG CTCTGTCCGA GCGTCGTGCT CAGTCTGTTG TTGATTACCT GATCTCTAAA
GGTATCCCGG CAGACAAAAT CTCCGCACGT GGTATGGGCG AATCCAACCC GGTTACTGGC
AACACCTGTG ACAACGTGAA ACAGCGTGCT GCACTGATCG ACTGCCTGGC TCCGGATCGT
CGCGTAGAGA TCGAAGTTAA AGGTATCAAA GACGTTGTAA CTCAGCCGCA GGCTTAA
 
Protein sequence
MDDNEAQKMK KTAIAIAVAL AGFATVAQAA PKDNTWYTGA KLGWSQYHDT GFIPNNGPTH 
ENQLGAGAFG GYQVNPYVGF EMGYDWLGRM PYKGDNINGA YKAQGVQLTA KLGYPITDDL
DVYTRLGGMV WRADTKSNVP GGASTKDHDT GVSPVFAGGV EYAITPEIAT RLEYQWTNNI
GDAHTIGTRP DNGMLSLGVS YRFGQGEAAP VVAPAPAPAP EVQTKHFTLK SDVLFNFNKA
TLKPEGQAAL DQLYSQLSNL DPKDGSVVVL GYTDRIGSDA YNQALSERRA QSVVDYLISK
GIPADKISAR GMGESNPVTG NTCDNVKQRA ALIDCLAPDR RVEIEVKGIK DVVTQPQA
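
The gene sequence begins with TTG while the protein begins with M. This is consistent with translation table 11, in which TTG is one of several alternative initiation codons and the first residue is methionine. The sketch below illustrates this with a hand-built codon table. The helper name `translate_cds` is hypothetical, and the code assumes an unspliced CDS given in the coding orientation.

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# amino-acid assignments as the standard code but more start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Initiation codons listed for NCBI translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for index in range(0, len(seq), 3):
        codon = seq[index:index + 3]
        if index == 0 and codon in START_CODONS:
            residues.append("M")  # any valid start codon is read as Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First 33 bases of the gene sequence above, in the page's 10-base grouping.
print(translate_cds("TTGGATGATA ACGAGGCGCA AAAAATGAAA AAG"))  # MDDNEAQKMKK
```

The output matches the first eleven residues of the protein sequence listed above. Passing the full gene sequence to the same function should give the complete protein if the record is internally consistent.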