Gene SNSL254_A1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSNSL254_A1111 
SymbolompA 
ID6482948 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Newport str. SL254 
KingdomBacteria 
Replicon accessionNC_011080 
Strand
Start bp1121437 
End bp1122513 
Gene Length1077 bp 
Protein Length358 aa 
Translation table11 
GC content54% 
IMG OID642736514 
Productouter membrane protein A 
Protein accessionYP_002040273 
Protein GI194443312 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
[COG3637] Opacity protein and related surface antigens 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000109909 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones45 
Fosmid unclonability p-value0.00334653 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGGATGATA ACGAGGCGCA AAAAATGAAA AAGACAGCTA TCGCGATTGC AGTGGCACTG 
GCTGGTTTCG CTACCGTAGC GCAGGCCGCT CCGAAAGATA ACACCTGGTA CGCTGGTGCT
AAACTGGGCT GGTCTCAGTA CCATGACACC GGCTTCATTA ACAATGATGG CCCAACTCAT
GAAAACCAAC TGGGCGCAGG TGCTTTTGGT GGTTACCAGG TTAACCCGTA CGTTGGCTTT
GAAATGGGCT ACGACTGGTT AGGCCGTATG CCGTACAAAG GCGACAACAT CAATGGCGCT
TATAAAGCTC AGGGCGTTCA GTTGACCGCT AAACTGGGTT ATCCAATCAC TGACGATCTG
GACGTTTATA CCCGTCTGGG TGGTATGGTA TGGCGTGCAG ACACCAAGTC TAACGTCCCT
GGCGGCCCGT CTACTAAAGA CCACGACACC GGCGTTTCCC CGGTATTCGC GGGCGGTATC
GAGTATGCTA TCACCCCTGA AATCGCAACC CGTCTGGAAT ACCAGTGGAC TAACAACATC
GGTGATGCCA ACACCATCGG CACCCGTCCG GACAACGGCC TGCTGAGCGT AGGTGTTTCC
TACCGTTTCG GCCAGCAAGA AGCTGCTCCG GTAGTAGCTC CGGCACCGGC ACCGGCTCCG
GAAGTACAGA CCAAGCACTT CACTCTGAAG TCTGACGTAC TGTTCAACTT CAACAAATCT
ACCCTGAAGC CGGAAGGCCA GCAGGCTCTG GATCAGCTGT ACAGCCAGCT GAGCAACCTG
GATCCGAAAG ACGGTTCCGT TGTCGTTCTG GGCTTCACTG ACCGTATCGG TTCTGACGCT
TACAACCAGG GTCTGTCCGA GAAACGTGCT CAGTCTGTTG TTGATTACCT GATCTCCAAA
GGTATTCCGT CTGACAAAAT CTCCGCACGT GGTATGGGCG AATCTAACCC GGTTACCGGC
AACACCTGTG ACAACGTGAA ACCTCGCGCT GCCCTGATCG ATTGCCTGGC TCCGGATCGT
CGCGTAGAGA TCGAAGTTAA AGGCGTTAAA GACGTGGTAA CTCAGCCGCA GGCTTAA
 
Protein sequence
MDDNEAQKMK KTAIAIAVAL AGFATVAQAA PKDNTWYAGA KLGWSQYHDT GFINNDGPTH 
ENQLGAGAFG GYQVNPYVGF EMGYDWLGRM PYKGDNINGA YKAQGVQLTA KLGYPITDDL
DVYTRLGGMV WRADTKSNVP GGPSTKDHDT GVSPVFAGGI EYAITPEIAT RLEYQWTNNI
GDANTIGTRP DNGLLSVGVS YRFGQQEAAP VVAPAPAPAP EVQTKHFTLK SDVLFNFNKS
TLKPEGQQAL DQLYSQLSNL DPKDGSVVVL GFTDRIGSDA YNQGLSEKRA QSVVDYLISK
GIPSDKISAR GMGESNPVTG NTCDNVKPRA ALIDCLAPDR RVEIEVKGVK DVVTQPQA