Gene SbBS512_E2359 details


Gene Information

Locus tag: SbBS512_E2359
Symbol: ompA
ID: 6269493
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Shigella boydii CDC 3083-94
Kingdom: Bacteria
Replicon accession: NC_010658
Strand:
Start bp: 2147054
End bp: 2148094
Gene Length: 1041 bp
Protein Length: 346 aa
Translation table: 11
GC content: 54%
IMG OID: 641726363
Product: outer membrane protein A
Protein accession: YP_001880845
Protein GI: 187730326
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
TIGRFAM ID:
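
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS that ends in one stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(2147054, 2148094)
print(length)                     # 1041
print(protein_length_aa(length))  # 346
```

Both results agree with the Gene Length and Protein Length fields listed above.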


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000000000044176
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAGA CAGCTATCGC GATTGCAGTG GCACTGGCTG GTTTCGCTAC CGTAGCGCAG 
GCCGCTCCGA AAGATAACAC CTGGTACACT GGTGCTAAAC TGGGCTGGTC CCAGTACCAT
GACACTGGTT TCATCAACAA CAATGGCCCG ACCCATGAAA ACCAACTGGG CGCTGGTGCT
TTTGGTGGTT ACCAGGTTAA CCCGTATGTT GGCTTTGAAA TGGGTTACGA CTGGTTAGGT
CGTATGCCGT ACAAAGGCAG CGTTGAAAAC GGTGCATACA AAGCTCAGGG CGTTCAACTG
ACCGCTAAAC TGGGTTACCC AATCACTGAC GACCTGGACA TCTACACTCG TCTGGGTGGT
ATGGTATGGC GTGCAGACAC TAAATCCAAC GTTTATGGTA AAAACCACGA CACCGGCGTT
TCTCCGGTCT TCGCTGGCGG TGTTGAGTAC GCGATCACTC CTGAAATCGC TACCCGTCTG
GAATACCAGT GGACCAACAA CATCGGTGAC GCACACACCA TCGGCACTCG TCCGGACAAC
GGCATGCTGA GCCTGGGTGT TTCCTACCGT TTCGGTCAGG GCGAAGCAGC TCCAGTAGTT
GCTCCGGCTC CAGCTCCGGC ACCGGAAGTA CAGACCAAGC ACTTCACTCT GAAGTCTGAC
GTTCTGTTCA ACTTCAACAA AGCAACCCTG AAACCGGAAG GTCAGGCTGC TCTGGATCAG
CTGTACAGCC AGCTGAGCAA CTTGGATCCG AAAGACGGTT CCGTAGTTGT TCTGGGTTAC
ACCGACCGCA TCGGTTCTGA CGCTTACAAC CAGGGTCTGT CCGAGCGCCG TGCTCAGTCT
GTTGTTGATT ACCTGATCTC CAAAGGTATC CCGGCAGACA AGATCTCCGC ACGTGGTATG
GGCGAATCCA ACCCGGTTAC TGGCAACACC TGTGACAACG TGAAACAGCG TGCTGCACTG
ATCGACTGCC TGGCTCCGGA TCGTCGCGTA GAGATCGAAG TTAAAGGTAT CAAAGACGTT
GTAACTCAGC CGCAGGCTTA A
 
Protein sequence
MKKTAIAIAV ALAGFATVAQ AAPKDNTWYT GAKLGWSQYH DTGFINNNGP THENQLGAGA 
FGGYQVNPYV GFEMGYDWLG RMPYKGSVEN GAYKAQGVQL TAKLGYPITD DLDIYTRLGG
MVWRADTKSN VYGKNHDTGV SPVFAGGVEY AITPEIATRL EYQWTNNIGD AHTIGTRPDN
GMLSLGVSYR FGQGEAAPVV APAPAPAPEV QTKHFTLKSD VLFNFNKATL KPEGQAALDQ
LYSQLSNLDP KDGSVVVLGY TDRIGSDAYN QGLSERRAQS VVDYLISKGI PADKISARGM
GESNPVTGNT CDNVKQRAAL IDCLAPDRRV EIEVKGIKDV VTQPQA
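
The relationship between the gene sequence and the protein sequence above can be reproduced with a short script. This is a sketch under stated assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares (table 11 differs from the standard code in the start codons it permits, not in how codons are decoded). The helper names `translate` and `gc_fraction` are hypothetical, and only the first and last blocks of the gene sequence are used here for brevity.

```python
from itertools import product

# Codon table built in TCAG order; these assignments are shared by
# the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three and last three blocks of the gene sequence listed above.
print(translate("ATGAAAAAGA CAGCTATCGC GATTGCAGTG"))  # MKKTAIAIAV
print(translate("GTAACTCAGC CGCAGGCTTA A"))           # VTQPQA
```

Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field of the record.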