Gene EcHS_A1066 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A1066 
SymbolompA 
ID5591810 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp1080627 
End bp1081667 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content54% 
IMG OID640920231 
Productouter membrane protein A 
Protein accessionYP_001457796 
Protein GI157160478 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value6.14063e-21 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA CAGCTATCGC GATTGCAGTG GCACTGGCTG GTTTCGCTAC CGTAGCGCAG 
GCCGCTCCGA AAGATAACAC CTGGTACACT GGTGCTAAAC TGGGCTGGTC CCAGTACCAT
GACACTGGTT TCATCAACAA CAATGGCCCG ACCCATGAAA ACCAACTGGG CGCTGGTGCT
TTTGGTGGTT ACCAGGTTAA CCCGTATGTT GGCTTTGAAA TGGGTTACGA CTGGTTAGGT
CGTATGCCGT ACAAAGGCAG CGTTGAAAAC GGTGCATACA AAGCTCAGGG CGTTCAACTG
ACCGCTAAAC TGGGTTACCC AATCACTGAC GACCTAGACA TCTACACTCG TCTGGGTGGT
ATGGTATGGC GTGCAGACAC TAAATCCAAC GTTTATGGTA AAAACCACGA CACCGGCGTT
TCTCCGGTCT TCGCTGGCGG TGTTGAGTAC GCGATCACTC CTGAAATCGC TACCCGTCTG
GAATACCAGT GGACCAACAA CATCGGTGAC GCACACACCA TCGGCACTCG TCCGGACAAC
GGCATGCTGA GCCTGGGTGT TTCCTACCGT TTCGGTCAGG GCGAAGCAGC TCCAGTAGTT
GCTCCGGCTC CAGCTCCGGC ACCGGAAGTA CAGACCAAGC ACTTCACTCT GAAGTCTGAC
GTTCTGTTCA ACTTCAACAA AGCAACCCTG AAACCGGAAG GTCAGGCTGC TCTGGATCAG
CTGTACAGCC AGCTGAGCAA CCTGGATCCG AAAGACGGTT CCGTAGTTGT TCTGGGTTAC
ACCGACCGCA TCGGTTCTGA CGCTTACAAC CAGGGTCTGT CCGAGCGCCG TGCTCAGTCT
GTTGTTGATT ACCTGATCTC CAAAGGTATC CCGGCAGACA AGATCTCCGC ACGTGGTATG
GGCGAATCCA ACCCGGTTAC TGGCAACACC TGTGACAACG TGAAACAGCG TGCTGCACTG
ATCGACTGCC TGGCTCCGGA TCGTCGCGTA GAGATCGAAG TTAAAGGTAT CAAAGACGTT
GTAACTCAGC CGCAGGCTTA A
 
Protein sequence
MKKTAIAIAV ALAGFATVAQ AAPKDNTWYT GAKLGWSQYH DTGFINNNGP THENQLGAGA 
FGGYQVNPYV GFEMGYDWLG RMPYKGSVEN GAYKAQGVQL TAKLGYPITD DLDIYTRLGG
MVWRADTKSN VYGKNHDTGV SPVFAGGVEY AITPEIATRL EYQWTNNIGD AHTIGTRPDN
GMLSLGVSYR FGQGEAAPVV APAPAPAPEV QTKHFTLKSD VLFNFNKATL KPEGQAALDQ
LYSQLSNLDP KDGSVVVLGY TDRIGSDAYN QGLSERRAQS VVDYLISKGI PADKISARGM
GESNPVTGNT CDNVKQRAAL IDCLAPDRRV EIEVKGIKDV VTQPQA