Gene EcE24377A_1071 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcE24377A_1071 
SymbolompA 
ID5589214 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli E24377A 
KingdomBacteria 
Replicon accessionNC_009801 
Strand
Start bp1097662 
End bp1098714 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content53% 
IMG OID640924775 
Productouter membrane protein A 
Protein accessionYP_001462189 
Protein GI157159269 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
[COG3637] Opacity protein and related surface antigens 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000000033965 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAAAGA CAGCTATCGC GATTGCAGTG GCACTGGCTG GTTTCGCTAC CGTAGCGCAG 
GCCGCTCCGA AAGATAACAC CTGGTACACT GGTGCTAAAC TGGGCTGGTC CCAGTACCAT
GACACTGGTT TTATTCCTAA CAATGGTCCG ACCCACGAAA ACCAACTGGG TGCAGGTGCT
TTTGGTGGTT ACCAGGTTAA CCCGTATGTT GGTTTTGAAA TGGGTTACGA CTGGTTAGGT
CGTATGCCGT ACAAAGGCGA CAACATCAAC GGAGCATACA AAGCTCAGGG CGTTCAACTG
ACTGCTAAAC TGGGTTACCC AATCACTGAC GATCTGGACG TATACACTCG TCTGGGTGGT
ATGGTATGGC GTGCAGACAC CAAGGCTAAC GTACCTGGTG GCGCATCCTT TAAAGACCAC
GACACTGGCG TTTCTCCGGT TTTCGCAGGT GGTGTTGAGT ACGCGATCAC TCCTGAAATC
GCTACCCGTC TGGAATACCA GTGGACCAAC AACATCGGTG ACGCAAACAC CATCGGTACT
CGTCCGGACA ACGGTCTGCT GAGCCTGGGT GTTTCCTACC GTTTCGGTCA GGGCGAAGCA
GCTCCGGTAG TTGCTCCGGC TCCAGCTCCG GCACCGGAAG TACAGACCAA GCACTTCACT
CTGAAGTCTG ACGTTCTGTT CAACTTCAAC AAAGCAACCC TGAAACCGGA AGGTCAGGCT
GCTCTGGATC AGCTGTACAG CCAGCTGAGC AACCTGGATC CGAAAGACGG TTCCGTAGTT
GTTCTGGGTT ACACTGACCG CATCGGTTCT GACGCTTACA ACCAGGGTCT GTCCGAGCGC
CGTGCTCAGT CTGTTGTTGA TTACCTGATC TCCAAAGGTA TCCCGGCAGA CAAGATCTCC
GCACGTGGTA TGGGCGAATC CAACCCGGTT ACTGGCAACA CCTGTGACAA CGTGAAACAG
CGTGCTGCAC TGATCGACTG CCTGGCTCCG GATCGTCGCG TAGAGATCGA AGTTAAAGGC
ATCAAAGACG TTGTAACTCA GCCGCAGGCT TAA
 
Protein sequence
MKKTAIAIAV ALAGFATVAQ AAPKDNTWYT GAKLGWSQYH DTGFIPNNGP THENQLGAGA 
FGGYQVNPYV GFEMGYDWLG RMPYKGDNIN GAYKAQGVQL TAKLGYPITD DLDVYTRLGG
MVWRADTKAN VPGGASFKDH DTGVSPVFAG GVEYAITPEI ATRLEYQWTN NIGDANTIGT
RPDNGLLSLG VSYRFGQGEA APVVAPAPAP APEVQTKHFT LKSDVLFNFN KATLKPEGQA
ALDQLYSQLS NLDPKDGSVV VLGYTDRIGS DAYNQGLSER RAQSVVDYLI SKGIPADKIS
ARGMGESNPV TGNTCDNVKQ RAALIDCLAP DRRVEIEVKG IKDVVTQPQA