Gene EcHS_A1037 details


Gene Information

Locus tag: EcHS_A1037
Symbol: ompF
ID: 5591640
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli HS
Kingdom: Bacteria
Replicon accession: NC_009800
Strand:
Start bp: 1047511
End bp: 1048599
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 47%
IMG OID: 640920204
Product: outer membrane protein F
Protein accession: YP_001457769
Protein GI: 157160451
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3203] Outer membrane protein (porin)
TIGRFAM ID:
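
The coordinate and length fields in this record can be cross-checked with a short calculation. The Python sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the unspliced CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(1047511, 1048599)
print(length, protein_length_aa(length))
```

Under those assumptions the sketch gives 1089 bp and 362 aa, which agrees with the Gene Length and Protein Length fields.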


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.0000488881
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAAGC GCAATATTCT GGCAGTGATC GTCCCTGCTC TGTTAGTAGC AGGTACTGCA 
AACGCTGCAG AAATCTATAA CAAAGATGGC AACAAAGTAG ATCTGTACGG TAAAGCTGTC
GGTCTGCATT ATTTTTCTAA AGACAATGGT GTAAACAGTT ACGGCGGAAA CGGCGACAAA
ACTTATGCCC GTCTTGGTTT TAAAGGGGAA ACACAAATCA ATTCCGATCT GACCGGTTAT
GGTCAGTGGG AATATAACTT CCAGGGTAAC AACTCTGAAG GCGCTGACGC TCAAACTGGT
AACAAAACGC GTCTGGCATT CGCGGGTCTT AAATACGCTG ACATTGGTTC TTTCGATTAC
GGCCGTAACT ACGGTGTGGT TTATGATGCA CTGGGTTACA CCGATATGCT GCCAGAATTT
GGTGGTGATA CTGCATACAG CGATGACTTC TTCGTTGGTC GTGTTGGCGG CGTTGCTACC
TATCGTAACT CCAACTTCTT TGGTCTGGTT GATGGCCTGA ACTTCGCTGT TCAGTACCTG
GGTAAAAACG AGCGTGACAC TGCACGCCGC TCTAACGGCG ACGGTGTTGG CGGTTCTATC
AGCTACGAAT ACGAAGGCTT TGGTATCGTT GGTGCTTATG GTGCAGCTGA CCGTACCAAC
CTGCAAGAAG CTCAACCTCT TGGCAACGGT AAAAAAGCTG AACAGTGGGC TACTGGTCTG
AAGTACGACG CGAACAACAT CTACCTGGCA GCGAACTACG GTGAAACCCG TAACGCTACG
CCGATCACTA ATAAATTTAC AAACACCAGC GGCTTCGCCA ACAAAACGCA AGACGTTCTG
TTAGTTGCGC AATACCAGTT CGATTTCGGT CTGCGTCCGT CCATCGCTTA CACCAAATCT
AAAGCGAAAG ACGTAGAAGG TATCGGTGAT GTTGATCTGG TGAACTACTT TGAAGTGGGC
GCAACCTACT ACTTCAACAA AAACATGTCC ACCTATGTTG ACTACATCAT CAACCAGATC
GATTCTGACA ACAAACTGGG CGTAGGTTCA GACGACACCG TTGCTGTGGG TATCGTTTAC
CAGTTCTAA
 
Protein sequence
MMKRNILAVI VPALLVAGTA NAAEIYNKDG NKVDLYGKAV GLHYFSKDNG VNSYGGNGDK 
TYARLGFKGE TQINSDLTGY GQWEYNFQGN NSEGADAQTG NKTRLAFAGL KYADIGSFDY
GRNYGVVYDA LGYTDMLPEF GGDTAYSDDF FVGRVGGVAT YRNSNFFGLV DGLNFAVQYL
GKNERDTARR SNGDGVGGSI SYEYEGFGIV GAYGAADRTN LQEAQPLGNG KKAEQWATGL
KYDANNIYLA ANYGETRNAT PITNKFTNTS GFANKTQDVL LVAQYQFDFG LRPSIAYTKS
KAKDVEGIGD VDLVNYFEVG ATYYFNKNMS TYVDYIINQI DSDNKLGVGS DDTVAVGIVY
QF
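
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. A minimal Python sketch of that relationship follows. It assumes the codon assignments of the standard genetic code (translation table 11 uses the same assignments and differs in which codons may act as start codons), and `translate`, `BASES`, `AMINO_ACIDS` and `CODON_TABLE` are illustrative names, not part of any database tooling.

```python
from itertools import product

# Codons enumerated in TCAG order, paired with their one-letter amino acids.
# '*' marks a stop codon. Standard-code assignments are assumed here.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGATGAAGC GCAATATTCT GGCAGTGATC"))  # MMKRNILAVI
```

The printed residues match the first ten residues of the protein sequence, and translating the final nine bases (CAGTTCTAA) yields QF followed by a stop, matching its last two residues.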