Gene EcolC_2667 details

Gene Information

Locus tag: EcolC_2667
Symbol:
ID: 6067636
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 2923376
End bp: 2924464
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 47%
IMG OID: 641602073
Product: outer membrane protein F
Protein accession: YP_001725623
Protein GI: 170020669
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3203] Outer membrane protein (porin)
TIGRFAM ID:
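
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2923376
end_bp = 2924464

# Assumption: coordinates are 1-based and inclusive, so the span
# covers (end - start + 1) bases.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1089, matching "Gene Length"
print(protein_length)  # 362, matching "Protein Length"
```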


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000513703
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.141925
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATGAAGC GCAATATTCT GGCAGTGATC GTCCCTGCTC TGTTAGTAGC AGGTACTGCA 
AACGCTGCAG AAATCTATAA CAAAGATGGC AACAAAGTAG ATCTGTACGG TAAAGCTGTC
GGTCTGCATT ATTTTTCTAA AGACAATGGT GTAAACAGTT ACGGCGGAAA CGGCGACAAA
ACTTATGCCC GTCTTGGTTT TAAAGGGGAA ACACAAATCA ATTCCGATCT GACCGGTTAT
GGTCAGTGGG AATATAACTT CCAGGGTAAC AACTCTGAAG GCGCTGACGC TCAAACTGGT
AACAAAACGC GTCTGGCATT CGCGGGTCTT AAATACGCTG ACATTGGTTC TTTCGATTAC
GGCCGTAACT ACGGTGTGGT TTATGATGCA CTGGGTTACA CCGATATGCT GCCAGAATTT
GGTGGTGATA CTGCATACAG CGATGACTTC TTCGTTGGTC GTGTTGGCGG CGTTGCTACC
TATCGTAACT CCAACTTCTT TGGTCTGGTT GATGGCCTGA ACTTCGCTGT TCAGTACCTG
GGTAAAAACG AGCGTGACAC TGCACGCCGC TCTAACGGCG ACGGTGTTGG CGGTTCTATC
AGCTACGAAT ACGAAGGCTT TGGTATCGTT GGTGCTTATG GTGCAGCTGA CCGTACCAAC
CTGCAAGAAG CTCAACCTCT TGGCAACGGT AAAAAAGCTG AACAGTGGGC TACTGGTCTG
AAGTACGACG CGAACAACAT CTACCTGGCA GCGAACTACG GTGAAACCCG TAACGCTACG
CCGATCACTA ATAAATTTAC AAACACCAGC GGCTTCGCCA ACAAAACGCA AGACGTTCTG
TTAGTTGCGC AATACCAGTT CGATTTCGGT CTGCGTCCGT CCATCGCTTA CACCAAATCT
AAAGCGAAAG ACGTAGAAGG TATCGGTGAT GTTGATCTGG TGAACTACTT TGAAGTGGGC
GCAACCTACT ACTTCAACAA AAACATGTCC ACCTATGTTG ACTACATCAT CAACCAGATC
GATTCTGACA ACAAACTGGG CGTAGGTTCA GACGACACCG TTGCTGTGGG TATCGTTTAC
CAGTTCTAA
 
Protein sequence
MMKRNILAVI VPALLVAGTA NAAEIYNKDG NKVDLYGKAV GLHYFSKDNG VNSYGGNGDK 
TYARLGFKGE TQINSDLTGY GQWEYNFQGN NSEGADAQTG NKTRLAFAGL KYADIGSFDY
GRNYGVVYDA LGYTDMLPEF GGDTAYSDDF FVGRVGGVAT YRNSNFFGLV DGLNFAVQYL
GKNERDTARR SNGDGVGGSI SYEYEGFGIV GAYGAADRTN LQEAQPLGNG KKAEQWATGL
KYDANNIYLA ANYGETRNAT PITNKFTNTS GFANKTQDVL LVAQYQFDFG LRPSIAYTKS
KAKDVEGIGD VDLVNYFEVG ATYYFNKNMS TYVDYIINQI DSDNKLGVGS DDTVAVGIVY
QF
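
The relationship between the gene sequence and the protein sequence, and the GC content field, can be illustrated with a short sketch. It assumes the standard codon assignments, which translation table 11 shares for elongation (table 11 differs mainly in permitting alternative start codons, which does not matter here because the gene begins with ATG). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, and only the first 30 and last 9 bases of the gene sequence are used.

```python
# Build the standard codon table in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces and line breaks used in the listing; uppercase."""
    return "".join(seq.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq):
    """Translate codon by codon from position 0, stopping at a stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence listed above.
first_30 = "ATGATGAAGC GCAATATTCT GGCAGTGATC"
last_9 = "CAGTTCTAA"

print(translate(first_30))  # MMKRNILAVI
print(translate(last_9))    # QF
```

Passing the full gene sequence to `gc_fraction` gives the value to compare against the GC content field.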