Gene EcolC_3340 details

Gene Information

Locus tag: EcolC_3340
Symbol:
ID: 6067357
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 3665103
End bp: 3666158
Gene Length: 1056 bp
Protein Length: 351 aa
Translation table: 11
GC content: 45%
IMG OID: 641602756
Product: outer membrane phosphoporin protein E
Protein accession: YP_001726288
Protein GI: 170021334
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3203] Outer membrane protein (porin)
TIGRFAM ID:
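
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a single stop codon. The variable names are hypothetical and not part of the record.

```python
# Values copied from the gene record above.
start_bp = 3665103
end_bp = 3666158
gene_length_bp = 1056
protein_length_aa = 351

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: one codon per amino acid plus one terminal stop codon.
expected_gene_length = (protein_length_aa + 1) * 3

print(span_bp == gene_length_bp)               # coordinates agree with Gene Length
print(expected_gene_length == gene_length_bp)  # Protein Length agrees with Gene Length
```

Both comparisons hold for this record: the span is 1056 bp, which is 352 codons (351 residues plus the stop).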


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAGA GCACTCTGGC ATTAGTGGTG ATGGGCATTG TGGCATCTGC ATCCGTACAG 
GCCGCAGAAA TATATAACAA AGACGGTAAT AAACTGGATG TCTATGGCAA AGTTAAAGCC
ATGCATTATA TGAGTGATAA CGACAGTAAA GATGGCGACC AGAGTTATAT CCGTTTTGGT
TTTAAAGGCG AAACACAAAT TAACGATCAA CTGACTGGTT ATGGTCGTTG GGAAGCAGAG
TTTGCCGGTA ATAAAGCAGA GAGTGATACT GCACAGCAAA AAACGCGTCT CGCTTTTGCC
GGGTTGAAAT ATAAAGATTT GGGTTCTTTC GATTATGGTC GTAACCTGGG CGCGTTGTAT
GACGTGGAAG CCTGGACCGA TATGTTCCCG GAATTTGGTG GCGACTCCTC GGCGCAGACC
GACAACTTTA TGACCAAACG CGCCAGCGGT CTGGCGACGT ATCGGAACAC CGACTTCTTC
GGCGTTATCG ATGGCCTGAA CTTAACCCTG CAATATCAAG GGAAAAACGA AAACCGCGAC
GTTAAAAAGC AAAACGGCGA TGGCTTCGGC ACGTCATTGA TATATGACTT TGGCGGCAGC
GATTTCGCCA TTAGTGGGGC CTATACCAAC TCAGATCGCA CCAACGAGCA GAACCTGCAA
AGCCGTGGCA CAGGCAAGCG AGCAGAAGCA TGGGCAACAG GTCTGAAATA CGATGCCAAT
AATATTTATC TGGCAACTTT TTATTCTGAA ACACGCAAAA TGACGCCAAT AACTGGCGGC
TTTGCCAATA AGACACAGAA CTTTGAAGCG GTCGCTCAAT ACCAGTTTGA CTTTGGTCTG
CGTCCATCGC TGGGTTATGT CTTATCGAAA GGGAAAGATA TTGAAGGTAT CGGTGATGAA
GATCTGGTCA ATTATATCGA TGTCGGTGCT ACATATTATT TCAACAAAAA TATGTCAGCG
TTTGTAGATT ATAAAATCAA CCAACTGGAT AGCGATAACA AATTGAATAT TAATAATGAT
GATATTGTCG CGGTTGGCAT GACCTATCAG TTTTAA
 
Protein sequence
MKKSTLALVV MGIVASASVQ AAEIYNKDGN KLDVYGKVKA MHYMSDNDSK DGDQSYIRFG 
FKGETQINDQ LTGYGRWEAE FAGNKAESDT AQQKTRLAFA GLKYKDLGSF DYGRNLGALY
DVEAWTDMFP EFGGDSSAQT DNFMTKRASG LATYRNTDFF GVIDGLNLTL QYQGKNENRD
VKKQNGDGFG TSLIYDFGGS DFAISGAYTN SDRTNEQNLQ SRGTGKRAEA WATGLKYDAN
NIYLATFYSE TRKMTPITGG FANKTQNFEA VAQYQFDFGL RPSLGYVLSK GKDIEGIGDE
DLVNYIDVGA TYYFNKNMSA FVDYKINQLD SDNKLNINND DIVAVGMTYQ F
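
The gene and protein sequences above are printed in space-separated blocks of ten. As a minimal sketch of how they relate, the pure-Python code below strips that grouping, computes GC content, and translates DNA to protein. The helper names (`clean`, `gc_percent`, `translate`) are hypothetical. The codon table is the standard amino-acid assignment, which translation table 11 shares for the codons used here; table 11 differs from the standard code mainly in its permitted start codons, which this sketch does not model.

```python
from itertools import product

def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(seq.split()).upper()

def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)

# Codon table built in TCAG order; amino-acid string as in the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence above (60 bp, i.e. 20 codons).
first_line = "ATGAAAAAGA GCACTCTGGC ATTAGTGGTG ATGGGCATTG TGGCATCTGC ATCCGTACAG"
print(translate(first_line))
print(round(gc_percent(first_line), 1))
```

Translating the first 60 bp gives `MKKSTLALVVMGIVASASVQ`, which matches the first 20 residues of the protein sequence. Pasting the full gene sequence into `translate` and `gc_percent` is a way to compare against the Protein Length and GC content fields.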