Gene EcHS_A2354 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcHS_A2354 
SymbolompC 
ID5593770 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli HS 
KingdomBacteria 
Replicon accessionNC_009800 
Strand
Start bp2352627 
End bp2353733 
Gene Length1107 bp 
Protein Length368 aa 
Translation table11 
GC content51% 
IMG OID640921480 
Productouter membrane porin protein C 
Protein accessionYP_001459015 
Protein GI157161697 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value0.00268594 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAGTTA AAGTACTGTC CCTCCTGGTC CCAGCTCTGC TGGTAGCAGG CGCAGCAAAC 
GCTGCTGAAG TTTACAACAA AGACGGCAAC AAATTAGATC TGTACGGTAA AGTAGACGGC
CTGCACTATT TCTCTGACAA CAAGTCAGAA GACGGCGACC AGACCTATGT ACGTCTTGGT
TTCAAAGGCG AAACTCAGGT TACTGACCAG CTGACCGGTT ACGGCCAGTG GGAATATCAG
ATCCAGGGCA ATACCTCTGA AGACAACAAA GAAAACTCCT GGACCCGTGT GGCATTCGCA
GGTCTGAAAT TCCAGGATGT AGGTTCTTTC GACTACGGTC GTAACTACGG CGTTGTTTAC
GACGTAACTT CCTGGACCGA CGTACTGCCA GAATTCGGTG GCGACACCTA CGGTTCTGAC
AACTTCATGC AGCAGCGTGG TAACGGCTTC GCGACCTACC GTAACACCGA CTTCTTCGGT
CTGGTTGACG GTCTGAACTT TGCTGTTCAG TACCAGGGCA AAAACGGCAG CGTAAGCGGC
GAAGGCATGA CCAACAACGG TCGTGGCGCT CTGCGTCAGA ACGGCGACGG TGTCGGCGGT
TCTATCACTT ATGATTACGA AGGCTTCGGT ATCGGTGCTG CAGTTTCCAG CTCCAAACGT
ACTGATGCTC AGAACACCGC TGCTTACATC GGTAACGGCG ACCGTGCTGA AACCTACACT
GGTGGTCTGA AATACGACGC TAACAACATC TACCTGGCTG CTCAGTACAC CCAGACCTAC
AACGCAACTC GCGTAGGTTC CCTGGGTTGG GCGAACAAAG CACAGAACTT CGAAGCTGTT
GCTCAGTACC AGTTCGACTT CGGTCTGCGT CCGTCTGTAG CATACCTGCA GTCTAAAGGT
AAAAACCTGG GTGTCGTTGC TGGTCGTAAC TACGACGACG AAGATATCCT GAAATATGTT
GATGTTGGTG CGACCTACTA CTTCAACAAA AACATGTCCA CCTACGTTGA CTACAAAATC
AACCTGCTGG ACGACAACCA GTTCACTCGT GACGCTGGCA TCAACACTGA TAACATCGTA
GCTCTGGGTC TGGTTTACCA GTTCTAA
 
Protein sequence
MKVKVLSLLV PALLVAGAAN AAEVYNKDGN KLDLYGKVDG LHYFSDNKSE DGDQTYVRLG 
FKGETQVTDQ LTGYGQWEYQ IQGNTSEDNK ENSWTRVAFA GLKFQDVGSF DYGRNYGVVY
DVTSWTDVLP EFGGDTYGSD NFMQQRGNGF ATYRNTDFFG LVDGLNFAVQ YQGKNGSVSG
EGMTNNGRGA LRQNGDGVGG SITYDYEGFG IGAAVSSSKR TDAQNTAAYI GNGDRAETYT
GGLKYDANNI YLAAQYTQTY NATRVGSLGW ANKAQNFEAV AQYQFDFGLR PSVAYLQSKG
KNLGVVAGRN YDDEDILKYV DVGATYYFNK NMSTYVDYKI NLLDDNQFTR DAGINTDNIV
ALGLVYQF