Gene EcolC_1435 details


Gene Information

Locus tag: EcolC_1435
Symbol:
ID: 6067592
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli ATCC 8739
Kingdom: Bacteria
Replicon accession: NC_010468
Strand:
Start bp: 1583771
End bp: 1584901
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 50%
IMG OID: 641600854
Product: outer membrane porin protein C
Protein accession: YP_001724425
Protein GI: 170019471
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3203] Outer membrane protein (porin)
TIGRFAM ID:
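
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 1583771
end_bp = 1584901
gene_length_bp = 1131
protein_length_aa = 376

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and does not contribute a residue.
codons = gene_length_bp // 3

print(span == gene_length_bp)            # True
print(gene_length_bp % 3 == 0)           # True
print(codons - 1 == protein_length_aa)   # True
```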


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.974185
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.210914
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGTTA AAGTACTGTC CCTCCTGGTC CCAGCTCTGC TGGTAGCAGG CGCAGCAAAC 
GCTGCTGAAG TTTACAACAA AGACGGCAAC AAATTAGATC TGTACGGTAA AGTAGACGGC
CTGCACTATT TCTCTGACAA CAAGTCAGAA GACGGCGACC AGACCTATGT ACGTCTTGGT
TTCAAAGGCG AAACTCAGGT TACTGACCAG CTGACCGGTT ACGGCCAGTG GGAATATCAG
ATCCAGGGCA ATACCTCTGA AGACAACAAA GAAAACTCCT GGACCCGTGT GGCATTCGCA
GGTCTGAAAT TCCAGGATGT GGGTTCTTTC GACTACGGTC GTAACTACGG CGTTGTTTAC
GACGTAACTT CCTGGACCGA CGTACTGCCA GAATTCGGTG GCGACACCTA CGGTTCTGAC
AACTTCATGC AGCAGCGTGG TAACGGCTTC GCGACCTACC GTAACACCGA CTTCTTCGGT
CTGGTTGACG GTCTGAACTT TGCTGTTCAG TACCAGGGCA AAAACGGTAG CGTAAGCGGC
GAAGGCATGA CCAACAATGG TCGTGGTGCT CTGCGTCAGA ATGGCGACGG TGTCGGCGGA
TCTATCACTT ATGATTACGA AGGCTTCGGT ATCGGTGCTG CAGTTTCCAG CTCCAAACGT
ACTGATGATC AAAATGGTAG CTACACCAGC AATGGTGTAG TTCGTAACTA CATCGGTACT
GGCGACCGTG CTGAAACCTA CACTGGTGGT CTGAAATACG ACGCTAACAA CATCTACCTG
GCTGCTCAGT ACACCCAGAC CTACAACGCA ACTCGCGTAG GTTCCCTGGG TTGGGCGAAC
AAAGCACAGA ACTTCGAAGC TGTTGCTCAG TACCAGTTCG ACTTTGGTCT GCGTCCGTCC
CTGGCTTACC TGCAGTCTAA AGGTAAAAAC CTGGGTGTCA TCAATAGTCG TAACTACGAC
GACGAAGATA TCCTGAAATA TGTTGATGTT GGTGCGACCT ACTACTTCAA CAAAAACATG
TCCACCTACG TTGACTACAA AATCAACCTG CTGGACGACA ACCAGTTCAC TCGTGACGCT
GGCATCAACA CTGATAACAT CGTAGCTCTG GGTCTGGTTT ACCAGTTCTA A
 
Protein sequence
MKVKVLSLLV PALLVAGAAN AAEVYNKDGN KLDLYGKVDG LHYFSDNKSE DGDQTYVRLG 
FKGETQVTDQ LTGYGQWEYQ IQGNTSEDNK ENSWTRVAFA GLKFQDVGSF DYGRNYGVVY
DVTSWTDVLP EFGGDTYGSD NFMQQRGNGF ATYRNTDFFG LVDGLNFAVQ YQGKNGSVSG
EGMTNNGRGA LRQNGDGVGG SITYDYEGFG IGAAVSSSKR TDDQNGSYTS NGVVRNYIGT
GDRAETYTGG LKYDANNIYL AAQYTQTYNA TRVGSLGWAN KAQNFEAVAQ YQFDFGLRPS
LAYLQSKGKN LGVINSRNYD DEDILKYVDV GATYYFNKNM STYVDYKINL LDDNQFTRDA
GINTDNIVAL GLVYQF
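
The relationship between the two sequence blocks can be illustrated with a short translation sketch. It is a minimal example, not the database's own method: the helper names `clean` and `translate` are hypothetical, only the first and last lines of the gene sequence are used, and the amino-acid assignments are those of the standard code, which translation table 11 shares for elongation. The last line starts in frame because the 1080 bases before it are a multiple of three.

```python
# Amino-acid assignments in NCBI "TCAG" order (shared by tables 1 and 11).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block):
    """Remove the spaces and line breaks that group residues in tens."""
    return "".join(block.split())


def translate(dna):
    """Translate in frame 1, ignoring any trailing partial codon."""
    dna = clean(dna).upper()
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna) - 2, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAAGTTA AAGTACTGTC CCTCCTGGTC CCAGCTCTGC TGGTAGCAGG CGCAGCAAAC"
last_line = "GGCATCAACA CTGATAACAT CGTAGCTCTG GGTCTGGTTT ACCAGTTCTA A"

print(translate(first_line))  # MKVKVLSLLVPALLVAGAAN
print(translate(last_line))   # GINTDNIVALGLVYQF*
```

The first output matches the opening 20 residues of the protein sequence, and the second matches its final 16 residues followed by the stop codon (shown as `*`).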