Gene EcSMS35_1221 details

Gene Information

Locus tag: EcSMS35_1221
Symbol:
ID: 6143519
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1224155
End bp: 1225318
Gene Length: 1164 bp
Protein Length: 387 aa
Translation table: 11
GC content: 43%
IMG OID: 641616099
Product: porin family protein
Protein accession: YP_001743282
Protein GI: 170682358
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3203] Outer membrane protein (porin)
TIGRFAM ID: [TIGR03304] outer membrane insertion C-terminal signal
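
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical, chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(1224155, 1225318)
print(length)                     # 1164
print(protein_length_aa(length))  # 387
```

Under these assumptions both values agree with the Gene Length and Protein Length fields of the record.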


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.263748
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGAAAAGAA AAGTTCTGGC AATGCTGGTC CCGGCGTTAT TAGTTGCTGG CGCAGCAAAT 
GCGGCTGAAA TTTATAATAA AAATGGCAAT AAACTGGATC TGTACGGAAA AGTTGATGCT
CGTCATACTT TTTCCGATAA CCCTGGCGAT GACGGCGATG AAACTATCAT CGAGTTAGGC
TTCAAAGGTG AGACCCAAAT TACAGATCAG TTAACTGGTT ATGGTCAAGC ACTGACCAAA
ACAAAGGCTA GCGACACTGA AGGTTCTGAT AATACTTATG TCAAACTTGC TTTTGCAGGT
CTGAAGTTCG GCGAGATGGG CTCCTTTGAC TATGGTCGTA ACTATGGTGT GATTTACGAC
GTTGAAGCCT GGACAGATAT GCTTCCTGTA TTCGGCGGCG ACTCCTATAC CTGGACTGAT
AACTTCATGG CAGGCCGTGC AAATGGGGTA GCTACTTACC GCAATAGTGA TTTCTTCGGT
CTGGTGGAAG GTCTGAACTT TGCATTGCAG TATCAAGGTA ACAACGAAGG CAGCAATGCA
GGAGAAGATC AAGAAGGTAC TAAAAATGGT CACGAAGACG TTCGCTTCCA GAACGGTGAT
GGTTTTGGTC TTTCCACTTC ATATGACTTT GACTTCGGTT TGAGCTTAGG TGCCGCTTAC
TCAAACTCTG ACCGTACCAA TTCTCAAGTA GCTCTTGGCG GGTATCACTA CAATGAATAC
AGCAAATTTG CTGGTGGTGA TACAGCTGAA GCATGGACAT TTGGTGCAAA ATACGACGCC
AACAATGTTT ATCTTGCAAT GATGTATGCT GAAACCCGCA ATATGACACC ATATGGTAAT
GTGGGTATCG CAAATAAAAC CCAGAACTTT GAAGCTGTAG CACAGTATCA GTTCGATTTT
GGTCTGCGCC CATCTCTTGC ATATGTTTAT TCAAAAGGCA AAGACCTCGG CGGTAACGAC
TATAACAACA ATGGTCATCA AGAGTATGTT GATCAGGATC TGGTCAACTA TGTTGAAATT
GGAGCGACCT ACTATTTCAA TAAGAACTTC TCAACATATG TTGACTATAA AATAAACTTG
TTGGATAAAG ATGACGATTT CTATGACAAC AATGGCATCG CAACAGATGA CGTTGTTGGT
GTGGGCTTAG TCTACCAGTT CTAA
 
Protein sequence
MKRKVLAMLV PALLVAGAAN AAEIYNKNGN KLDLYGKVDA RHTFSDNPGD DGDETIIELG 
FKGETQITDQ LTGYGQALTK TKASDTEGSD NTYVKLAFAG LKFGEMGSFD YGRNYGVIYD
VEAWTDMLPV FGGDSYTWTD NFMAGRANGV ATYRNSDFFG LVEGLNFALQ YQGNNEGSNA
GEDQEGTKNG HEDVRFQNGD GFGLSTSYDF DFGLSLGAAY SNSDRTNSQV ALGGYHYNEY
SKFAGGDTAE AWTFGAKYDA NNVYLAMMYA ETRNMTPYGN VGIANKTQNF EAVAQYQFDF
GLRPSLAYVY SKGKDLGGND YNNNGHQEYV DQDLVNYVEI GATYYFNKNF STYVDYKINL
LDKDDDFYDN NGIATDDVVG VGLVYQF
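
The relationship between the two sequence blocks can be illustrated with a minimal translation sketch. It is not taken from the database. It builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acid to each codon as the standard code (the two differ only in permitted start codons). The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first and last blocks of the gene sequence are used here, and the full sequence can be pasted in the same way.

```python
from itertools import product

# Codons in NCBI order (first base varies slowest, bases ordered T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display, and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


first_block = "ATGAAAAGAA AAGTTCTGGC AATGCTGGTC"
last_block = "GTGGGCTTAG TCTACCAGTT CTAA"
print(translate(first_block))  # MKRKVLAMLV
print(translate(last_block))   # VGLVYQF
```

The two printed fragments match the start and end of the protein sequence listed above. `gc_percent` can be applied to the full gene sequence to compare against the GC content field.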