Gene EcSMS35_1702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_1702 
Symbol 
ID6144352 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp1705762 
End bp1706862 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content44% 
IMG OID641616578 
Productporin family protein 
Protein accessionYP_001743756 
Protein GI170679684 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3203] Outer membrane protein (porin) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0189246 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAATTAA AAATAGTTGC GGTGGTTGTA ACTGGTTTGT TAGCTGCGAA CGTAGCACAC 
GCTGCCGAAG TCTATAACAA GGATGGTAAT AAACTCGACC TTTATGGCAA GGTTACCGCT
CTACGTTATT TTACTGATGA TAAGCGTGAC GATGGTGATA AAACTTATGC CCGTCTCGGC
TTTAAAGGAG AAACGCAAAT CAATGATCAA ATGATTGGTT TTGGTCACTG GGAATATGAT
TTTAAAGGCT ATAACGATGA AGCCAACGGC TCGCGCGGTA ACAAGACCCG TCTGGCCTAT
GCAGGTTTAA AAATTAGTGA ATTTGGCTCT CTGGACTATG GTCGTAACTA CGGTGTCGGC
TATGACATTG GTTCATGGAC CGATATGTTG CCAGAATTTG GTGGCGATAC CTGGAGTCAG
AAAGATGTCT TCATGACATA CCGTACTACC GGTGTGGCAA CCTATCGCAA CTACGATTTC
TTCGGCTTAA TTGAAGGTCT GAACTTTGCC GCGCAATATC AAGGCAAAAA TGAACGCACT
GACAATGGTC ATCTTTATGG TGCTGACTAC ACGCGTGCCA ATGGTGACGG TTTCGGTATC
TCCTCAACTT ATGTTTATGA TGGCTTTGGT ATCGGTGCGG TGTATACCAA ATCCGATCGG
ACAAATGCGC AGGAAAGAGC CGCTGCTAAT CCTCTCAATG CCTCCGGTAA GAATGCAGAA
CTGTGGGCTA CAGGTATAAA ATATGATGCC AACAACATCT ACTTTGCAGC TAATTACGCT
GAAACATTAA ACATGACCAC CTATGGCGAT GGTTATATCT CTAACAAAGC ACAAAGTTTT
GAAGTAGTGG CACAATATCA ATTCGACTTC GGCTTGCGCC CATCACTCGC TTACCTGAAA
TCGAAAGGCA GAGATCTGGG CCGCTACGGC GACCAGGACA TGATTGAGTA TATCGACGTT
GGTGCGACGT ATTTCTTCAA CAAAAATATG TCGACCTATG TTGATTATAA AATCAACCTG
ATTGATGAAA GCGACTTTAC CCGTGCCGTA GATATTCGCA CCGATAACAT CGTCGCAACG
GGAATTACCT ATCAGTTCTA A
 
Protein sequence
MKLKIVAVVV TGLLAANVAH AAEVYNKDGN KLDLYGKVTA LRYFTDDKRD DGDKTYARLG 
FKGETQINDQ MIGFGHWEYD FKGYNDEANG SRGNKTRLAY AGLKISEFGS LDYGRNYGVG
YDIGSWTDML PEFGGDTWSQ KDVFMTYRTT GVATYRNYDF FGLIEGLNFA AQYQGKNERT
DNGHLYGADY TRANGDGFGI SSTYVYDGFG IGAVYTKSDR TNAQERAAAN PLNASGKNAE
LWATGIKYDA NNIYFAANYA ETLNMTTYGD GYISNKAQSF EVVAQYQFDF GLRPSLAYLK
SKGRDLGRYG DQDMIEYIDV GATYFFNKNM STYVDYKINL IDESDFTRAV DIRTDNIVAT
GITYQF