Gene EcSMS35_3330 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3330 
SymboltolC 
ID6142633 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3406637 
End bp3408118 
Gene Length1482 bp 
Protein Length493 aa 
Translation table11 
GC content51% 
IMG OID641618159 
Productouter membrane channel protein 
Protein accessionYP_001745309 
Protein GI170682545 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01844] type I secretion outer membrane protein, TolC family 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGAAAT TGCTCCCCAT TCTTATCGGC CTGAGCCTTT CTGGTTTCAG TTCGTTGAGC 
CAGGCCGAGA ACCTGATGCA AGTTTATCAG CAAGCACGCC TTAGTAACCC GGAATTGCGT
AAGTCTGCCG CCGATCGTGA TGCTGCCTTT GAAAAAATTA ATGAAGCGCG CAGTCCATTA
CTGCCACAGC TAGGTTTAGG TGCAGATTAC ACCTATAGCA ACGGCTACCG CGACGCGAAC
GGCATCAACT CTAACGCGAC CAGTGCGTCC CTGCAGTTAA CTCAATCCAT TTTTGATATG
TCGAAATGGC GTGCGTTAAC GCTGCAGGAA AAAGCAGCTG GGATTCAGGA CGTCACGTAT
CAGACCGATC AACAAACTTT GATCCTCAAC ACCGCGACCG CTTATTTCAA CGTGTTGAAT
GCTATTGACG TTCTTTCCTA TACACAGGCG CAAAAAGAAG CGATCTACCG TCAATTAGAT
CAAACCACCC AACGTTTTAA CGTGGGCCTG GTAGCGATCA CTGACGTGCA GAACGCCCGC
GCACAGTACG ATACCGTGCT GGCGAACGAA GTGACCGCAC GTAATAACCT TGATAACGCG
GTAGAGCAGC TGCGCCAGAT CACCGGTAAC TACTATCCGG AACTGGCGGC GCTGAATGTC
GAAAACTTTA AAACCGACAA ACCACAGCCG GTTAACACGC TGCTGAAAGA AGCCGAAAAA
CGCAACCTGT CGTTGTTACA GGCGCGCTTG AGCCAGGACC TGGCGCGCGA GCAAATTCGC
CAGGCGCAGG ATGGTCACTT ACCGACTCTG GATTTAACGG CTTCTACCGG GATTTCTGAC
ACCTCTTATA GCGGTTCGAA AACCCGTGGT GCCGCTGGTA CCCAGTATGA CGATAGCAAT
ATGGGCCAGA ACAAAGTTGG CCTGAGCTTC TCGCTGCCGA TTTATCAGGG CGGAATGGTT
AACTCGCAGG TGAAACAGGC CCAGTACAAC TTTGTTGGTG CCAGCGAGCA ACTGGAAAGC
GCGCATCGTA GCGTCGTACA AACCGTACGT TCCTCCTTCA ACAACATTAA TGCTTCTATC
AGTAGCATTA ACGCCTACAA ACAAGCCGTA GTTTCCGCTC AAAGCTCATT AGACGCGATG
GAAGCGGGTT ACTCGGTCGG TACGCGTACC ATTGTTGATG TGTTGGATGC GACCACCACG
CTGTACAACG CCAAGCAAGA GCTGGCAAAT GCGCGTTATA ACTACCTGAT TAACCAGTTG
AATATTAAAT CAGCTCTGGG TACGTTGAAC GAGCAGGATC TTCTGGCACT GAACAATGCG
CTGAGCAAAC CGGTTTCCAC TAATCCGGAA AACGTTGCCC CGCAAACGCC GGAACAGAAT
GCTATTGCTG ATGGTTATGC GCCTGATAGC CCGGCACCCG TCGTTCAGCA AACATCCGCA
CGCACTACCA CCAGTAACGG TCATAACCCT TTCCGTAACT GA
 
Protein sequence
MKKLLPILIG LSLSGFSSLS QAENLMQVYQ QARLSNPELR KSAADRDAAF EKINEARSPL 
LPQLGLGADY TYSNGYRDAN GINSNATSAS LQLTQSIFDM SKWRALTLQE KAAGIQDVTY
QTDQQTLILN TATAYFNVLN AIDVLSYTQA QKEAIYRQLD QTTQRFNVGL VAITDVQNAR
AQYDTVLANE VTARNNLDNA VEQLRQITGN YYPELAALNV ENFKTDKPQP VNTLLKEAEK
RNLSLLQARL SQDLAREQIR QAQDGHLPTL DLTASTGISD TSYSGSKTRG AAGTQYDDSN
MGQNKVGLSF SLPIYQGGMV NSQVKQAQYN FVGASEQLES AHRSVVQTVR SSFNNINASI
SSINAYKQAV VSAQSSLDAM EAGYSVGTRT IVDVLDATTT LYNAKQELAN ARYNYLINQL
NIKSALGTLN EQDLLALNNA LSKPVSTNPE NVAPQTPEQN AIADGYAPDS PAPVVQQTSA
RTTTSNGHNP FRN