Gene EcSMS35_0703 details

Gene Information

Locus tag: EcSMS35_0703
Symbol:
ID: 6145855
Type: CDS
Is gene spliced: No
Is pseudogene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 707579
End bp: 708985
Gene Length: 1407 bp
Protein Length: 468 aa
Translation table: 11
GC content: 51%
IMG OID: 641615593
Product: OprD family outer membrane porin
Protein accession: YP_001742792
Protein GI: 170680096
COG category:
COG ID:
TIGRFAM ID:
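
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is complete and ends in a single stop codon. The function names are hypothetical and are not part of the source database.

```python
# Values copied from the Gene Information fields above.
start_bp = 707579
end_bp = 708985
gene_length_bp = 1407
protein_length_aa = 468


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes a complete, unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


coords_consistent = span_length(start_bp, end_bp) == gene_length_bp
protein_consistent = expected_protein_length(gene_length_bp) == protein_length_aa
print(coords_consistent, protein_consistent)
```

Under these assumptions both checks hold for this record: 708985 - 707579 + 1 = 1407 bp, and 1407 / 3 - 1 = 468 aa.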


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000220662
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 48
Fosmid unclonability p-value: 0.949628
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGTGCGT TTAGTGGCAA ACGTAGTACG CTGGCGCTGG CTATCGCCGG TGTTACAGCT 
ATGTCGGGCT TTATGGCAAT TCCGGAGGCT CGCGCCGAAG GATTCATCGA CGATTCAACC
TTAACCGGCG GTATCTATTA CTGGCAGCGT GAACGCGACC GTAAAGATGT TACCGACGGC
GACAAATATA AAACCAACCT TTCTCACTCC ACCTGGAACG CTAATCTCGA TTTTCAGTCT
GGCTATGCCG CTGATATGTT CGGTCTGGAT ATTGCCGCGT TTACAGCGAT TGAAATGGCA
GAAAACGGCG ACAGCTCTCA CCCGAACGAA ATCGCGTTTT CAAAAAGTAA TAAAGCCTAT
GACGAAGACT GGTCCGGCGA TAAAAGCGGT ATAAGTCTGT ATAAAGCGGC GGCTAAATTT
AAATACGGTC CGGTTTGGGC GAGGGCAGGT TACATTCAGC CAACTGGTCA AACGCTGTTA
GCGCCGCACT GGAGCTTTAT GCCGGGTACT TATCAGGGGG CGGAAGCCGG GGCGAATTTT
GATTATGGCG ATGCTGGTGC GTTGAGTTTC TCCTACATGT GGACCAACGA ATACAAAGCA
CCGTGGCATC TGGAAATGGA TGAGTTTTAT CAGAACGATA AAACCACCAA AGTTGATTAT
CTACACTCCC TTGGGGCGAA GTACGACTTC AAAAATAACT TCGTACTGGA AGCGGCGTTT
GGTCAGGCCG AAGGGTATAT CGATCAATAT TTTGCCAAAG CCAGCTACAA ATTTGATATC
GCCGGTAGCC CGTTAACCAC CAGCTACCAG TTCTATGGTA CGCGCGATAA AGTTGACGAT
CGCAGCGTCA ACGATCTTTA TGACGGCACC GCCTGGCTGC AGGCGTTGAC CTTTGGTTAC
CGGGCGGCTG ACGTAGTGGA TTTGCGCCTC GAAGGTACCT GGGTTAAGGC TGACGGTCAG
CAGGGATACT TCCTGCAACG TATGACTCCA ACCTACGCTT CCTCGAACGG TCGCCTGGAT
ATCTGGTGGG ATAACCGTTC CGACTTCAAC GCCAACGGCG AAAAAGCGAT CTTCTTCGGT
GCGATGTATG ACCTGAAAAA CTGGAATCTT CCAGGCTTCG CCATCGGCGC TTCCTACGTT
TACGCATGGG ATGCTAAACC TGCGACCTGG CAGAGCAATC CGGATGCGTA CTACGATAAA
AACCGGACTA TTGAAGAGTC TGCCTACAGC CTGGATGCGG TGTACACCAT TCAGGACGGT
CGCGCCAAAG GCACGATGTT CAAACTGCAC TTCACCGAAT ACGACAACCA CTCCGACATC
CCAAGCTGGG GCGGTGGTTA CGGCAACATC TTCCAGGATG AGCGTGACGT GAAATTTATG
GTAATCGCAC CATTCACCAT CTTCTGA
 
Protein sequence
MRAFSGKRST LALAIAGVTA MSGFMAIPEA RAEGFIDDST LTGGIYYWQR ERDRKDVTDG 
DKYKTNLSHS TWNANLDFQS GYAADMFGLD IAAFTAIEMA ENGDSSHPNE IAFSKSNKAY
DEDWSGDKSG ISLYKAAAKF KYGPVWARAG YIQPTGQTLL APHWSFMPGT YQGAEAGANF
DYGDAGALSF SYMWTNEYKA PWHLEMDEFY QNDKTTKVDY LHSLGAKYDF KNNFVLEAAF
GQAEGYIDQY FAKASYKFDI AGSPLTTSYQ FYGTRDKVDD RSVNDLYDGT AWLQALTFGY
RAADVVDLRL EGTWVKADGQ QGYFLQRMTP TYASSNGRLD IWWDNRSDFN ANGEKAIFFG
AMYDLKNWNL PGFAIGASYV YAWDAKPATW QSNPDAYYDK NRTIEESAYS LDAVYTIQDG
RAKGTMFKLH FTEYDNHSDI PSWGGGYGNI FQDERDVKFM VIAPFTIF
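
The relationship between the gene sequence, the protein sequence, and the GC content field can be sketched in a few lines. This is a minimal illustration, not the database's own method. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code, and it does not model table 11's alternative start codons, which is adequate here because the gene starts with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ..., GGG.
# Translation table 11 uses the same codon-to-amino-acid assignments as table 1.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's sequence layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "ATGCGTGCGT TTAGTGGCAA ACGTAGTACG CTGGCGCTGG CTATCGCCGG TGTTACAGCT"
print(translate(first_line))
```

The printed peptide, MRAFSGKRSTLALAIAGVTA, matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the GC content field.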