Gene EcSMS35_4699 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4699 
Symbol 
ID6145439 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4797088 
End bp4798821 
Gene Length1734 bp 
Protein Length577 aa 
Translation table11 
GC content54% 
IMG OID641619515 
ProductOMP85 family outer membrane protein 
Protein accessionYP_001746623 
Protein GI170681698 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0729] Outer membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones63 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCTATA TCCGACAGTT ATGCTGTGTA AGCTTACTCT GCTTAAGCGG ATCTGCCGTC 
GCCGCGAACA TCCGTCTACA GGTCGAGGGG TTATCGGGAC AGCTGGAAAA GAACGTTCGT
GCGCAGCTTT CTACGATTGA AAGTGATGAA GTGACGCCAG ACCGTCGCTT TCGCGCACGC
GTCGATGATG CCATCCGCGA AGGTCTGAAA GCGCTGGGTT ATTACCAGCC GACCATTGAA
TTTGATCTCC GTCCACCGCC AAAGAAAGGG CGTCAGGTAT TGATCGCCAA AGTCACGCCA
GGCGTGCCGG TGTTAATTGG CGGCACCGAT GTGGTATTGC GCGGCGGCGC GCGGACCGAT
AAAGACTATT TGAAATTGCT CGATACTCGC CCGGCTATTG GTACGGTACT GAACCAGGGC
GATTATGAAA ATTTCAAAAA GTCCTTAACC AGCATTGCGT TGCGTAAAGG TTATTTCGAT
AGCGAATTTA CCAAAGCGCA GCTGGGCATT GCGCTCGGCC TGCATAAAGC CTTCTGGGAT
ATTGATTATA ACAGTGGCGA ACGTTACCGC TTTGGGCATG TGACCTTTGA AGGATCACAA
ATCCGCGATG AATACCTGCA AAATCTGGTG CCGTTTAAAG AGGGCGATGA GTACGAATCG
AAAGATCTGG CAGAGCTGAA CCGTCGACTT TCTGCTACCG GCTGGTTTAA CTCGGTCGTA
GTGGCTCCAC AATTTGATAA ATCGCGCGAA ACGAAAGTAT TACCATTGAC GGGCGTGGTT
TCGCCGCGAA CTGAAAACAC CATCGAAACC GGGGTCGGTT ACTCTACGGA CGTGGGACCG
CGCGTGAAAG CGACGTGGAA AAAGCCGTGG ATGAACTCTT ATGGTCACAG TCTGACCACC
AGTACCAGTA TTTCCGCGCC GGAACAGATC CTCGACTTCA GCTATAAAAT GCCGCTGTTG
AAGAATCCAC TGGAACAATA TTATCTGGTG CAGGGCGGTT TTAAGCGCAC TGACCTGAAC
GATACCGAGT CTGACTCCAC TACGCTGGTG GCTTCTCGCT ACTGGGATCT CTCCAGCGGC
TGGCAGCGTG CCATTAACCT GCGCTGGAGC CTCGATCACT TTACCCAGGG TGAAATTACC
AACACCACGA TGCTGTTTTA TCCTGGGGTG ATGATTAGCC GCACGCGTTC TCGCGGTGGC
CTGATGCCAA CCTGGGGCGA CTCGCAACGC TACTCTATCG ACTACTCCAA CACGGCCTGG
GGCTCAGATG TCGATTTCTC CGTTTTCCAG GCGCAGAACG TCTGGATCCG CACACTGTAC
GATCGCCATC GTTTTGTGAC ACGCGGCACG CTGGGCTGGA TTGAAACGGG TGATTTCGAC
AAAGTACCGC CGGATCTGCG TTTCTTCGCC GGGGGCGATC GCAGTATTCG CGGCTACAAA
TACAAATCAA TCGCTCCGAA ATACGCCAAC GGTGACCTGA AAGGGGCCTC GAAGTTGATA
ACCGGATCGC TGGAGTACCA GTACAACGTG ACCGGAAAAT GGTGGGGCGC GGTGTTTGTC
GATAGCGGCG AAGCGGTAAG CGATATTCGC CGCAGCGACT TTAAAACCGG TACCGGGGTC
GGCGTGCGCT GGGAATCGCC GGTCGGGCCA ATCAAGCTCG ATTTTGCCGT ACCGGTCGCG
GATAAAGACG AGCACGGGTT ACAGTTTTAC ATCGGTCTGG GGCCAGAATT ATGA
 
Protein sequence
MRYIRQLCCV SLLCLSGSAV AANIRLQVEG LSGQLEKNVR AQLSTIESDE VTPDRRFRAR 
VDDAIREGLK ALGYYQPTIE FDLRPPPKKG RQVLIAKVTP GVPVLIGGTD VVLRGGARTD
KDYLKLLDTR PAIGTVLNQG DYENFKKSLT SIALRKGYFD SEFTKAQLGI ALGLHKAFWD
IDYNSGERYR FGHVTFEGSQ IRDEYLQNLV PFKEGDEYES KDLAELNRRL SATGWFNSVV
VAPQFDKSRE TKVLPLTGVV SPRTENTIET GVGYSTDVGP RVKATWKKPW MNSYGHSLTT
STSISAPEQI LDFSYKMPLL KNPLEQYYLV QGGFKRTDLN DTESDSTTLV ASRYWDLSSG
WQRAINLRWS LDHFTQGEIT NTTMLFYPGV MISRTRSRGG LMPTWGDSQR YSIDYSNTAW
GSDVDFSVFQ AQNVWIRTLY DRHRFVTRGT LGWIETGDFD KVPPDLRFFA GGDRSIRGYK
YKSIAPKYAN GDLKGASKLI TGSLEYQYNV TGKWWGAVFV DSGEAVSDIR RSDFKTGTGV
GVRWESPVGP IKLDFAVPVA DKDEHGLQFY IGLGPEL