Gene EcSMS35_3268 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_3268 
Symbol 
ID6146758 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp3346437 
End bp3347786 
Gene Length1350 bp 
Protein Length449 aa 
Translation table11 
GC content53% 
IMG OID641618098 
Productputative polysaccharide biosynthesis protein 
Protein accessionYP_001745248 
Protein GI170681924 
COG category[R] General function prediction only 
COG ID[COG2244] Membrane protein involved in the export of O-antigen and teichoic acid 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.809539 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGGCGGGTT TTAACATCAA ACATTGGTTT GCAGATGGCG CGTTTCGCAC CATTATTCGC 
AATAGCGCCT GGTTAGGCTC CAGTAATGTC GTGAGCGCTT TACTGGGTCT GTTGGCGCTC
TCGTGTGCCG GTAAAGGGAT GACGCCCGCC ATGTTTGGCG TACTGGTGAT TGTGCAATCG
TACGCCAAGT CGATCAGCGA TTTTATTAAG TTTCAGACAT GGCAGTTGGT GGTGCAGTAC
GGAACGCCAG CCTTGACCAA CAATAATCCG CAGCAATTTC GCAATGTCGT CTCATTTTCC
TTCTCGCTGG ATATCGTCAG CGGCGCGGTG GCGATTGTCG GCGGCATTGC CTTACTCCCT
TTTCTTTCCC ATTCATTAGG TCTGGATGAC CAAAGCTTTT GGCTGGCAGC GCTGTATTGC
ACGCTCATTC CTTCAATGGC TTCCTCCACG CCGACCGGCA TTCTGCGTGC GGTAGATCGC
TTCGATCTAA TTGCTGTACA GCAAGCGACG AAACCTTTTC TGCGTGCAGC GGGGAGCGTC
GTAGCCTGGT ATTTTGACTT TGGTTTTGCG GGTTTTGTTA TTGCCTGGTA CGTGTCGAAT
CTGGTTGGCG GCACCATGTA CTGGTGGTTT GCCGCGCGCG AATTACGCCG CAGAAATATC
CATAACGCCT TCAAATTGAA TCTGTTTGAG TCTGCCCGAC ACATTAAAGG CGCGTGGAGT
TTTGTCTGGT CAACCAACAT TGCCCACTCC ATCTGGTCGG CGCGTAACTC GTGCAGCACA
GTGCTGGTGG GGATCGTGTT AGGACCCGCT GCCGCCGGGT TATTTAAAAT CGCCATGACA
TTCTTCGACG CCGCCGGAAC GCCAGCGGGT TTACTGGGCA AAAGTTTTTA CCCAGAGGTG
ATGCGTTTAG ATCCGCGCAC CACCAGACCG TGGTTGTTAG GCGTGAAGTC AGGTTTACTG
GCGGGCGGAA TCGGTATTCT GGTGGCGCTC GCGGTGTTGA TTGTCGGCAA GCCGCTCATT
TCGCTGGTGT TTGGCGTTAA GTATCTCGAA GCGTATGACC TGATTCAGGT GATGCTGGGC
GCAATTGTGA TCTCGATGCT GGGCTTCCCT CAGGAGTCAT TGCTATTGAT GGCGGGAAAA
CAGCGCGCTT TTCTCGTGGC GCAAACCATC GCCTCAATCG GCTACATCGT ATTGCTGTTC
ATGTTCTGTC ATCTGTTTGG CGTGCTGGGA GCAGCGTTTG CTTATTTCGG CGGTCAATGT
CTGGATGTGG TGCTCTCGCT AATTCCAACC TTAAGGGCAT TTTTTCAGCG CCATTCCTTG
CTTTATAACG CAGCCGGAGA GAAATCCTGA
 
Protein sequence
MAGFNIKHWF ADGAFRTIIR NSAWLGSSNV VSALLGLLAL SCAGKGMTPA MFGVLVIVQS 
YAKSISDFIK FQTWQLVVQY GTPALTNNNP QQFRNVVSFS FSLDIVSGAV AIVGGIALLP
FLSHSLGLDD QSFWLAALYC TLIPSMASST PTGILRAVDR FDLIAVQQAT KPFLRAAGSV
VAWYFDFGFA GFVIAWYVSN LVGGTMYWWF AARELRRRNI HNAFKLNLFE SARHIKGAWS
FVWSTNIAHS IWSARNSCST VLVGIVLGPA AAGLFKIAMT FFDAAGTPAG LLGKSFYPEV
MRLDPRTTRP WLLGVKSGLL AGGIGILVAL AVLIVGKPLI SLVFGVKYLE AYDLIQVMLG
AIVISMLGFP QESLLLMAGK QRAFLVAQTI ASIGYIVLLF MFCHLFGVLG AAFAYFGGQC
LDVVLSLIPT LRAFFQRHSL LYNAAGEKS