Gene EcSMS35_2263 details


Gene Information

Locus tag: EcSMS35_2263
Symbol:
ID: 6145742
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 2284330
End bp: 2285463
Gene Length: 1134 bp
Protein Length: 377 aa
Translation table: 11
GC content: 41%
IMG OID: 641617138
Product: glycosyl transferase, group 1 family protein
Protein accession: YP_001744311
Protein GI: 170679591
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
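
The coordinates and lengths listed above are mutually consistent, which a short check can confirm. The sketch below assumes 1-based, inclusive coordinates (so the length is end minus start plus one) and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length_bp(2284330, 2285463)
print(length, protein_length_aa(length))  # prints: 1134 377
```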


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 56
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAC TGTGTTATTT CATAAATTCG GATTGGTATT TTGATTTGCA TTGGACCGAT 
CGTGCAATTG CCGCCCGAGA TGCCGGTTAT GAGATTCACA TTATTAGCCA TTTTGTCGAT
GATAAAATAG CGGAAAAATT CAGGACATTA GGTTTTGTTT GCCATAACAT TCCACTTGTC
GCCCAATCAT TCAACGTTTT GATTTTCTTT CGGGCCTTTT CTAAGGCTCG GAAAATTATT
CAAAACATCA ATCCAGATTT GTTGCACTGC ATTACCATCA AGCCCTGTTT GATCGGCGGT
TTTCTGGCTA AAAGTACCCA TCGTCCTGTT ATTCTGAGTT TTGTCGGTTT GGGCCGAGTA
TTTTCCGCAG AGTCTGCCTG TCTTAAGCTG CTGCGAAGTT TTACTGTTAT GGCATATAAG
TATATCGCCA GTAACAAATG CAGTTTGTTT ATGTTCGAAC ATGATAAAGA CAGAGCTAAA
CTTGCCGATC TGGTTGGTAT CGATTACAAA CAGACTATTG TTATTGATGG CGCGGGTATT
AATCCAGAGA TTTACAAATA CTCTCTGGAG CAGCAGCGTG ATGTTCCGGT CGTCCTTTTT
GCCAGCCGTA TGCTGTGGAG TAAAGGACTG GGTGACCTGA TTGAAGCCAA AAAAATACTG
AGTAATAAAA ATATTCACTT TACGCTGAAT GTTGCCGGTA TTTTAGTTGA GAATGATAAA
GACGCAATTC CGCTGGCGAC GATACAGAAG TGGCAAAGCG AAGGCGTGAT TAACTGGCTC
GGTCATTGCT CTAATGTATT TGATTTAATT GAAGAATCAA ATATCGTTGC TTTGCCGTCG
GTCTACGCCG AAGGCGTACC GCGTATCTTG CTGGAAGCTT CCTCTGTCGG GCGCGCTTGT
ATCGCTTATG ATGTTGGTGG CTGTGATAGC TTAATTATCA ATAACTATAA TGGGTTGATT
GTAAAAAGTA AATCTGTCGA GGAATTAGCG GAGAAACTCG GTTTCCTGTT GGATAACCCA
GAAACGCGCG TCGCAATGGG TATCAATGGC AGAAAACGCA TTCAAGATAA ATTCTCGAGT
GTGATGATCA TTAATAAAAC ATTAAAAACA TATCGTGATG TTGTTGAAGA GTAA
 
Protein sequence
MKKLCYFINS DWYFDLHWTD RAIAARDAGY EIHIISHFVD DKIAEKFRTL GFVCHNIPLV 
AQSFNVLIFF RAFSKARKII QNINPDLLHC ITIKPCLIGG FLAKSTHRPV ILSFVGLGRV
FSAESACLKL LRSFTVMAYK YIASNKCSLF MFEHDKDRAK LADLVGIDYK QTIVIDGAGI
NPEIYKYSLE QQRDVPVVLF ASRMLWSKGL GDLIEAKKIL SNKNIHFTLN VAGILVENDK
DAIPLATIQK WQSEGVINWL GHCSNVFDLI EESNIVALPS VYAEGVPRIL LEASSVGRAC
IAYDVGGCDS LIINNYNGLI VKSKSVEELA EKLGFLLDNP ETRVAMGING RKRIQDKFSS
VMIINKTLKT YRDVVEE
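
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. As a minimal sketch, the helper below builds a codon table from the conventional TCAG ordering and strips the spacing used in the listing before translating. Translation table 11 assigns the same amino acids to codons as the standard code and differs in which codons may initiate translation, so the standard assignments are used here. The `translate` name is illustrative, and an initiator codon other than ATG would need separate handling to be read as M.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in TCAG codon order (first base varies slowest); "*" marks a stop.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces and line breaks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGAAAAAAC TGTGTTATTT CATAAATTCG"))  # prints: MKKLCYFINS
```

Passing the full gene sequence in the same way should reproduce the 377-residue protein sequence shown above.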