Gene EcSMS35_2702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_2702 
Symbol 
ID6147137 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp2774798 
End bp2778079 
Gene Length3282 bp 
Protein Length1093 aa 
Translation table11 
GC content55% 
IMG OID641617573 
ProductTPR repeat-containing protein 
Protein accessionYP_001744738 
Protein GI170682243 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones56 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACTCCAG TAAAAGTGTG GCAAGAGCGC GTTGAGATCC CGACCTATGA AACCGGGCCG 
CAGGATATAC ATCCCATGTT CCTGGAAAAT CGCGTTTATC AGGGATCGTC CGGCGCGGTT
TATCCCTACG GCGTGACCGA TACGCTGAGC GAGCAGAAAA CCCTGAAATC CTGGCAGGCG
GTGTGGCTGG AAAATGACTA CATCAAAGTG ATGATCCTGC CGGAACTGGG CGGTCGGGTG
CATCGCGCAT GGGATAAAGT GAAACAGCGC GATTTTGTCT ATCACAATGA AGTCATTAAA
CCTGCGCTGG TGGGGCTGCT GGGACCGTGG ATCTCCGGCG GGATTGAGTT TAACTGGCCG
CAACACCATC GCCCGACCAC CTTTATGCCC GTTGATTTCA CCCTCGAAGC CCATGACGAC
GGCGCACAGA CGGTGTGGGT CGGCGAAACG GAGCCGATGC ACGGTTTACA GGTGATGACA
GGTTTCACCC TGCGTCCTGA CCGGGCGGCG CTGGAAATCG CCAGTCGCGT CTATAACGGC
AACGCCACGC CGCGTCATTT CTTGTGGTGG GCCAACCCGG CAGTGAAAGG GGGTGAAGGG
CATCAGAGCG TCTTCCCGCC GGATGTAACG GCGGTGTTTG ATCACGGCAA ACGGGCCGTC
TCCGCTTTCC CCATCGCCAC CGGCACTTAC TACAAAGTGG ACTACTCCGC CGGAGTGGAC
ATTTCTCGCT ATAAAAATGT GCCCGTTCCA ACCTCATATA TGGCTGAAAA ATCACAGTAC
GATTTTGTTG GCGCGTGGTG TCACGATGAA GATGGTGGTT TGCTACACGT TGCCAACCAC
CATATTGCGC CAGGTAAAAA ACAGTGGAGC TGGGGACACA GTGAATTTGG CCAGGCGTGG
GATAAGAGCC TGACTGACAA TAACGGCCCG TATATCGAAC TGATGACCGG TATTTTTGCC
GATAACCAGC CTGATTTTAC CTGGCTTGAT GCTTACGAGG AGAAGCGTTT CGAGCAGTAT
TTCCTGCCTT ATCATTCTCT GGGCATGGTG CAAAATGCCT CCCGCGATGC GGTGATAAAA
CTCCAGCGTA GTGAGCGGGG GATTGAGTGG GGGCTGTATG CCATCTCTCC GTTGAACGGA
TACCGCCTGG CGATCCGCGA AATCGGCAAA TGCAACGCGT TGCTCGATGA TGCCGTGGCC
CTGACACCAG CGACCGCCAT CCAGGGCGTG TTGCACGGTA TCAATCCTGA CAGACTGACC
ATTGAGCTCT CCGATGCCGA CGGCAATATT GTACTGAGTT ATCATGAACA TCAGCCGCAA
GCGTTGCCGT TGCCGGACGT CGCCAAAGCG CCACTGTCAG CACAAGACAT TACCAGTACA
GATGAAGCCT GGTTTATCGG TCAGCATCTG GAGCAATATC ATCACGCCAG CCGTTCACCG
TTCGATTACT ACCTGCGCGG CGTGGCGCTG GATCCGCTGG ATTATCGCTG TAACCTGGCG
CTGGCGATGC TGGAATATAA CCGCGCCGAT TTCCCGCAAG CGGTGGCGTA TGCCACTCAG
GCGCTGAAAC GCGCACATGC GCTGAACAAA AATCCGCAGT GCGGACAGGC CAGTTTGATT
CGCGCCAGTG CTTACGAACG TCAGGGACAA TATCAACAAG CCGAAGAGGA TTTCTGGCGG
GCGGTCTGGA GCGGCAATAG TAAAGCAGGT GGCTATTATG GTCTGGCACG ACTGGCGGCG
CGTAATGGTC ACTTCGATGC TGGTCTGGAT TTTTGCCAAC AAAGTCTTCG CGCCTGCCCA
ACCAATCAGG AAGTGCTTTG CCTGCATAAT CTGCTGCTGG TGTTAAGTGG TCGTCAGGAC
AACGCGCGTT TGCAGCGCGA GAAACTGCTG CGCGATTATC CGCTGAACGC CACTCTGTGG
TGGCTGAACT GGTTCGATGG TCGTAGCGAA TCAGCCCTCG CGCAGTGGCG CGGTCTGTGT
CAGGGACGCG ACGTTAACGC CCTGATGACC GCCGGGCAAC TGATTAACTG GGGAATGCCT
ACCCTGGCGG CAGAGATGCT GAACGCACTG GACTGCCAGC GCACGCTGCC GCTTTATCTG
CAGGCGAGTT TATTGCCGAA AGCCGAACGC GGCGAACTGG TCGCAAAAGC CATTGATGTC
TTCCCGCAGT TTGTCCGTTT CCCGAATACG CTGGAAGAAG TGGCGGCGCT GGAGAGTATT
GAAGAGTGCT GGTTTGCCCG TCATTTATTG GCCTGCTTCT ACTACAACAA GCGTAGCTAC
GGCAAAGCCA TTGCCTTATG GCAACGTTGC GTAGAGATGT CGCCGGAGTT TGCCGACGGC
TGGCGCGGGT TAGCGATCCA TGCGTGGAAT AAGCAACACG ATTATGAGCT GGCCGCGCGT
TATCTTGATA ATGCTTATCA GCTTGCGCCG CAGGATGCAC GTCTGCTTTT CGAACGGGAT
TTGCTGGATA AGTTAAGTGG AGCCACACCG GAGAAACGAC TGGCGCGTCT GGAAAATAAT
CTGGAAATTG CGCTGAAACG CGACGACATG ACCGCAGAAC TGCTCAATTT GTGGCATCTC
ACGGGTCAGG CAGACAAAGC GGCGGACATT CTCGCCACGC GCAAATTCCA CCCGTGGGAA
GGCGGGGAAG GGAAGATCAC CAGTCAGTTT ATCCTCAACC AGTTATTACG CGCCTGGCAG
CATCTTGATG CCAGAGAGCC GCAGCAGGCC AGCGAACTGC TTCATGCCGC GCTGCATTAT
CCGGAGAATT TAAGCGAAGG CCGTTTACCG GGGCAAACTG ATAACGACAT CTGGTTCTGG
CAGGCGATAT GCGCCAACGC GCAGGGAGAT GAAACTGAAG CGATGCGTTG TTTACGTCTG
GCGGCGACTG GCGATCGCAC CATTAACATT CACAGTTATT ACAACGATCA GCCGGTTGAT
TATCTCTTCT GGCAAGGAAT GGCGCTGCGA CTGCTGGGCG AACAACAAAT CGCACAGCAA
CTGTTTAGTG AAATGAAACA GTGGGCGCAA GAGATGGCGA AAACCAGTAT TGAGGCGGAT
TTCTTTGCTG TTTCACAACC TGACCTGTTG TCGCTGTATG GCGATTTACA ACAGCAGCAT
AAAGAAAAAT GCCTGATGGT GGCGATGCTG GCGTCCGCGG GACTCGGGGA GGTTGCGCAC
TATGAATCTG CTCGCGCTGA ATTGACGGCG ATTAATCCGG CCTGGCCGAA AGCGGCATTA
TTCACCACTG TGATGCCTTT TATTTTTAAC TACGTTCACT AA
 
Protein sequence
MTPVKVWQER VEIPTYETGP QDIHPMFLEN RVYQGSSGAV YPYGVTDTLS EQKTLKSWQA 
VWLENDYIKV MILPELGGRV HRAWDKVKQR DFVYHNEVIK PALVGLLGPW ISGGIEFNWP
QHHRPTTFMP VDFTLEAHDD GAQTVWVGET EPMHGLQVMT GFTLRPDRAA LEIASRVYNG
NATPRHFLWW ANPAVKGGEG HQSVFPPDVT AVFDHGKRAV SAFPIATGTY YKVDYSAGVD
ISRYKNVPVP TSYMAEKSQY DFVGAWCHDE DGGLLHVANH HIAPGKKQWS WGHSEFGQAW
DKSLTDNNGP YIELMTGIFA DNQPDFTWLD AYEEKRFEQY FLPYHSLGMV QNASRDAVIK
LQRSERGIEW GLYAISPLNG YRLAIREIGK CNALLDDAVA LTPATAIQGV LHGINPDRLT
IELSDADGNI VLSYHEHQPQ ALPLPDVAKA PLSAQDITST DEAWFIGQHL EQYHHASRSP
FDYYLRGVAL DPLDYRCNLA LAMLEYNRAD FPQAVAYATQ ALKRAHALNK NPQCGQASLI
RASAYERQGQ YQQAEEDFWR AVWSGNSKAG GYYGLARLAA RNGHFDAGLD FCQQSLRACP
TNQEVLCLHN LLLVLSGRQD NARLQREKLL RDYPLNATLW WLNWFDGRSE SALAQWRGLC
QGRDVNALMT AGQLINWGMP TLAAEMLNAL DCQRTLPLYL QASLLPKAER GELVAKAIDV
FPQFVRFPNT LEEVAALESI EECWFARHLL ACFYYNKRSY GKAIALWQRC VEMSPEFADG
WRGLAIHAWN KQHDYELAAR YLDNAYQLAP QDARLLFERD LLDKLSGATP EKRLARLENN
LEIALKRDDM TAELLNLWHL TGQADKAADI LATRKFHPWE GGEGKITSQF ILNQLLRAWQ
HLDAREPQQA SELLHAALHY PENLSEGRLP GQTDNDIWFW QAICANAQGD ETEAMRCLRL
AATGDRTINI HSYYNDQPVD YLFWQGMALR LLGEQQIAQQ LFSEMKQWAQ EMAKTSIEAD
FFAVSQPDLL SLYGDLQQQH KEKCLMVAML ASAGLGEVAH YESARAELTA INPAWPKAAL
FTTVMPFIFN YVH