Gene Information |
Locus tag | EcSMS35_4700 |
Symbol | |
ID | 6145884 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4798818 |
End bp | 4802597 |
Gene Length | 3780 bp |
Protein Length | 1259 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641619516 |
Product | hypothetical protein |
Protein accession | YP_001746624 |
Protein GI | 170683277 |
COG category | [S] Function unknown |
COG ID | [COG2911] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
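The coordinate and length fields above are mutually consistent, which can be checked with a small sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function name `cds_lengths` is hypothetical and not part of any database API.

```python
from typing import Tuple


def cds_lengths(start_bp: int, end_bp: int) -> Tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS.

    Assumption: coordinates are 1-based and inclusive, and the final
    codon is a stop codon that does not contribute an amino acid.
    """
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein_len = gene_len // 3 - 1  # drop the untranslated stop codon
    return gene_len, protein_len


# Values taken from the record: Start bp 4798818, End bp 4802597.
gene_len, protein_len = cds_lengths(4798818, 4802597)
print(gene_len, protein_len)
```

Under these assumptions the result matches the reported Gene Length (3780 bp) and Protein Length (1259 aa).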
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTTTAT GGAAAAAAAT CAGCCTCGGC GTGGTTATCG TTATCTTACT GTTGCTGGGA TCGGTGGCGT TTCTGGTGGG CACCACTAGC GGCCTGCATC TGGTATTTAA AGCGGCGGAT CGCTGGGTGC CAGGACTGGA TATTGGCAAG GTCACCGGCG GCTGGCGCGA TCTCACCTTG TCTGACGTTC GTTATGAGCA GCCAGGCGTG GCGGTAAAAG CGGGCAATCT GCATCTGGCG GTCGGGCTTG AGTGCCTGTG GAACAGTAGC GTTTGTATTA ATGACCTGGC GCTGAAAGAC ATTCAGGTCA ACATCGACAG TAAAAAAATG CCTCCTTCTG AACAGGTTGA AGAAGAGGAA GATAGCGGTC CGCTGGATCT CTCCACGCCG TATCCCATTA CCCTGACACG GGTGGCGCTG GACAACGTCA ACATCAAGAT TGATGACACC ACGGTATCGG TGATGGACTT CACCTCCGGC CTGAACTGGC AGGAGAAAAC TCTGACCCTG AAACCGACGT CGCTGAAAGG CTTGCTGATT GCCTTGCCGA AGGTGGCGGA CGTGGCGCAG GAAGAAGTGG TCGAACCGAA GATTGAAAAT CCGCAGCCTG ATGAAAAACC GCTCGGCGAA ACGCTGAAAG ATCTCTTTTC TCGCCCGGTA TTGCCGGAAA TGACCGACGT GCATTTGCCG CTTAACCTGA ACATTGAAGA GTTTAAGGGC GAGCAGCTGC GCGTGACGGG TGACACGGAC ATCACCGTGA GCACCATGCT GCTGAAAGTG AGCAGCATCG ACGGTAATAC TAAACTGGAC GCCCTGGATA TCGATTCCAG TCAAGGGATC GTCAACGCCA GCGGCACGGC GCAGCTGTCA GACAACTGGC CGGTGGATAT CACCCTCAAC AGCACACTGA ACGTGGAGCC GTTGAAAGGT GAAAAGGTGA AGCTGAAAGT GGGCGGCGCG CTGCGCGAAC AGCTGGAGAT TGGCGTAAAC CTTTCCGGTC CGGTGGATAT GGATTTACGC GCCCAGACGC GACTGGCGGA AGCCGGATTG CCGCTCAACG TGGAAGTGAA CAGCAAACAG CTTTACTGGC CGTTCACTGG TGAGAAGCAG TATCAGGCGG ATGATCTGAA ACTGAAACTC ACCGGTAAAA TGACCGATTA CACGCTCTCT ATGCGAACGG CAGTGAAGGG ACAGGAGATC CCGCCAGCCA CCATTACCCT TGATGCCAAA GGTAATGAAC AGCAGGTCAA TCTCGACAAA CTTACCGTCG CGGCGCTGGA AGGGAAAACT GAACTCAAGG CGTTGCTCGA CTGGCAGCAG GCAATTAGCT GGCGCGGTGA GCTAACGCTT AACGGCATTA ACACCGCCAA AGAGTTCCCG GAGTGGCCGT CGAAACTCAA TGGCTTGATT AAAACCCGCG GTAGCCTGTA CGGCGGTACC TGGCAGATGG AGGTGCCAGA ACTGAAGCTG ACCGGTAACG TCAAACAGAA CAAAGTGAAC GTTGACGGCA CGCTGAAAGG CAACAGTTAT ATGCAGTGGA AGATCCCAGG GCTGCATCTG GAACTGGGGC CAAACAGTGC CGAAGTGAAA GGCGAGCTGG GGGTAAAAGA TCTCAATCTT GATGCCACCA TCAACGCGCC GGGGCTGGAT AACGCGCTGC CGGGGCTTGG CGGTACAGCG AAAGGGCTGG TGAAAGTTCG CGGCACGGTG GAAGCGCCAC AACTACTGGC AGATATCACC GCGCGCGGCC TGCGCTGGCA GGAACTTTCC GTGGCGCAGG TTCGCGTGGA AGGCGATATC 
AAATCCACCG ATCAGATTGC GGGTAAACTT GACGTACGCG TTGAGCAAAT CTCGCAGCCC GATGTGAGTA TCAACCTCGT CACCCTGAAT GCCAAAGGCA GCGAAAAACA GCACGAGTTA CAGTTGCGGA TTCAGGGCGA GCCGGTTTCC GGGCAGCTTA ATCTGGCAGG AAGTTTTGAT CGCAAAGAAG AACGCTGGAA GGGAACTCTT AGCAATACCC GCTTCCAGAC GCCGGTCGGC CCGTGGTCGC TGACCCGCGA TATTGCGCTG GATTACCGCA ATAAGGAGCA AAAAATCAGC ATCGGGCCAC ACTGTTGGCT TAACCCGAAT GCGGAATTGT GCGTGCCGCA AACTATCGAT GCGGGGGCCG AAGGGCGTGC GGTGGTGAAT CTCAACCGCT TCGACCTCGC CATGCTGAAA CCGTTTATGC CAGAAACCAC TCAGGCCAGC GGTATCTTCA CGGGTAAAGC GGACGTTGCC TGGGACACCA CGAAAGAGGG GCTGCCGCAG GGCAGTATCA CCCTTTCGGG GCGTAATGTG CAGGTAACGC AAACCGTCAA CGATGCGGCG CTGCCCGTGG CGTTTCAGAC GCTGAATCTG ACGGCGGAAT TGCGTAACAA CCGTGCCGAA TTGGGCTGGA CCATCCGCCT GACCAATAAC GGCCAGTTTG ATGGACAGGT GCAGGTGACC GATCCGCAAG GCCGCCGTAA TCTTGGTGGC AACGTCAATA TCCGTAACTT CAACCTTGCG ATGATAAACC CTATCTTTAC CCGTGGGGAA AAAGCTGCGG GGATGGTGAG TGCCAACTTA CGTCTGGGTG GTGATGTGCA AAGCCCGCAG TTGTTTGGGC AGCTACAGGT TACGGGTGTG GATATCGACG GCAACTTTAT GCCGTTTGAT ATGCAGCCGA GCCAGCTTGC GGTCAACTTT AGCGGTATGC GCTCGACGCT TGCCGGTACA GTACGGACCC AGCAGGGTGA GATCTACCTG AACGGCGACG CCGACTGGAG CCAGATTGAA AACTGGCGGG CGCGGGTAAC AGCGAAGGGC AGTAAAGTGC GTATCACCGT GCCGCCGATG GTACGAATGG ATGTATCACC AGATGTTGTA TTCGAGGCTA CACCAAACCT GTTTACCCTC GATGGTCGCG TGGATGTCCC GTGGGCGCGC ATCGTGGTGC ATGAGCTGCC GGAAAGCGCG GTGGGCGTCT CCAGCGATGT GGTGATGCTT AACGATAACC TGCAACCGGA AGAGGCAAAA ACGGCGTCGA TTCCGATTAA CAGCAACCTG ATTGTCCACG TCGGCAACAA TGTGCGCATT GACGCCTTTG GCCTGAAAGC GCGGCTGACG GGCGATCTTA ACGTCGTACA GGATAAACAA GGGCTGGGCC TGAACGGGCA GATCAACATC CCTGAAGGGC GCTTCCATGC CTATGGTCAG GATCTGATTG TGCGTAAGGG TGAGTTACTG TTCTCTGGTC CGCCGGATCA ACCGTATCTT AATATCGAAG CTATTCGTAA CCCGGATGCT ACAGAAGACG ACGTAATCGC CGGAGTTCGC GTCACTGGTC TGGCGGACGA ACCGAAAGCG GAGATCTTCT CTGACCCGGC GATGTCGCAA CAGGCTGCCT TGTCTTATTT GCTACGTGGA CAAGGGTTGG AGAGCGATCA GAGCGACAGT GCGGCGATGA CCTCGATGCT GATTGGTCTG GGGGTTGCAC AAAGTGGTCA GATTGTGGGT AAAATCGGCG AGACGTTTGG CGTAAGCAAT TTAGCGCTCG ACACCCAGGG AGTAGGCGAC TCCTCCCAGG 
TGGTGGTCAG CGGCTATGTA TTGCCAGGTC TGCAAGTGAA ATACGGCGTG GGTATATTTG ACTCTATAGC AACACTCACG TTACGTTATC GCCTGATGCC TAAGCTATAT CTGGAAGCCG TGTCTGGTGT AGACCAGGCA CTGGATTTGC TCTATCAGTT CGAGTTTTAG
Protein sequence | MSLWKKISLG VVIVILLLLG SVAFLVGTTS GLHLVFKAAD RWVPGLDIGK VTGGWRDLTL SDVRYEQPGV AVKAGNLHLA VGLECLWNSS VCINDLALKD IQVNIDSKKM PPSEQVEEEE DSGPLDLSTP YPITLTRVAL DNVNIKIDDT TVSVMDFTSG LNWQEKTLTL KPTSLKGLLI ALPKVADVAQ EEVVEPKIEN PQPDEKPLGE TLKDLFSRPV LPEMTDVHLP LNLNIEEFKG EQLRVTGDTD ITVSTMLLKV SSIDGNTKLD ALDIDSSQGI VNASGTAQLS DNWPVDITLN STLNVEPLKG EKVKLKVGGA LREQLEIGVN LSGPVDMDLR AQTRLAEAGL PLNVEVNSKQ LYWPFTGEKQ YQADDLKLKL TGKMTDYTLS MRTAVKGQEI PPATITLDAK GNEQQVNLDK LTVAALEGKT ELKALLDWQQ AISWRGELTL NGINTAKEFP EWPSKLNGLI KTRGSLYGGT WQMEVPELKL TGNVKQNKVN VDGTLKGNSY MQWKIPGLHL ELGPNSAEVK GELGVKDLNL DATINAPGLD NALPGLGGTA KGLVKVRGTV EAPQLLADIT ARGLRWQELS VAQVRVEGDI KSTDQIAGKL DVRVEQISQP DVSINLVTLN AKGSEKQHEL QLRIQGEPVS GQLNLAGSFD RKEERWKGTL SNTRFQTPVG PWSLTRDIAL DYRNKEQKIS IGPHCWLNPN AELCVPQTID AGAEGRAVVN LNRFDLAMLK PFMPETTQAS GIFTGKADVA WDTTKEGLPQ GSITLSGRNV QVTQTVNDAA LPVAFQTLNL TAELRNNRAE LGWTIRLTNN GQFDGQVQVT DPQGRRNLGG NVNIRNFNLA MINPIFTRGE KAAGMVSANL RLGGDVQSPQ LFGQLQVTGV DIDGNFMPFD MQPSQLAVNF SGMRSTLAGT VRTQQGEIYL NGDADWSQIE NWRARVTAKG SKVRITVPPM VRMDVSPDVV FEATPNLFTL DGRVDVPWAR IVVHELPESA VGVSSDVVML NDNLQPEEAK TASIPINSNL IVHVGNNVRI DAFGLKARLT GDLNVVQDKQ GLGLNGQINI PEGRFHAYGQ DLIVRKGELL FSGPPDQPYL NIEAIRNPDA TEDDVIAGVR VTGLADEPKA EIFSDPAMSQ QAALSYLLRG QGLESDQSDS AAMTSMLIGL GVAQSGQIVG KIGETFGVSN LALDTQGVGD SSQVVVSGYV LPGLQVKYGV GIFDSIATLT LRYRLMPKLY LEAVSGVDQA LDLLYQFEF
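The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the sequence is read in frame from its first base, ignores the spaces that group the record's sequence into blocks of ten, and uses the standard codon assignments (translation table 11 assigns codons to amino acids the same way as the standard code and differs only in which codons may act as starts). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any library.

```python
from itertools import product

# Standard codon assignments in NCBI order (bases T, C, A, G).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace is removed first, so sequences grouped in blocks of ten
    bases can be pasted in directly. Trailing bases that do not fill a
    complete codon are ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGAGTTTAT GGAAAAAAAT CAGCCTCGGC"))
```

The first ten codons translate to `MSLWKKISLG`, the opening residues of the listed protein sequence, and the final codons `TTC GAG TTT TAG` give `FEF` followed by the stop.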