Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2661 |
Symbol | |
ID | 6145103 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2729183 |
End bp | 2731357 |
Gene Length | 2175 bp |
Protein Length | 724 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641617532 |
Product | putative adhesin |
Protein accession | YP_001744697 |
Protein GI | 170681535 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 42 |
Fosmid unclonability p-value | 0.440388 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGCGAT GGACACGCTG TATTATGCTA ACATTTATCT CTGGGGCTGC TTTTGCGGCT CCAGAGATAA ATGTTGAGCA TAAAGAATCA TTACCTGATT TAGGTAGTCA GGCCGCACAA CAAGAAGAAC AAACCAACAA GGGGAAATCG CTAAAAGAGC GTGGTGCCGA TTACGTCATT AACTCCGCTA CGCAAGGGTT TGAAAACCTG ACACCCGAGG CGCTGGAGTC CCAGGCCAGA AGCTATCTGC AAAGTCAAAT TACCTCTACG ACACAATCTT ATATTGAGGA TACACTCTCC CCCTACGGTA AGGTTCGAAC GAATCTTTCT CTTGGCCAGG GCGGCGATCT GGATGGCAGT TCCATCGACT ATTTTGTTCC CTGGTATGAT AATCAAACTA CTGTTTATTT CAGCCAATTT TCTGCACAAC GAAAAGAAGA TCGTACGATC GGGAATATTG GGCTTGGTGT AAGACATAAT TTTGATAAAT GGTTGTTGGG TGGAAATATA TTTTATGATT ATGATTTTAC CCGTGGGCAT CGCCGTTTAG GTTTAGGTGT CGAAGCGTGG ACTGATTATT TAAAATTCTC TGGCAACTAT TATCACCCGC TTTCTGACTG GAAAGACTCC GAAGATTTCG ATTTTTATGA GGAGCGTCCT GCGCGCGGCT GGGATATTCG CGCCGAAGCC TGGCTTCCCG CTTATCCGCA GCTTGGTGGT AAAATTGTTT TCGAACAGTA TTACGGTAAT GAAGTCGCGC TTTTTGGCAC TGATAATCTG GAGAAAGATC CCTTCGCTGT CACGCTGGGT GTGAAATACC AGCCCGTTCC GCTCATTGCT GTCGGTACAG ATTTCAAAGC TGGCACTGGC GATAATACCG ATCTTTCTGT TAACGCAACG CTTAATTATC AGTTTGGCGT CCCGCTCAAA GACCAGCTTG ATCCTGACAA AGTGAGCGCT GCTCACTCGC TTATGGGCAG CCGTCATGAC TTTGTTGAAC GTAACAACTT TATTGTTCTG GAATACAAAG AAAAAGATCC CCTCGACGTC ACTTTGTGGC TGAAAGCCGA CGCCACGAAT GAACATCCCG AGTGCGTGAT TAAAGATACG CCTGAAGAAG CTATCGGGCT GGAAAAATGT AAATGGACCA TTAACGCGCT CATTAACCAT CACTACAAAA TCGTCGCTGC CTCCTGGCAG GCGAAAAACA ATGCTGCACG TACGCTGGTG ATGCCGGTTA TCAAAGAAAA CACGCTTACC GAAGGTAACA ACAACCACTG GAATCTGGTG CTGCCTGCCT GGCAGTACAG TTCTGATAAA GCCGAACAAG AAAAACTCAA TACCTGGCGG GTTCGTCTGG CGCTGGAAGA TGAAAAAGGT AATCGACAGA ATTCCGGCGT GGTGGAAATC ACCGTTCAGC AGGATCGCAA AATTGAGCTC ATCGTTAACA ACATCGCCGA TGTACCAGAT GAGAACAATC ACAGTCATGA AGCCAGCGCA CAGGCCGATG GTGTAGATGG TGTGGTGATG GATCTCGATA TTACTGATAG CTTTGGCGAC AACACTGACC GCAACGGTAA TGTCTTGCCG CAGGATAATC TCAACCCGCA GTTGTTTGAC GCCAACGATA AAAAAGTGAC GCTGACGAAT AAACCCTGCA CTACCGAGAC TCCCTGTGTC TTTATCGCCA AACAAGATAA AGAAAAGGGG ACCGTTACGC TCTCCAGTAC ATTGCCTGGC ACCTTCCGTT GGAAGGCGAA AGCCGCTCCC TATGATGACA GTAACTATGT TGACGTAACG TTCCTGGGTT CTGACATCGG CGGGCTGAAT GCCTTTATCT ATCGTGTTGG TGCGGCTAAG CCCGTGAATC TTATCGGCAA TAAAGAGCCG TTGCCTCTCA ATAGCAGCTA TCGCTTTGTG TTGTGGCGTG ACGCTAACAA AGACGGCGTG TTCCAGCTAT CAGAGAAATT CACTGAAGAG GAGATGAAAC AGTATGACTA TCAGTGGGAA TTCACGGGAC ATAGCGTGAA TGGCAATACC GGTGCGCAAG CCAATACCAC TAACGCCGAT ATAGAGATCC CGGCAACGAA CAAGGACGCG GCAACGAAGT TTAGTGCCCA GGTAACCGAT GGCGTGCAGG GATACGGTTT GCAGGTCAAT TACAGCAAGA AATAG
|
Protein sequence | MLRWTRCIML TFISGAAFAA PEINVEHKES LPDLGSQAAQ QEEQTNKGKS LKERGADYVI NSATQGFENL TPEALESQAR SYLQSQITST TQSYIEDTLS PYGKVRTNLS LGQGGDLDGS SIDYFVPWYD NQTTVYFSQF SAQRKEDRTI GNIGLGVRHN FDKWLLGGNI FYDYDFTRGH RRLGLGVEAW TDYLKFSGNY YHPLSDWKDS EDFDFYEERP ARGWDIRAEA WLPAYPQLGG KIVFEQYYGN EVALFGTDNL EKDPFAVTLG VKYQPVPLIA VGTDFKAGTG DNTDLSVNAT LNYQFGVPLK DQLDPDKVSA AHSLMGSRHD FVERNNFIVL EYKEKDPLDV TLWLKADATN EHPECVIKDT PEEAIGLEKC KWTINALINH HYKIVAASWQ AKNNAARTLV MPVIKENTLT EGNNNHWNLV LPAWQYSSDK AEQEKLNTWR VRLALEDEKG NRQNSGVVEI TVQQDRKIEL IVNNIADVPD ENNHSHEASA QADGVDGVVM DLDITDSFGD NTDRNGNVLP QDNLNPQLFD ANDKKVTLTN KPCTTETPCV FIAKQDKEKG TVTLSSTLPG TFRWKAKAAP YDDSNYVDVT FLGSDIGGLN AFIYRVGAAK PVNLIGNKEP LPLNSSYRFV LWRDANKDGV FQLSEKFTEE EMKQYDYQWE FTGHSVNGNT GAQANTTNAD IEIPATNKDA ATKFSAQVTD GVQGYGLQVN YSKK
|
| |