Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1206 |
Symbol | |
ID | 6142895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1210145 |
End bp | 1212682 |
Gene Length | 2538 bp |
Protein Length | 845 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641616083 |
Product | putative prophage side tail fiber protein |
Protein accession | YP_001743266 |
Protein GI | 170679883 |
COG category | [M] Cell wall/membrane/envelope biogenesis |
COG ID | [COG3064] Membrane protein involved in colicin uptake |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.116414 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAGCAG TAAAAATCTC AGGTGTGCTG AAAGATGGTG CGGGAAAACC AATACAGAAC TGCACTATTC AACTGAAGGC AAAGCGTAAC AGCACCACGG TACTGGTGAA CACGGTGGCT TCTGAAAATC CGGATGAAGC CGGACGTTAC AGCATGGATG TTGAGTATGG CCAGTACAGC GTCACCCTGC TGGTTGAAGG TTTTCCGCCT TCACATGCCG GGACCATTAC CGTCTATGAA GGTTCCAGAC CAGGTACGCT GAATGATTTT CTCGGTGCCA TGACGGAAGA TGATGTCATG CCGGAGGCAT TGCGTCGTTT TGAGGCAATG GTGGAAGAAG CGGCACGCAA CGCCGAAGCC GCCTCTCAGA GCGCAGCGGC GGCAAAGAAA TCCGAAACTG CAGCGGCATC ATCGAAGAAC GCGGCGAAAA CCTCAGAAAC GAATGCAGCT AACAGCGCAC AGGCGGCAGC GACCTCACAG ACTGCATCGG CAAACTCCGC GACAGCAGCC AAAAAATCAG AAACCAACGC GAAAAACAGC GAGACAGCCG CAAAGACGAG CGAAACCAAC GCAAAGTCCA GCCAGACGGC AGCGAAGACC AGCGAAACGA ATGCCAAAGC CAGTGAAACT GCGGCAAAAA ACAGCCAGGT TGCAGCAGCC CAAAGCGAGA GCGCGGCAGC CGGTTCTGCG ACTTCAGCAG CCGGATCAGC AACTGCTGCG GCTAACAGCC AGAAAGCTGC GAAGACGAGT GAAACTAACG CAAAGTCCAG CCAGACGGCA GCGAAGACCA GCGAAACGAA TGCCAAAGCC AGTGAAACTG CGGCGAAAAA CAGTCAGGAT GCTGCAGCCC AAAGCGAGAG TGCCGCAGCT GGTTCTGCAA GTGCGGCGGC TTCTTCTGCC ACTGCATCAG CCAACAGTCA AAAAGCTGCA AAAACCAGTG AAACCAACGC AAAGGCGAGC GAGACTGCGG CGGCTAACTC GGCGAAAGCA TCCGCTGCAA GCCAGACGGC TGCAAAAGCA AGTGAAGACG CAGCCAGAGA GTATGCAAGC CAGGCTGCGG AGCCGTATAA ACAAGTTTTG CAGCCGCTTC CCGATGTGTG GATACCGTTT AACGATTCAC TGGATATGAT TACGGGCTTT TCGCCATCAT ATAAAAAGAT TGTTATTGGC GACGACGAAA TAACGATGCC TGGTGACAAG GTTGTTAAGT TTAAACGCGC ATCGAAAGCA ACCTATATTA ATAAATCTGG TGTACTGACA GAGGCTGCCA TTGACGAGCC GCGATTTGAA CGTGATGGCC TGCTTATTGA GGGGCAAAGA ACTAATCTTC TGCTTAATTC AACAAATCCA TCTAAATGGA ATAAGTCAGG CAATCTGGAA CTCACAGAAA TATCCACGGA TTCTTTTAAT TTTACTTATG GGAGATTTAC TGTAAAAGAT ACTCTTATTG ATCAGACAAG TGCGATTAAT ATCGTAACGG TTTCTGGCAG TAAAGGATTT GATGTCACAG GTGATGAAAA ATATGTGACC ATTTCATGCC GTGTCAGAAG TGATGTTGAA AATATAAGGT GTCGTTTAAG ATTTGAACAT CATGATGGTT CTACTTACAC TTTTTTGGGA GATGCTTACC TCAATTTATC AACACTTGTA ATTGATAAGA CTGGTGGTGC AGCAAATCGT ATTATTGCAA AGGCTGTAAA AGATGAGGCT ACTGGTTGGA TTTTCTATCA GGCTACAATT AATGCACTAG ATACAGAGAG CATGATTGGT GCGATGGTTC AATATGCTCC AGTAAAAGGT TCAGGCACAG CATCTGGAGA CTATCTGGAT ATCGCAACTC CACAAGTGGA AGGTGGATCA AGTGCTTCGT CATTTATTGT AACTGATATA ATTGCAAGCA CTCGCGCAAG CGATATGGTG ACAGTCCCAA TCAAGAATAA CCTTTATAAT CTTCCTTTTA CAGTTCTTTG TGAGGTACAT AAGAACTGGT ATAAAACGCC AAATGCAGCA CCGCGTGTTT TTGATACCGG CGGTCATCAA ACCGGAGCGG CTATTATTCT TGGCTTCGGT CGTTCAACAG ATTACGACGG ATTTCCTTAT TGCGATATAG GTTTGGCTAA CAGACGGGTA AACGAAAACG CATCGCTTGA AAAAATGGTT ATGGGGATGC GTGTAAAGTC AGATCAGTCT ACGTGCTCAG TAAGTAACGG GCGTATATCC AGCGAAAAGA AAGCCACATG GTCCTATATT CAGAACACCG CAATTATCCG TATTGGAGGC CAGACTACAG CCGGGTTGCG TCATTTATTT GGTCATGTCA GGAATTTCAG AATATGGCAC AAGGCATTGA CTGATGCTCA GGTGGGTGGA TTTGCCCCTA TATTTCCAGA CACCTGTTAT CACTTAACCC ATTACTGGCC TGCTGCCGCA GATATTCCCG TGGCGAGCGA TAACCCAGTG CACTATGCGG ATGCCATTCG TTATAATGCT CGAACGCCTC TGCAAGGTTC TTTGCTGCCG TTAACCCGTC TGGTTTAG
|
Protein sequence | MAAVKISGVL KDGAGKPIQN CTIQLKAKRN STTVLVNTVA SENPDEAGRY SMDVEYGQYS VTLLVEGFPP SHAGTITVYE GSRPGTLNDF LGAMTEDDVM PEALRRFEAM VEEAARNAEA ASQSAAAAKK SETAAASSKN AAKTSETNAA NSAQAAATSQ TASANSATAA KKSETNAKNS ETAAKTSETN AKSSQTAAKT SETNAKASET AAKNSQVAAA QSESAAAGSA TSAAGSATAA ANSQKAAKTS ETNAKSSQTA AKTSETNAKA SETAAKNSQD AAAQSESAAA GSASAAASSA TASANSQKAA KTSETNAKAS ETAAANSAKA SAASQTAAKA SEDAAREYAS QAAEPYKQVL QPLPDVWIPF NDSLDMITGF SPSYKKIVIG DDEITMPGDK VVKFKRASKA TYINKSGVLT EAAIDEPRFE RDGLLIEGQR TNLLLNSTNP SKWNKSGNLE LTEISTDSFN FTYGRFTVKD TLIDQTSAIN IVTVSGSKGF DVTGDEKYVT ISCRVRSDVE NIRCRLRFEH HDGSTYTFLG DAYLNLSTLV IDKTGGAANR IIAKAVKDEA TGWIFYQATI NALDTESMIG AMVQYAPVKG SGTASGDYLD IATPQVEGGS SASSFIVTDI IASTRASDMV TVPIKNNLYN LPFTVLCEVH KNWYKTPNAA PRVFDTGGHQ TGAAIILGFG RSTDYDGFPY CDIGLANRRV NENASLEKMV MGMRVKSDQS TCSVSNGRIS SEKKATWSYI QNTAIIRIGG QTTAGLRHLF GHVRNFRIWH KALTDAQVGG FAPIFPDTCY HLTHYWPAAA DIPVASDNPV HYADAIRYNA RTPLQGSLLP LTRLV
|
| |