Gene EcSMS35_1206 details


Gene Information

Locus tag: EcSMS35_1206
Symbol:
ID: 6142895
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 1210145
End bp: 1212682
Gene Length: 2538 bp
Protein Length: 845 aa
Translation table: 11
GC content: 48%
IMG OID: 641616083
Product: putative prophage side tail fiber protein
Protein accession: YP_001743266
Protein GI: 170679883
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG3064] Membrane protein involved in colicin uptake
TIGRFAM ID:
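
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end coordinates are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative only and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Expected protein length for an unspliced CDS that ends in one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(1210145, 1212682)
print(length)                     # 2538
print(protein_length_aa(length))  # 845
```

Both values agree with the Gene Length and Protein Length fields reported for this locus.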


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 36
Fosmid unclonability p-value: 0.116414
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGCAG TAAAAATCTC AGGTGTGCTG AAAGATGGTG CGGGAAAACC AATACAGAAC 
TGCACTATTC AACTGAAGGC AAAGCGTAAC AGCACCACGG TACTGGTGAA CACGGTGGCT
TCTGAAAATC CGGATGAAGC CGGACGTTAC AGCATGGATG TTGAGTATGG CCAGTACAGC
GTCACCCTGC TGGTTGAAGG TTTTCCGCCT TCACATGCCG GGACCATTAC CGTCTATGAA
GGTTCCAGAC CAGGTACGCT GAATGATTTT CTCGGTGCCA TGACGGAAGA TGATGTCATG
CCGGAGGCAT TGCGTCGTTT TGAGGCAATG GTGGAAGAAG CGGCACGCAA CGCCGAAGCC
GCCTCTCAGA GCGCAGCGGC GGCAAAGAAA TCCGAAACTG CAGCGGCATC ATCGAAGAAC
GCGGCGAAAA CCTCAGAAAC GAATGCAGCT AACAGCGCAC AGGCGGCAGC GACCTCACAG
ACTGCATCGG CAAACTCCGC GACAGCAGCC AAAAAATCAG AAACCAACGC GAAAAACAGC
GAGACAGCCG CAAAGACGAG CGAAACCAAC GCAAAGTCCA GCCAGACGGC AGCGAAGACC
AGCGAAACGA ATGCCAAAGC CAGTGAAACT GCGGCAAAAA ACAGCCAGGT TGCAGCAGCC
CAAAGCGAGA GCGCGGCAGC CGGTTCTGCG ACTTCAGCAG CCGGATCAGC AACTGCTGCG
GCTAACAGCC AGAAAGCTGC GAAGACGAGT GAAACTAACG CAAAGTCCAG CCAGACGGCA
GCGAAGACCA GCGAAACGAA TGCCAAAGCC AGTGAAACTG CGGCGAAAAA CAGTCAGGAT
GCTGCAGCCC AAAGCGAGAG TGCCGCAGCT GGTTCTGCAA GTGCGGCGGC TTCTTCTGCC
ACTGCATCAG CCAACAGTCA AAAAGCTGCA AAAACCAGTG AAACCAACGC AAAGGCGAGC
GAGACTGCGG CGGCTAACTC GGCGAAAGCA TCCGCTGCAA GCCAGACGGC TGCAAAAGCA
AGTGAAGACG CAGCCAGAGA GTATGCAAGC CAGGCTGCGG AGCCGTATAA ACAAGTTTTG
CAGCCGCTTC CCGATGTGTG GATACCGTTT AACGATTCAC TGGATATGAT TACGGGCTTT
TCGCCATCAT ATAAAAAGAT TGTTATTGGC GACGACGAAA TAACGATGCC TGGTGACAAG
GTTGTTAAGT TTAAACGCGC ATCGAAAGCA ACCTATATTA ATAAATCTGG TGTACTGACA
GAGGCTGCCA TTGACGAGCC GCGATTTGAA CGTGATGGCC TGCTTATTGA GGGGCAAAGA
ACTAATCTTC TGCTTAATTC AACAAATCCA TCTAAATGGA ATAAGTCAGG CAATCTGGAA
CTCACAGAAA TATCCACGGA TTCTTTTAAT TTTACTTATG GGAGATTTAC TGTAAAAGAT
ACTCTTATTG ATCAGACAAG TGCGATTAAT ATCGTAACGG TTTCTGGCAG TAAAGGATTT
GATGTCACAG GTGATGAAAA ATATGTGACC ATTTCATGCC GTGTCAGAAG TGATGTTGAA
AATATAAGGT GTCGTTTAAG ATTTGAACAT CATGATGGTT CTACTTACAC TTTTTTGGGA
GATGCTTACC TCAATTTATC AACACTTGTA ATTGATAAGA CTGGTGGTGC AGCAAATCGT
ATTATTGCAA AGGCTGTAAA AGATGAGGCT ACTGGTTGGA TTTTCTATCA GGCTACAATT
AATGCACTAG ATACAGAGAG CATGATTGGT GCGATGGTTC AATATGCTCC AGTAAAAGGT
TCAGGCACAG CATCTGGAGA CTATCTGGAT ATCGCAACTC CACAAGTGGA AGGTGGATCA
AGTGCTTCGT CATTTATTGT AACTGATATA ATTGCAAGCA CTCGCGCAAG CGATATGGTG
ACAGTCCCAA TCAAGAATAA CCTTTATAAT CTTCCTTTTA CAGTTCTTTG TGAGGTACAT
AAGAACTGGT ATAAAACGCC AAATGCAGCA CCGCGTGTTT TTGATACCGG CGGTCATCAA
ACCGGAGCGG CTATTATTCT TGGCTTCGGT CGTTCAACAG ATTACGACGG ATTTCCTTAT
TGCGATATAG GTTTGGCTAA CAGACGGGTA AACGAAAACG CATCGCTTGA AAAAATGGTT
ATGGGGATGC GTGTAAAGTC AGATCAGTCT ACGTGCTCAG TAAGTAACGG GCGTATATCC
AGCGAAAAGA AAGCCACATG GTCCTATATT CAGAACACCG CAATTATCCG TATTGGAGGC
CAGACTACAG CCGGGTTGCG TCATTTATTT GGTCATGTCA GGAATTTCAG AATATGGCAC
AAGGCATTGA CTGATGCTCA GGTGGGTGGA TTTGCCCCTA TATTTCCAGA CACCTGTTAT
CACTTAACCC ATTACTGGCC TGCTGCCGCA GATATTCCCG TGGCGAGCGA TAACCCAGTG
CACTATGCGG ATGCCATTCG TTATAATGCT CGAACGCCTC TGCAAGGTTC TTTGCTGCCG
TTAACCCGTC TGGTTTAG
 
Protein sequence
MAAVKISGVL KDGAGKPIQN CTIQLKAKRN STTVLVNTVA SENPDEAGRY SMDVEYGQYS 
VTLLVEGFPP SHAGTITVYE GSRPGTLNDF LGAMTEDDVM PEALRRFEAM VEEAARNAEA
ASQSAAAAKK SETAAASSKN AAKTSETNAA NSAQAAATSQ TASANSATAA KKSETNAKNS
ETAAKTSETN AKSSQTAAKT SETNAKASET AAKNSQVAAA QSESAAAGSA TSAAGSATAA
ANSQKAAKTS ETNAKSSQTA AKTSETNAKA SETAAKNSQD AAAQSESAAA GSASAAASSA
TASANSQKAA KTSETNAKAS ETAAANSAKA SAASQTAAKA SEDAAREYAS QAAEPYKQVL
QPLPDVWIPF NDSLDMITGF SPSYKKIVIG DDEITMPGDK VVKFKRASKA TYINKSGVLT
EAAIDEPRFE RDGLLIEGQR TNLLLNSTNP SKWNKSGNLE LTEISTDSFN FTYGRFTVKD
TLIDQTSAIN IVTVSGSKGF DVTGDEKYVT ISCRVRSDVE NIRCRLRFEH HDGSTYTFLG
DAYLNLSTLV IDKTGGAANR IIAKAVKDEA TGWIFYQATI NALDTESMIG AMVQYAPVKG
SGTASGDYLD IATPQVEGGS SASSFIVTDI IASTRASDMV TVPIKNNLYN LPFTVLCEVH
KNWYKTPNAA PRVFDTGGHQ TGAAIILGFG RSTDYDGFPY CDIGLANRRV NENASLEKMV
MGMRVKSDQS TCSVSNGRIS SEKKATWSYI QNTAIIRIGG QTTAGLRHLF GHVRNFRIWH
KALTDAQVGG FAPIFPDTCY HLTHYWPAAA DIPVASDNPV HYADAIRYNA RTPLQGSLLP
LTRLV
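
The relationship between the gene sequence and the protein sequence can be spot-checked with a minimal sketch. It assumes the standard codon assignments, which translation table 11 shares with table 1 for internal codons (table 11 differs in which codons may act as starts; this gene starts with ATG, so that difference does not matter here). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical, and only the first and last rows of the gene sequence are used to keep the example short.

```python
BASES = "TCAG"
# Amino acids in NCBI order for codons enumerated with bases T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group residues in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_row = "ATGGCAGCAG TAAAAATCTC AGGTGTGCTG AAAGATGGTG CGGGAAAACC AATACAGAAC"
last_row = "TTAACCCGTC TGGTTTAG"

print(translate(first_row))  # MAAVKISGVLKDGAGKPIQN
print(translate(last_row))   # LTRLV
```

The two outputs match the first twenty and the last five residues of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give a value to compare against the reported GC content.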