Gene EcSMS35_4341 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4341 
Symbol 
ID6143363 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4431634 
End bp4433655 
Gene Length2022 bp 
Protein Length673 aa 
Translation table11 
GC content49% 
IMG OID641619162 
Productputative phage tail fiber protein 
Protein accessionYP_001746286 
Protein GI170683508 
COG category[R] General function prediction only 
COG ID[COG5301] Phage-related tail fibre protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones32 
Fosmid unclonability p-value0.0213021 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCACAA AATTCAAAAC CGTTATCACC ACTGCCGGTG CAGCAAAGCT GGCAGCGGCA 
ACCGCGCCGG GAGGGCGGAA GGTCAACATT ACCACGATGG CCGTCGGGGA TGGCGGTGGT
AAATTGCCTG TCCCGGATGC CGGACAGACC GGGCTTATCC ACGAAGTCTG GCGACATGCG
CTGAACAAAA TCAGCCAGGA CAAACGAAAC AGTAATTATA TTATCGCAGA GCTGGTTATT
CCGCCGGAGG TGGGCGGTTT CTGGATGCGT GAGCTTGGCC TGTACGATGA TGCGGGAACG
TTAATTGCCG TGGCGAACAT GGCCGAAAGT TATAAGCCAG CCCTTGCCGA AGGCTCAGGA
CGTTCGCAGA CCTGCCGCAT GGTCATTATC GTCAGCAGTG TAGCCTCAGT GGATCTGACC
ATCGACACCA CAACGGTGAT GGCGACGCAG GATTACGTTG ATGACAAAAT TGCAGAACAT
GAACAGTCAC GACGTCACCC TGACGCCTCG CTGACCGCAA AAGGTTTTAC TCAGTTAAGC
AGTGCGACCA ACAGCACGTC TGAAACACTG GCTGCAACGC CGAAAGCCGT TAAGACGGTA
ATGGATGAAA CGAACAAGAA AGCGCCATTA AACAGCCCTG CACTGACCGG AACGCCAACG
ACGCCAACTG CGCGACAGGG AACGAATAAT ACTCAGATCG CAAACACGGC TTTCGTTATG
GCCGCGATTG CCGCCCTTGT AGACTCGTCG CCTGACGCAC TGAATACGCT GAACGAGCTG
GCGGCGGCGC TGGGCAATGA CCCGAATTTT GCTACCACCA TGACTAATGC GCTTGCGGGT
AAGCAACCGA AAGATGCCAC CCTGACGGCA CTGGCGGGGC TTGCTACTGC GGCAGACAGG
TTTCCGTATT TTATTGGGAA TGATGTTGCC AGCTTGGCAA CCCTGACAAA AGTCGGGCGG
GATATTCTGG CTAAATCGAC CGTTGCCGCC GTTATCGAAT ATCTCGGTTT ACAGGAAACG
GTAAACCGAG CCGGGAACGC CGTGCAAAAA AATGGCGATA CCTTGTCCGG TGGACTTACT
TTTGAAAACG ACTCAATCCT TGCCTGGATT CGAAATACTG ACTGGGCGAA GATTGGATTT
AAAAATGATG CCGATGGTGA CACTGATTCA TTCATGTGGT TTGAAACGGG GGATAACGGC
AATGAATATT TCAAATGGAG AAGCCGCCAG AGTACTACAA CAAAAGACCT GATGAATCTT
AAATGGGATG CTTTGTATGT TCTTGTCAAT GCCATTGTAA ATGGCGAAGT CATATCAAAA
TCAGCAAACG GCCTACGTAT TGCTTATGGT AATTACGGAT TCTTTATTCG TAATGATGGT
TCAAATACAT ACTTCATGTT GACAAACTCC GGTGACAACA TGGGGACTTA TAACGGATTA
AGGCCATTAT GGATTAATAA CGCTACTGGC GCTGTTTCGA TGGGGCGTGG CCTTAATGTT
TCAGGGGAGA CGCTTTCAGA CCGTTTTGCT ATTAACAGCA GTAATGGTAT GTGGATTCAG
ATGCGCGATA ACAACGCTAT CTTTGGGAAA AATATAGTTA ACACTGATAG CGCTCAGGCG
TTGCTTCGCC AGAATCACGC TGACCGCAAG TTCATGATAG GTGGACTGGG GAACAAGCAA
TTTGGCATCT ACATGATTAA TAACTCAAGG ACAGCCAATG GCACCGATGG TCAGGCGTAC
ATGGATAATA ACGGTAACTG GCTTTGTGGC TCGCAAGTTA TTCCTGGCAA CTATGGCAAT
TTTGATTCCA GATATGTGAA AGATGTTCGA CTTGGGTCAC AGCAATATTA TGGAGTGAAC
AACTGGCAAA CATGGAATTT CCAGTGCCCG TCAGGTCATG TATTGTCTGG TATTAATGTT
CAGGATACAG GGTCCAACTC TGCCGATAAT ATAGCGGGCG TTTATTACAG ACCCGTTCAA
AAGTATATAA ATGGCACCTG GTATAATGTA GCGAGCGTTT AA
 
Protein sequence
MSTKFKTVIT TAGAAKLAAA TAPGGRKVNI TTMAVGDGGG KLPVPDAGQT GLIHEVWRHA 
LNKISQDKRN SNYIIAELVI PPEVGGFWMR ELGLYDDAGT LIAVANMAES YKPALAEGSG
RSQTCRMVII VSSVASVDLT IDTTTVMATQ DYVDDKIAEH EQSRRHPDAS LTAKGFTQLS
SATNSTSETL AATPKAVKTV MDETNKKAPL NSPALTGTPT TPTARQGTNN TQIANTAFVM
AAIAALVDSS PDALNTLNEL AAALGNDPNF ATTMTNALAG KQPKDATLTA LAGLATAADR
FPYFIGNDVA SLATLTKVGR DILAKSTVAA VIEYLGLQET VNRAGNAVQK NGDTLSGGLT
FENDSILAWI RNTDWAKIGF KNDADGDTDS FMWFETGDNG NEYFKWRSRQ STTTKDLMNL
KWDALYVLVN AIVNGEVISK SANGLRIAYG NYGFFIRNDG SNTYFMLTNS GDNMGTYNGL
RPLWINNATG AVSMGRGLNV SGETLSDRFA INSSNGMWIQ MRDNNAIFGK NIVNTDSAQA
LLRQNHADRK FMIGGLGNKQ FGIYMINNSR TANGTDGQAY MDNNGNWLCG SQVIPGNYGN
FDSRYVKDVR LGSQQYYGVN NWQTWNFQCP SGHVLSGINV QDTGSNSADN IAGVYYRPVQ
KYINGTWYNV ASV