Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4341 |
Symbol | |
ID | 6143363 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4431634 |
End bp | 4433655 |
Gene Length | 2022 bp |
Protein Length | 673 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641619162 |
Product | putative phage tail fiber protein |
Protein accession | YP_001746286 |
Protein GI | 170683508 |
COG category | [R] General function prediction only |
COG ID | [COG5301] Phage-related tail fibre protein |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0213021 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCACAA AATTCAAAAC CGTTATCACC ACTGCCGGTG CAGCAAAGCT GGCAGCGGCA ACCGCGCCGG GAGGGCGGAA GGTCAACATT ACCACGATGG CCGTCGGGGA TGGCGGTGGT AAATTGCCTG TCCCGGATGC CGGACAGACC GGGCTTATCC ACGAAGTCTG GCGACATGCG CTGAACAAAA TCAGCCAGGA CAAACGAAAC AGTAATTATA TTATCGCAGA GCTGGTTATT CCGCCGGAGG TGGGCGGTTT CTGGATGCGT GAGCTTGGCC TGTACGATGA TGCGGGAACG TTAATTGCCG TGGCGAACAT GGCCGAAAGT TATAAGCCAG CCCTTGCCGA AGGCTCAGGA CGTTCGCAGA CCTGCCGCAT GGTCATTATC GTCAGCAGTG TAGCCTCAGT GGATCTGACC ATCGACACCA CAACGGTGAT GGCGACGCAG GATTACGTTG ATGACAAAAT TGCAGAACAT GAACAGTCAC GACGTCACCC TGACGCCTCG CTGACCGCAA AAGGTTTTAC TCAGTTAAGC AGTGCGACCA ACAGCACGTC TGAAACACTG GCTGCAACGC CGAAAGCCGT TAAGACGGTA ATGGATGAAA CGAACAAGAA AGCGCCATTA AACAGCCCTG CACTGACCGG AACGCCAACG ACGCCAACTG CGCGACAGGG AACGAATAAT ACTCAGATCG CAAACACGGC TTTCGTTATG GCCGCGATTG CCGCCCTTGT AGACTCGTCG CCTGACGCAC TGAATACGCT GAACGAGCTG GCGGCGGCGC TGGGCAATGA CCCGAATTTT GCTACCACCA TGACTAATGC GCTTGCGGGT AAGCAACCGA AAGATGCCAC CCTGACGGCA CTGGCGGGGC TTGCTACTGC GGCAGACAGG TTTCCGTATT TTATTGGGAA TGATGTTGCC AGCTTGGCAA CCCTGACAAA AGTCGGGCGG GATATTCTGG CTAAATCGAC CGTTGCCGCC GTTATCGAAT ATCTCGGTTT ACAGGAAACG GTAAACCGAG CCGGGAACGC CGTGCAAAAA AATGGCGATA CCTTGTCCGG TGGACTTACT TTTGAAAACG ACTCAATCCT TGCCTGGATT CGAAATACTG ACTGGGCGAA GATTGGATTT AAAAATGATG CCGATGGTGA CACTGATTCA TTCATGTGGT TTGAAACGGG GGATAACGGC AATGAATATT TCAAATGGAG AAGCCGCCAG AGTACTACAA CAAAAGACCT GATGAATCTT AAATGGGATG CTTTGTATGT TCTTGTCAAT GCCATTGTAA ATGGCGAAGT CATATCAAAA TCAGCAAACG GCCTACGTAT TGCTTATGGT AATTACGGAT TCTTTATTCG TAATGATGGT TCAAATACAT ACTTCATGTT GACAAACTCC GGTGACAACA TGGGGACTTA TAACGGATTA AGGCCATTAT GGATTAATAA CGCTACTGGC GCTGTTTCGA TGGGGCGTGG CCTTAATGTT TCAGGGGAGA CGCTTTCAGA CCGTTTTGCT ATTAACAGCA GTAATGGTAT GTGGATTCAG ATGCGCGATA ACAACGCTAT CTTTGGGAAA AATATAGTTA ACACTGATAG CGCTCAGGCG TTGCTTCGCC AGAATCACGC TGACCGCAAG TTCATGATAG GTGGACTGGG GAACAAGCAA TTTGGCATCT ACATGATTAA TAACTCAAGG ACAGCCAATG GCACCGATGG TCAGGCGTAC ATGGATAATA ACGGTAACTG GCTTTGTGGC TCGCAAGTTA TTCCTGGCAA CTATGGCAAT TTTGATTCCA GATATGTGAA AGATGTTCGA CTTGGGTCAC AGCAATATTA TGGAGTGAAC AACTGGCAAA CATGGAATTT CCAGTGCCCG TCAGGTCATG TATTGTCTGG TATTAATGTT CAGGATACAG GGTCCAACTC TGCCGATAAT ATAGCGGGCG TTTATTACAG ACCCGTTCAA AAGTATATAA ATGGCACCTG GTATAATGTA GCGAGCGTTT AA
|
Protein sequence | MSTKFKTVIT TAGAAKLAAA TAPGGRKVNI TTMAVGDGGG KLPVPDAGQT GLIHEVWRHA LNKISQDKRN SNYIIAELVI PPEVGGFWMR ELGLYDDAGT LIAVANMAES YKPALAEGSG RSQTCRMVII VSSVASVDLT IDTTTVMATQ DYVDDKIAEH EQSRRHPDAS LTAKGFTQLS SATNSTSETL AATPKAVKTV MDETNKKAPL NSPALTGTPT TPTARQGTNN TQIANTAFVM AAIAALVDSS PDALNTLNEL AAALGNDPNF ATTMTNALAG KQPKDATLTA LAGLATAADR FPYFIGNDVA SLATLTKVGR DILAKSTVAA VIEYLGLQET VNRAGNAVQK NGDTLSGGLT FENDSILAWI RNTDWAKIGF KNDADGDTDS FMWFETGDNG NEYFKWRSRQ STTTKDLMNL KWDALYVLVN AIVNGEVISK SANGLRIAYG NYGFFIRNDG SNTYFMLTNS GDNMGTYNGL RPLWINNATG AVSMGRGLNV SGETLSDRFA INSSNGMWIQ MRDNNAIFGK NIVNTDSAQA LLRQNHADRK FMIGGLGNKQ FGIYMINNSR TANGTDGQAY MDNNGNWLCG SQVIPGNYGN FDSRYVKDVR LGSQQYYGVN NWQTWNFQCP SGHVLSGINV QDTGSNSADN IAGVYYRPVQ KYINGTWYNV ASV
|
| |