Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4345 |
Symbol | |
ID | 6143764 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4435619 |
End bp | 4436809 |
Gene Length | 1191 bp |
Protein Length | 396 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 641619166 |
Product | phage tail sheath protein |
Protein accession | YP_001746290 |
Protein GI | 170683123 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 34 |
Fosmid unclonability p-value | 0.0327685 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGTGACT ATCATCACGG CGTGCAGGTG CTGGAGATTA ACGACGGCAC CCGCGTCATT TCCACCGTAT CCACCGCCAT TGTTGGCATG GTCTGCACGG CCAGCGATGC GGATGCGGAA ATCTTCCCCC TCAATAAACC AGTGCTGATT ACCAATGTGC AGAGCGCAAT TGCAAAGGCC GGTAAAAAAG GCACGCTGGC GGCATCGTTG CAGGCCATCG CCGACCAGTC AAAACCGGTC ACCGTTGTCG TGCGTGTGGA AGACGGCACC GGCGACGACG AGGAAACGAA ACTTGCGCAG ACCGTTTCCA ATATCATCGG CACCACCGAC GAAAACGGTC AGTACACCGG ACTGAAAGCC CTGCTGGCGG CGGAGTCGGT AACCGGTGTT AAACCGCGTA TTCTCGGCGT GCCGGGGCTG GATACCAAAG AGGTGGCTGT TGCACTGGCA TCAGTCTGTC AGAAGCTGCG CGCTTTCGGG TATATCAGCG CATGGGGCTG TAAAACCATT TCCGAGGTGA AAGCCTACCG CCAGAATTTC AGCCAGCGTG AGCTGATGGT CATCTGGCCG GATTTCCTCG CATGGGATAC GGTCACCAGC ACCACCGCCA CCGCGTATGC CACCGCCCGT GCGCTGGGTC TGCGTGCCAG AATCGACCAG GAGCAGGGCT GGCATAAAAC GCTGTCCAAT GTCGGGGTGA ACGGTGTTAC CGGCATCAGC GCATCTGTAT TCTGGGATTT GCAGGAGTCC GGCACCGATG CTGACCTGCT TAACGAGTCA GGCGTCACTA CGCTGATTCG CCGCGACGGT TTCCGATTCT GGGGTAACCG TACCTGCTCT GATGACCCGC TGTTCCTCTT TGAAAACTAC ACCCGCACCG CGCAGGTGCT GGCCGACACG ATGGCTGAGG CGCACATGTG GGCGGTGGAC AAGCCCATCA CCGCAACGCT GATTCGCGAC ATCGTTGACG GCATCAATGC CAAATTCCGT GAGCTGAAAA CAAACGGCTA TATCGTGGAT GCGACCTGCT GGTTCAGCGA AGAATCCAAC GATGCGGAAA CCCTTAAGGC CGGAAAACTG TATATCGACT ACGACTATAC CCCGGTGCCT CCTCTTGAAA ACCTGACCCT GCGCCAGCGT ATTACCGATA AATACCTGGC AAATCTGGTC ACCTCGGTTA ACAGCAATTA A
|
Protein sequence | MSDYHHGVQV LEINDGTRVI STVSTAIVGM VCTASDADAE IFPLNKPVLI TNVQSAIAKA GKKGTLAASL QAIADQSKPV TVVVRVEDGT GDDEETKLAQ TVSNIIGTTD ENGQYTGLKA LLAAESVTGV KPRILGVPGL DTKEVAVALA SVCQKLRAFG YISAWGCKTI SEVKAYRQNF SQRELMVIWP DFLAWDTVTS TTATAYATAR ALGLRARIDQ EQGWHKTLSN VGVNGVTGIS ASVFWDLQES GTDADLLNES GVTTLIRRDG FRFWGNRTCS DDPLFLFENY TRTAQVLADT MAEAHMWAVD KPITATLIRD IVDGINAKFR ELKTNGYIVD ATCWFSEESN DAETLKAGKL YIDYDYTPVP PLENLTLRQR ITDKYLANLV TSVNSN
|
| |