Gene Information |
Locus tag | EcSMS35_0811 |
Symbol | |
ID | 6145769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 813238 |
End bp | 814194 |
Gene Length | 957 bp |
Protein Length | 318 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615699 |
Product | hypothetical protein |
Protein accession | YP_001742891 |
Protein GI | 170682288 |
COG category | [S] Function unknown |
COG ID | [COG0392] Predicted integral membrane protein |
TIGRFAM ID | [TIGR00374] conserved hypothetical protein |
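The coordinate and length fields above are mutually consistent, which a short sketch can illustrate. It assumes that Start bp and End bp are 1-based inclusive positions and that Gene Length counts the terminal stop codon; the record does not state either convention, so treat both as assumptions. The function names are hypothetical and are not part of any database API.

```python
# Sketch: relate Start bp / End bp to Gene Length and Protein Length.
# Assumption: coordinates are 1-based and inclusive on both ends.
# Assumption: the gene length includes the stop codon, which is not translated.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, includes_stop: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length."""
    codons = gene_len_bp // 3
    return codons - 1 if includes_stop else codons


if __name__ == "__main__":
    length = gene_length_bp(813238, 814194)
    print(length)                     # 957
    print(protein_length_aa(length))  # 318
```

Under those assumptions the values reproduce the 957 bp and 318 aa listed in the record. The strand field does not affect the arithmetic, since the length of the interval is the same on either strand.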
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.00494476 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 51 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAGTAAAT CACACCCGCG CTGGCGCTTA GCAAAGAAAC TCCTCACCTG GCTGTTTTTT ATCGCGGTGA TTGTGTTACT GGTGGTCTAC GCCAAAAAAG TGGACTGGGA AGAGGTCTGG AAGGTCATCC GCGACTACAA TCGCGTTGCG CTGCTTAGTG CGGTCGGGCT GGTGGTCGTC AGTTATCTGA TTTACGGTTG CTACGACCTG CTCGCCCGTT TCTACTGCGG TCACAAACTG GCGAAGCGCC AGGTGATGCT GGTGTCGTTT ATCTGCTACG CCTTCAACCT GACACTCAGT ACCTGGGTCG GCGGCATTGG TATGCGCTAT CGTTTGTACT CCCGGCTAGG GTTACCGGGC AGCACTATTA CGCGGATTTT CTCGCTCAGT ATTACCACCA ACTGGCTGGG CTATATTTTG CTGGCAGGGA TTATCTTTAC CGCAGGCGTG GTGGAGCTGC CGGACCACTG GTATGTCGAT CAAACCACGC TGCGCATTCT CGGCATTGGC TTACTGATGA TTATCGCGGT TTATTTGTGG TTTTGCGCTT TCGCGAAGCA CCGCCATATG ACCATCAAAG GACAAAAACT GGTGCTGCCT TCATGGAAAT TCGCCCTCGC CCAAATGCTG ATTTCCAGTG TTAACTGGAT GGTAATGGGG GCGATTATCT GGCTGTTACT TGGTCAAAGC GTGAACTATT TCTTTGTACT GGGCGTGTTA CTGGTTAGTA GTATTGCTGG CGTCATCGTG CATATTCCAG CGGGGATTGG TGTGCTGGAA GCGGTGTTTA TCGCGCTACT GGCTGGGGAG CATACATCCA AGGGCACAAT TATCGCCGCC CTACTCGCTT ACCGTGTGCT GTATTACTTT ATCCCGCTGC TGCTGGCGCT GGTTTGCTAT CTGGTACTGG AAAGCCAGGC GAAGAAGCTG CGGGCGAAAA ATGAAGCGGC GATGTGA
Protein sequence | MSKSHPRWRL AKKLLTWLFF IAVIVLLVVY AKKVDWEEVW KVIRDYNRVA LLSAVGLVVV SYLIYGCYDL LARFYCGHKL AKRQVMLVSF ICYAFNLTLS TWVGGIGMRY RLYSRLGLPG STITRIFSLS ITTNWLGYIL LAGIIFTAGV VELPDHWYVD QTTLRILGIG LLMIIAVYLW FCAFAKHRHM TIKGQKLVLP SWKFALAQML ISSVNWMVMG AIIWLLLGQS VNYFFVLGVL LVSSIAGVIV HIPAGIGVLE AVFIALLAGE HTSKGTIIAA LLAYRVLYYF IPLLLALVCY LVLESQAKKL RAKNEAAM
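The gene and protein sequences are displayed in space-separated blocks of ten characters. As a minimal sketch of how the two fields relate, the code below strips the block spacing and translates complete codons with the standard amino acid assignments, which translation table 11 shares with table 1 (the two tables differ in their permitted start codons, not in how codons are decoded). The helper names `unblock` and `translate` are hypothetical, and the example only uses the first two blocks of the gene sequence rather than the whole record.

```python
# Sketch: rebuild a codon table and translate the start of the gene sequence.
# The amino acid string follows the NCBI ordering with bases in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def unblock(seq: str) -> str:
    """Remove the whitespace between ten-character display blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate complete codons in frame 0; trailing partial codons are ignored."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


if __name__ == "__main__":
    # First two display blocks of the gene sequence in this record.
    prefix = unblock("ATGAGTAAAT CACACCCGCG")
    print(translate(prefix))  # MSKSHP
```

The translated prefix matches the first six residues of the protein sequence shown above. The gene sequence is listed in coding orientation, so no reverse complement is applied here even though the gene lies on the minus strand of the replicon.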