Gene Information |
Locus tag | EcSMS35_1751 |
Symbol | |
ID | 6143902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1754295 |
End bp | 1755638 |
Gene Length | 1344 bp |
Protein Length | 447 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616627 |
Product | hypothetical protein |
Protein accession | YP_001743805 |
Protein GI | 170683072 |
COG category | [S] Function unknown |
COG ID | [COG5383] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | |
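
The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption; the record does not state its coordinate convention) and that the CDS ends in a single stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 1754295
end_bp = 1755638
gene_length_bp = 1344
protein_length_aa = 447

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the CDS has one codon per residue plus one terminal stop codon.
codons = span // 3
expected_protein_length = codons - 1

print(span, span % 3, expected_protein_length)
```

Under these assumptions the computed span matches the listed gene length, the span is a multiple of three, and the derived protein length matches the listed 447 aa.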
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 3 |
Fosmid unclonability p-value | 2.98401e-17 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGGCGAACA GCATCACGGC GGATGAGATT CGGGAACAGT TTTCGCAGGC AATGTCAGCC ATGTACCAGC AAGAAGTTCC GCAGTACGGC ACGCTGCTGG AACTGGTAGC TGATGTGAAT CTGGCTGTGC TGGAAAACAA TCCTCAACTG CACGAAAAAA TGGTAAATGC AGACGAGCTG GCGCGACTGA ATGTTGAACG TCATGGGGCG ATTCGTGTTG GGACTGCACA AGAGCTTGCT ACTCTTCGGC GGATGTTTGC CATTATGGGG ATGTACCCGG TGAGCTATTA CGATCTCTCG CAGGCGGGAG TGCCGGTACA TTCCACAGCA TTTCGGCCCA TTGATGATGC TTCTCTGGCG CGTAATCCCT TCCGCGTTTT TACCTCATTA CTCCGCCTTG AGCTTATCGA GAACGAATTT TTGCGCCAGA AAGCGGCGGA GATTCTGCGT CAGCGCGATA TCTTCACCCC ACGTTGTCGA CAACTGTTAG AGGAGTATGA GCAGCGGGGC GGTTTTAACG AAACACAGGC ACAGGAGTTT GTGCAGGAAG CCCTGGAAAC GTTTCGTTGG CACCAGTCAG CAACGGTAGA TGAAGAAACC TATCGCGCCT TGCACAACGA ACATCGGTTG ATTGCTGATG TGGTCTGTTT TCCTGGATGC CATATCAACC ACCTGACGCC ACGTACGCTG GATATTGACC GGGTGCAGTC GATGATGCCT GAATGCGGGA TTGAACCCAA AATTCTGATC GAAGGGCCGC CGCGCCGCGA GGTATCTATT TTACTACGCC AGACCAGCTT TAAAGCACTG GAAGAGACGG TGTTGTTTGC AGGGCAGAAA CAGGGCACGC ATACCGCGCG CTTTGGTGAA ATAGAGCAGC GTGGCGTGGC ATTAACGCCG AAAGGACGAC AACTGTATGA TGATCTACTG CGGAACGCTG GAACCGGGCA GGATAATCTC ACTCACCAAA TGCATTTACA GGAAACCTTC CGCACTTTTC CTGACAGTGA GTTTTTAATG CGTCAGCAAG GGTTGGCATG GTTCCGGTAC CGTCTGACGC CTTCAGGTGA GGCGCATCGT CAGGCGATTC ATCCCGGAGA CGATCCACAG CCCTTAATTG AACGTGGTTG GGTAGTGGCG CAACCCATTA CCTATGAAGA TTTCTTGCCT GTTAGCGCGG CGGGGATCTT CCAGTCAAAT CTGGGTAATG AAACGCAGGC ACGCAGTCAC GGTAATGCCA GTCGCGAAGC ATTTGAGCAG GCGTTGGGCT GCCCGGTTTT GGATGAGTTC CAGCTTTATC AGGAAGCGGA AGAACGCAGT AAACGTCGCT GTGGTTTGCT TTAA
|
Protein sequence | MANSITADEI REQFSQAMSA MYQQEVPQYG TLLELVADVN LAVLENNPQL HEKMVNADEL ARLNVERHGA IRVGTAQELA TLRRMFAIMG MYPVSYYDLS QAGVPVHSTA FRPIDDASLA RNPFRVFTSL LRLELIENEF LRQKAAEILR QRDIFTPRCR QLLEEYEQRG GFNETQAQEF VQEALETFRW HQSATVDEET YRALHNEHRL IADVVCFPGC HINHLTPRTL DIDRVQSMMP ECGIEPKILI EGPPRREVSI LLRQTSFKAL EETVLFAGQK QGTHTARFGE IEQRGVALTP KGRQLYDDLL RNAGTGQDNL THQMHLQETF RTFPDSEFLM RQQGLAWFRY RLTPSGEAHR QAIHPGDDPQ PLIERGWVVA QPITYEDFLP VSAAGIFQSN LGNETQARSH GNASREAFEQ ALGCPVLDEF QLYQEAEERS KRRCGLL
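
The relationship between the gene sequence and the protein sequence can be spot-checked by translating codons directly. The following is a minimal sketch, not the database's own method: `build_codon_table` and `translate_cds` are hypothetical helper names, and the amino acid assignments are those of the standard genetic code, which translation table 11 shares for internal codons (table 11 differs in which codons may act as start codons; this gene starts with ATG, so that difference does not matter here). Only the first and last blocks of the gene sequence are used, copied from the record.

```python
from itertools import product

# Codon order: first, second, third base each cycling through T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"


def build_codon_table():
    """Map each of the 64 codons to its one-letter amino acid ('*' = stop)."""
    codons = ("".join(c) for c in product(BASES, repeat=3))
    return dict(zip(codons, AMINO_ACIDS))


def translate_cds(dna):
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    table = build_codon_table()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = table[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks of the gene sequence in the record.
gene_prefix = "ATGGCGAACA GCATCACGGC GGATGAGATT"
print(translate_cds(gene_prefix))  # MANSITADEI

# Last 24 bases of the gene sequence, which are in frame and end with the stop codon.
gene_suffix = "AAACGTCGCT GTGGTTTGCT TTAA"
print(translate_cds(gene_suffix))
```

The prefix reproduces the first block of the listed protein sequence, and the suffix can be compared against its final residues in the same way. Because the record shows the sequence starting with ATG, it appears to be given on the coding strand already, so no reverse complement is applied even though the gene lies on the minus strand.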