Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4687 |
Symbol | |
ID | 6143275 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4785726 |
End bp | 4787219 |
Gene Length | 1494 bp |
Protein Length | 497 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641619503 |
Product | hypothetical protein |
Protein accession | YP_001746611 |
Protein GI | 170682000 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.0268272 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTACCAA TTAAACCCGT TTGTCTCGCA CTGGCTTTAT TTTCACTCAA TTCAATAGCC TGCTCCGGCG TACCATCTGC CTCTTTTTAT TACCAACCGT GGGGGCAAGG AACAAAATAT ATTGTTGAAG ATAGCGATGG TTACGGACTC TATTTTCCGG ATATTCATTT ACCGCAGGTG AGGTTCAAAC AACTGCGCCG TGTATCTGAC GGCACTGATG ATGAATATGA TTCTCCTGAT GATATCAATA ATGATTTTGT GCGGGTGGAT GGTGGCGCTT ATATCAGAAC CAACGGTGGT TGGCGTTTCG ACTGGTTGAC TGATGGCCGC CATATTTTAT GGGCAGGGAA AATAGTTCAG AACCCGCCGG GAACACCGAA AGTCGATGCC GCAACATTCC GCGCATGGGG GCGTTTTGCT GCCGATAAAG ACAGCCTCTA TTTTGATGGC GAACGCACGG ATGATAATCA CGGCGAAAAG CAGGTCGATA TGGATTCGCT GCAACAGGTG GGGGGAAGGA TGAATTCACT TGATTCCGCG GATGTGCTGA AAGATCGTCG CAATCTTTAC TTTCAGGGGC GCTGGTTAGG TAGTGCGCAA GGGTATTCTA TTCTTGACAT TAAACCCTGG ACACGTCGTC CCATTATTTA TTCACCTAAT AGCTGCGAAA GCAAAAGTAA TCCGGGACCC TGGGATACAA TCTGGCGGAC TAAATCGCAG GTCTTTGTGA ATGGAGATGC CATCAACGCC GATCCGGATA CCTTTCATAT TGTGCGCTGG ATAGCCGGTT CACTGCTAAT TTATCGCGAT AAAGACGGGG AGAAACGTTA TGCGTTCGGT AAAAACTGCC GAAGTATATT TGACCTACAG CGGGATAAAG TAACCTGGAT GACGCGTGAT GCTTCCAGGC GTGCTAATGA CTGCCGGGTT GCGACAATTC CTGATGTTGA TCCGGAATAT TTTCGCCCGC TGACGGAGAA TGTGGCGCAA TATAAAGATT CGCTCTATCT GGTGAAATAT ATATCTGCGG TCGAAAAAAC GCTGTCGGTC ATACATTTAC CAGACCCACA ACAAGAATTA CAGAAAGGTG CTAATGTTGT AGGGAACAAA ATTTATTTCA TCGCGCGCGA TGATGTGACT ATTTTTGATA TTAATGGGCA ATGGCAGTGG TATAAAAGCC CCGATGGCAC ACCATTAAAT TATTTAGCAC ATGACGATCG CTATACGTAT TTTATTGATG AAGACACGGT TGAGCATTTT GAACTGAAAG GCCAATGGAC GTGGTTTAAA TACGCCAATG GTCAATTATC AGATACCTTT GCTCATGATG ACCACTATAT TTATTACGTA GGCGATGGGC TGGTACGTGA TGCGAAGCGG CGTAATGAGA CTCGACAACT CGATGCGGCG CATCTTGATA AAAACGGCAG TTTGCTTACC GTTGAAGGTA AGTACACCAG TTACAACAAT GAATTACTTC CACTGAATGA CTGA
|
Protein sequence | MLPIKPVCLA LALFSLNSIA CSGVPSASFY YQPWGQGTKY IVEDSDGYGL YFPDIHLPQV RFKQLRRVSD GTDDEYDSPD DINNDFVRVD GGAYIRTNGG WRFDWLTDGR HILWAGKIVQ NPPGTPKVDA ATFRAWGRFA ADKDSLYFDG ERTDDNHGEK QVDMDSLQQV GGRMNSLDSA DVLKDRRNLY FQGRWLGSAQ GYSILDIKPW TRRPIIYSPN SCESKSNPGP WDTIWRTKSQ VFVNGDAINA DPDTFHIVRW IAGSLLIYRD KDGEKRYAFG KNCRSIFDLQ RDKVTWMTRD ASRRANDCRV ATIPDVDPEY FRPLTENVAQ YKDSLYLVKY ISAVEKTLSV IHLPDPQQEL QKGANVVGNK IYFIARDDVT IFDINGQWQW YKSPDGTPLN YLAHDDRYTY FIDEDTVEHF ELKGQWTWFK YANGQLSDTF AHDDHYIYYV GDGLVRDAKR RNETRQLDAA HLDKNGSLLT VEGKYTSYNN ELLPLND
|
| |