Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1462 |
Symbol | |
ID | 6147509 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1446562 |
End bp | 1447953 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641616340 |
Product | sodium/dicarboxylate symporter family protein |
Protein accession | YP_001743520 |
Protein GI | 170683964 |
COG category | [R] General function prediction only |
COG ID | [COG1823] Predicted Na+/dicarboxylate symporter |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.726866 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAACTTTC CATTAATTGC GAACATCGTG GTGTTCGTTG TACTGCTGTT TGCGCTGGCT CAGACCCGCC ATAAACAGTG GAGTCTGGCG AAAAAAGTGC TGGTGGGTCT GGTGATGGGT GTGGTTTTTG GCCTCGCCCT GCATACCATT TATGGTTCTG ACAGCCAGGT ACTTAAAGAT TCTGTACAGT GGTTTAACAT CGTTGGTAAC GGCTATGTTC AACTGCTGCA AATGATCGTT ATGCCGTTAG TCTTTGCCTC TATTCTGAGC GCGGTTGCCC GTCTGCATAA CGCATCTCAG TTAGGCAAAA TCAGTTTTCT GACCATCGGA ACGCTTTTGT TTACCACGCT GATTGCGGCG CTGGTCGGTG TGCTGGTCAC CAACCTGTTT GGTTTGACGG CTGAAGGTCT GGTTCAGGGT GGTGCAGAAA CTGCACGTTT GAACGCCATC GAAAGTAACT ATGTTGGTAA AGTCTCTGAC CTGAGCGTTC CGCAGCTGGT CTTGTCCTTT ATCCCGAAAA ACCCGTTTGC CGATTTGACC GGAGCCAATC CGACGTCAAT TATCAGCGTG GTAATTTTTG CCGCATTCCT CGGCGTAGCT GCGCTCAAGC TGCTGAAGGA TGATGCGCCG AAAGGTGAAC GCGTCTTAAC CGCTATCGAT ACCCTGCAAA GCTGGGTGAT GAAACTGGTT CGCCTGGTCA TGCAGCTGAC CCCTTACGGC GTTCTGGCGC TGATGACCAA AGTGGTTGCA GGTTCTAACC TGCAAGACAT CATCAAACTG GGAAGTTTCG TTGTCGCGTC CTACCTCGGT CTGCTGATTA TGTTTGCAGT GCATGGCCTT CTGCTGGGCA TTAATGGCGT GAGCCCGCTG AAATACTTCC GTAAGGTATG GCCAGTTCTG ACGTTTGCCT TTACCAGCCG TTCCAGTGCT GCGTCTATCC CACTGAATGT GGAAGCACAA ACGCGTCGAC TCGGTGTTCC TGAATCCATC GCCAGTTTCG CCGCCTCTTT CGGTGCAACC ATTGGTCAGA ACGGCTGTGC CGGTTTGTAT CCGGCGATGC TGGCGGTGAT GGTTGCGCCT ACGGTTGGCA TTAACCCGAT GGACCCGATG TGGATTGCGA CGCTGGTCGG TATTGTTACC GTTAGTTCCG CAGGCGTTGC CGGTGTCGGT GGTGGTGCAA CTTTCGCCGC ACTGATTGTG CTGCCTGCGA TGGGCCTGCC AGTAACCCTG GTGGCGCTGT TAATCTCCGT TGAACCGCTT ATCGACATGG GCCGTACGGC GCTAAACGTT AGTGGCTCGA TGACAGCTGG CACGCTGACC AGCCAGTGGC TGAAGCAAAC CGATAAAGCC ATTCTGGATA GCGAAGACGA CGCCGAACTG GCACACCGTT AA
|
Protein sequence | MNFPLIANIV VFVVLLFALA QTRHKQWSLA KKVLVGLVMG VVFGLALHTI YGSDSQVLKD SVQWFNIVGN GYVQLLQMIV MPLVFASILS AVARLHNASQ LGKISFLTIG TLLFTTLIAA LVGVLVTNLF GLTAEGLVQG GAETARLNAI ESNYVGKVSD LSVPQLVLSF IPKNPFADLT GANPTSIISV VIFAAFLGVA ALKLLKDDAP KGERVLTAID TLQSWVMKLV RLVMQLTPYG VLALMTKVVA GSNLQDIIKL GSFVVASYLG LLIMFAVHGL LLGINGVSPL KYFRKVWPVL TFAFTSRSSA ASIPLNVEAQ TRRLGVPESI ASFAASFGAT IGQNGCAGLY PAMLAVMVAP TVGINPMDPM WIATLVGIVT VSSAGVAGVG GGATFAALIV LPAMGLPVTL VALLISVEPL IDMGRTALNV SGSMTAGTLT SQWLKQTDKA ILDSEDDAEL AHR
|
| |