Gene Information |
Locus tag | EcE24377A_1950 |
Symbol | |
ID | 5588679 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 1936758 |
End bp | 1938149 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640925623 |
Product | sodium/dicarboxylate symporter family protein |
Protein accession | YP_001463026 |
Protein GI | 157158064 |
COG category | [R] General function prediction only |
COG ID | [COG1823] Predicted Na+/dicarboxylate symporter |
TIGRFAM ID | |
|
|
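The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers and are not part of the record.

```python
# Values copied from the Gene Information table.
START_BP = 1936758
END_BP = 1938149


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # subtract the stop codon


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1392, matches "Gene Length"
print(protein_length(length_bp))  # 463, matches "Protein Length"
```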
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAACTTTC CATTAATTGC GAACATCGTG GTGTTCGTTG TACTGCTGTT TGCGCTGGCT CAGACCCGCC ATAAACAGTG GAGTCTGGCG AAAAAAGTGC TGGTGGGTCT GGTGATGGGT GTGGTTTTTG GCCTTGCCCT GCATACCATT TATGGTTCTG ACAGCCAGGT ACTTAAAGAT TCTGTACAGT GGTTTAACAT CGTTGGTAAC GGCTATGTTC AACTGCTGCA AATGATCGTT ATGCCGTTAG TCTTCGCCTC TATTCTGAGC GCGGTTGCCC GTCTGCATAA CGCATCTCAG TTAGGCAAAA TCAGTTTTCT GACCATCGGT ACGCTTTTGT TTACCACACT GATTGCGGCG CTGATCGGTG TGCTGGTCAC CAACCTGTTT GGTTTGACGG CTGAAGGTCT GGTTCAGGGT GGTGCAGAAA CTGCACGTCT GAACGCCATC GAAAGTAACT ATGTTGGTAA AGTCTCTGAC CTGAGCGTTC CGCAGCTGGT CTTGTCCTTT ATCCCGAAAA ACCCGTTTGC CGATCTGACC GGAGCCAATC CGACGTCAAT TATCAGCGTG GTAATTTTTG CCGCATTCCT CGGTGTAGCT GCGCTGAAAC TGCTGAAGGA TGATGCGCCG AAAGGTGAAC GCGTCTTAAC CGCTATCGAT ACCCTGCAAA GCTGGGTGAT GAAACTGGTT CGCCTGGTCA TGCAGTTGAC CCCTTACGGC GTTCTGGCAC TAATGACCAA AGTGGTTGCA GGTTCTAACC TGCAAGACAT CATCAAACTG GGAAGTTTCG TTGTCGCGTC CTACCTCGGT CTGCTGATTA TGTTTGCAGT GCATGGCATT CTGCTGGGCA TTAATGGCGT GAGTCCGCTG AAGTACTTCC GTAAGGTATG GCCTGTGCTG ACGTTTGCCT TTACCAGCCG TTCCAGTGCT GCGTCTATCC CACTGAATGT GGAAGCACAA ACGCGTCGTC TGGGCGCTCC TGAATCCATC GCCAGTTTCG CCGCCTCTTT CGGTGCAACC ATTGGTCAGA ACGGCTGCGC CGGTTTGTAT CCGGCAATGC TGGCGGTGAT GGTTGCGCCT ACGGTTGGCA TTAACCCGCT GGACCCGATG TGGATTGCGA CGCTGGTCGG TATTGTTACC GTTAGTTCCG CAGGCGTTGC CGGTGTCGGT GGTGGTGCAA CTTTCGCCGC ACTGATTGTG CTGCCTGCGA TGGGCCTGCC AGTAACCCTG GTGGCGCTGT TAATCTCCGT TGAACCGCTT ATCGACATGG GCCGTACGGC GCTAAACGTT AGTGGCTCGA TGACAGCTGG CACGCTGACC AGCCAGTGGC TGAAGCAAAC CGATAAAGCC ATTCTGGATA GCGAAGACGA CGCCGAACTG GCACACCGTT AA
|
Protein sequence | MNFPLIANIV VFVVLLFALA QTRHKQWSLA KKVLVGLVMG VVFGLALHTI YGSDSQVLKD SVQWFNIVGN GYVQLLQMIV MPLVFASILS AVARLHNASQ LGKISFLTIG TLLFTTLIAA LIGVLVTNLF GLTAEGLVQG GAETARLNAI ESNYVGKVSD LSVPQLVLSF IPKNPFADLT GANPTSIISV VIFAAFLGVA ALKLLKDDAP KGERVLTAID TLQSWVMKLV RLVMQLTPYG VLALMTKVVA GSNLQDIIKL GSFVVASYLG LLIMFAVHGI LLGINGVSPL KYFRKVWPVL TFAFTSRSSA ASIPLNVEAQ TRRLGAPESI ASFAASFGAT IGQNGCAGLY PAMLAVMVAP TVGINPLDPM WIATLVGIVT VSSAGVAGVG GGATFAALIV LPAMGLPVTL VALLISVEPL IDMGRTALNV SGSMTAGTLT SQWLKQTDKA ILDSEDDAEL AHR
|
| |
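The gene and protein sequences above can be cross-checked by translating the nucleotide sequence and by recomputing the GC content. The following is a minimal sketch that uses only the first 30 bases and the last 9 bases of the gene sequence as input. The helpers `clean`, `gc_content` and `translate` are hypothetical names, not part of the record. Translation table 11 uses the same codon-to-amino-acid assignments as the standard code (it differs in permitted start codons), so a standard codon table is built here.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (standard assignments,
# which translation table 11 shares).
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def clean(seq: str) -> str:
    """Remove the 10-base spacing used in the record and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 9 bases of the gene sequence in this record.
gene_start = "ATGAACTTTC CATTAATTGC GAACATCGTG"
gene_end = "CACCGTT AA"

print(translate(gene_start))  # MNFPLIANIV, the first 10 residues of the protein
print(translate(gene_end))    # HR, the last 2 residues before the stop codon
```

Passing the full gene sequence to `gc_content` and `translate` in the same way would allow comparison against the GC content and protein sequence fields of the record.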