Gene Information |
Locus tag | SeD_A2024 |
Symbol | |
ID | 6873797 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 1956159 |
End bp | 1957550 |
Gene Length | 1392 bp |
Protein Length | 463 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642785138 |
Product | sodium:dicarboxylate symporter family protein |
Protein accession | YP_002215804 |
Protein GI | 198243533 |
COG category | [R] General function prediction only |
COG ID | [COG1823] Predicted Na+/dicarboxylate symporter |
TIGRFAM ID | |
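The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 1956159
end_bp = 1957550

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and contributes no residue.
protein_length_aa = gene_length_bp // 3 - 1

print(gene_length_bp, protein_length_aa)  # 1392 463
```

Under these assumptions the result agrees with the listed gene length (1392 bp) and protein length (463 aa).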
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.546812 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 69 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGAATTTTC CATTAATTGC GAACATTGCG GTGTTCGTCA TTCTGCTGTT TGTAGTGGCG CAGGCCCGTC ATAAACAGTG GAGTCTGGCT AAAAAAGTGC TTGTCGGCCT TGTGATGGGC GTGGTCTTTG GTCTGGCCCT GCACACCATT TATGGCGCTG ACAGTCAGGT GTTAAAAGAC TCCATTCAGT GGTTCAATAT TGTCGGCAAC GGCTATGTGC AATTGCTACA AATGATTGTG ATGCCGCTGG TATTCGCCTC TATTTTAAGC GCCGTGGCAC GACTGCATAA CGCCTCTCAA TTGGGGAAAA TCAGTTTTCT GACTATCGGC ACGCTGCTGT TTACTACCCT TATCGCAGCG CTGGTGGGCG TGCTGGTCAC CAATCTGTTC GGACTGACGG CGGAAGGTCT GGTTCAGGGC GGCGCGGAAA CCGCGCGTCT GAATGCCATT GAGACGAGCT ATGTCGGCAA AGTGGCCGAT CTCAGCGTGC CGCAACTGGT GCTGTCTTTT GTACCGAAAA ATCCGTTTGC GGATCTGACT GGCGCAAATC CTACGTCTAT TATCAGCGTG GTGATTTTCG CCGCACTCCT GGGCGTGGCC GCGCTTAAAC TGCTGAAAGA TGATGCGCCA AAAGGCGAGC GTGTGCTTGT TGCTATCGAT ACCCTGCAAA GCTGGGTGAT GAAGCTGGTG CGTCTGGTGA TGCAACTTAC GCCTTACGGT GTACTGGCGC TGATGACGAA AGTGGTTGCC GGCTCCAATC TGCAGGACAT TATTAAGCTT GGAAGCTTCG TGGTAGCGTC ATATCTTGGT CTGGCGATAA TGTTTGTGGT TCACGGCATA CTATTGGGCG TGAACGGGAT TAGCCCGCTG AAATATTTCC GCAAGGTGTG GCCGGTGTTG ACATTTGCCT TTACCAGCCG CTCCAGCGCC GCCTCCATTC CACTCAATGT CGAAGCGCAA ACCCGCCGCC TGGGCGTGCC GGAGTCCATC GCCAGCTTTG CTGCCTCTTT TGGCGCGACC ATCGGCCAAA ACGGTTGTGC CGGTCTTTAT CCTGCGATGT TGGCGGTAAT GGTTGCGCCG ACAGTCGGCA TCAACCCCTT AGACCCCGTA TGGATAGCCA CGCTGGTGGG TATCGTTACC GTAAGTTCGG CGGGTGTTGC CGGCGTTGGC GGCGGCGCGA CCTTCGCCGC GCTGATTGTG CTGCCTGCCA TGGGCTTACC GGTCACGCTG GTCGCGCTGT TAATCTCCGT CGAACCGTTG ATTGATATGG GCCGTACCGC GCTGAACGTG AGTGGCTCAA TGACCGCGGG TACGCTGACC AGCCAGTGGC TGAAGCAGAC CGATAAAACC ATTCTCGATA GCGAAGAAGA CGCTGAACTG GCGCATCGAT AA
Protein sequence | MNFPLIANIA VFVILLFVVA QARHKQWSLA KKVLVGLVMG VVFGLALHTI YGADSQVLKD SIQWFNIVGN GYVQLLQMIV MPLVFASILS AVARLHNASQ LGKISFLTIG TLLFTTLIAA LVGVLVTNLF GLTAEGLVQG GAETARLNAI ETSYVGKVAD LSVPQLVLSF VPKNPFADLT GANPTSIISV VIFAALLGVA ALKLLKDDAP KGERVLVAID TLQSWVMKLV RLVMQLTPYG VLALMTKVVA GSNLQDIIKL GSFVVASYLG LAIMFVVHGI LLGVNGISPL KYFRKVWPVL TFAFTSRSSA ASIPLNVEAQ TRRLGVPESI ASFAASFGAT IGQNGCAGLY PAMLAVMVAP TVGINPLDPV WIATLVGIVT VSSAGVAGVG GGATFAALIV LPAMGLPVTL VALLISVEPL IDMGRTALNV SGSMTAGTLT SQWLKQTDKT ILDSEEDAEL AHR
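The sequences above are printed in space-separated groups of ten characters. The following minimal Python sketch shows one way to strip that grouping, compute a GC percentage, and translate DNA into protein. It is an illustration under stated assumptions, not the method used to produce this record: the function names (`clean_sequence`, `gc_percent`, `translate`) are hypothetical, and the codon assignments are assumed to be those of the standard genetic code, which translation table 11 is assumed to share for amino acids. Only short fragments copied from the start and end of the gene sequence are used here.

```python
BASES = "TCAG"

# Amino acids for the 64 codons in TCAG order (first base varies slowest).
# Assumption: table 11 uses the same amino acid assignments as the standard code.
AMINO_ACIDS = "".join([
    "FFLL", "SSSS", "YY**", "CC*W",
    "LLLL", "PPPP", "HHQQ", "RRRR",
    "IIIM", "TTTT", "NNKK", "SSRR",
    "VVVV", "AAAA", "DDEE", "GGGG",
])

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped):
    """Remove the whitespace between groups and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(dna):
    """Return the percentage of G and C bases in a non-empty DNA string."""
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


def translate(dna):
    """Translate complete codons; stop codons are written as '*'.

    Raises KeyError for codons containing anything other than A, C, G, T.
    """
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three groups and last two groups of the gene sequence in this record.
head = clean_sequence("ATGAATTTTC CATTAATTGC GAACATTGCG")
tail = clean_sequence("GCGCATCGAT AA")

print(translate(head))  # MNFPLIANIA
print(translate(tail))  # AHR*
print(round(gc_percent(head), 1))
```

The translated fragments match the first ten and last three residues of the protein sequence listed above. Applying `gc_percent` to the full cleaned gene sequence would give a value to compare against the GC content field.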