Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A4058 |
Symbol | |
ID | 6871579 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3900221 |
End bp | 3901207 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642787007 |
Product | trap dicarboxylate transporter DctP subunit |
Protein accession | YP_002217634 |
Protein GI | 198244332 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.677544 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 64 |
Fosmid unclonability p-value | 0.748274 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAC ACGTTATTGC TCGTTCATTA TTGATAGCTG GTCTGACGGT TTTCAGCGTG TCGTCTCTGG CGGCGCAATC TTTACGTTTT GGTTATGAAA CACCGCAAAC TGACTCCCAA CATATTGCCG CGAAAAAATT TAACGAATTA TTAAAAGAAA AAACTAACGG CGAATTAACG CTAAAACTCT TTCCTGACAG CACATTAGGT AACGCCCAGG CAATGATCAG CGGGGTGCGC GGCGGAACTA TTGATATGGA AATGTCCGGT TCGAACAATT TCACCGGCCT GGCCCCTGTA TTCAACTTAC TTGATGTCCC CTTCCTGTTT CGCGATACCG CGCATGCGCA TAAAACGCTC GACGGCAAAG TCGGCGATGA ACTGAAAAAA TCACTCGATT CAAAAGGGTT AAAAGTGCTG GCCTACTGGG AAAACGGCTG GCGCGACGTC ACCAACTCCC GCGCGCCGGT AAAAACGCCG GGCGATTTGA AAGGCTTAAA AATCCGCACT AACAACAGCC CAATGAATAT CGCGGCCTTT AAAATCTTCG GCGCGAACCC TATTCCGATG CCGTTCTCCG AAGTCTATAC CGGCCTCGAA ACCCGTACGA TTGATGCCCA AGAACACCCT ATCAACGTCG TGTGGTCAGC GAAATTCTAT GAGGTACAGA AATACCTCTC CCTCACTCAT CACGCCTATT CGCCTCTGCT GCTGGTGATC AATAAAGCCA AATTCGACGC TTTAAGCCCG CAGTTCCAGG AGGCACTGCT GAGTTCCGCC AAAGAAGCGG GTGACTATCA GCGCAAACTG GTCGCCGAAG ATCAGCAAAA AATTATCGAT GGCATGAAAG AAGCCGGAGT TGAAGTCCTG ACCGATATCG ACCGTAAAGC CTTCAGCGAT GCGCTGGGCA GCCAGGTGCG CGATATGTTC CTGAAAGACA ACCCGCAGGG CGCCGATCTC CTGAAAGCCG TGGACGAGGT GCAATAA
|
Protein sequence | MKLHVIARSL LIAGLTVFSV SSLAAQSLRF GYETPQTDSQ HIAAKKFNEL LKEKTNGELT LKLFPDSTLG NAQAMISGVR GGTIDMEMSG SNNFTGLAPV FNLLDVPFLF RDTAHAHKTL DGKVGDELKK SLDSKGLKVL AYWENGWRDV TNSRAPVKTP GDLKGLKIRT NNSPMNIAAF KIFGANPIPM PFSEVYTGLE TRTIDAQEHP INVVWSAKFY EVQKYLSLTH HAYSPLLLVI NKAKFDALSP QFQEALLSSA KEAGDYQRKL VAEDQQKIID GMKEAGVEVL TDIDRKAFSD ALGSQVRDMF LKDNPQGADL LKAVDEVQ
|
| |