Gene SeD_A3520 details

Gene Information

Locus tag: SeD_A3520
Symbol:
ID: 6874833
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853
Kingdom: Bacteria
Replicon accession: NC_011205
Strand:
Start bp: 3378649
End bp: 3379632
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 52%
IMG OID: 642786510
Product: trap dicarboxylate transporter DctP subunit
Protein accession: YP_002217147
Protein GI: 198244579
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
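
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the CDS ends with a single stop codon; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the record above.
START_BP = 3378649
END_BP = 3379632
GENE_LENGTH_BP = 984
PROTEIN_LENGTH_AA = 327


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one terminal stop codon (assumed present)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under those assumptions both comparisons hold for this record: 3379632 - 3378649 + 1 = 984 bp, and 984 / 3 - 1 = 327 aa.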


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.656034
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 71
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACA CACGTTCATT CACAACATCA GCGGTATTAC TGGCCGGCTG TTTGCTACTG 
GCATTTCCAG CGCTCGCCAA AACCACGCTG AAACTGAGCC ACAATCAGGA TAAAAGCCAC
GCCGTTCACA AAGCGATGAG CTATCTGGCC GATAAAGCGA AAGCCTATTC GGACGGCGAA
TTAAATATTC GTATTTACCC CAACGCCACG CTGGGCAACG AACGTGAATC GCTGGAATTG
ATGAACTCCG GCGCTCTGCA AATGGTGAAA GTCAATGCGG CATCGCTGGA GTCTTTTGCG
CCGGAATATA GCGTGTTTAG CCTGCCGTTT TTATTCCGCG ACCGCGATCA CTACTACAAC
GTACTGAAAA GCGACTTAGG GAAACGCATT CTCGCGTCCT CCGAAAGCAA AGGCTTCGTC
GGCTTAACCT GGTACGACGG CGGCGCCCGC AGTTTTTACG CTGGTAAGCC CATCACTCAA
CCCGACGATT TAGCCGGTAT GAAAATCAGA GTGCAGCAAA GCCCCAGCGC TATCGCGATG
GTGAAAGCGC TCGGCGGCGT GCCGACGCCG ATGGCGCAAG GCGAACTCTA TACCGCGCTC
CAGCAAGGCG TGGTCGATGG CGGCGAAAAC AACCCCGTGG TTTATGCCGA TATGCGTCAT
GCGGAGGTGG CGAAATTCTA TTCCCGCGAC GAGCACACGA TGGTGCCGGA TGTCCTGGTC
ATCAGTACCA AAGTACTTAA CAAATTGAGC GATAAAGAGC GGAAAGCGTT ATATAAAGCC
GCAGATGAAT CCATGCAGCA AATGAAAGAC GTCATCTGGC CCGCCGCGGA AAAAGAGGCT
TATGAGAGCA TGAAGGCCAT GAATGCGACC GTTGTTGATA TTGATAAATC CGCGTTCAAA
CAGCGTGTTA AGCCCTTGTT TGATGAGTTC CGCGCGAAAG ATGCTCAATC AGCGAAGGAT
CTGGAATACA TCGAGAATAT GTAA
 
Protein sequence
MKNTRSFTTS AVLLAGCLLL AFPALAKTTL KLSHNQDKSH AVHKAMSYLA DKAKAYSDGE 
LNIRIYPNAT LGNERESLEL MNSGALQMVK VNAASLESFA PEYSVFSLPF LFRDRDHYYN
VLKSDLGKRI LASSESKGFV GLTWYDGGAR SFYAGKPITQ PDDLAGMKIR VQQSPSAIAM
VKALGGVPTP MAQGELYTAL QQGVVDGGEN NPVVYADMRH AEVAKFYSRD EHTMVPDVLV
ISTKVLNKLS DKERKALYKA ADESMQQMKD VIWPAAEKEA YESMKAMNAT VVDIDKSAFK
QRVKPLFDEF RAKDAQSAKD LEYIENM
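
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. This is not the database's own procedure. It assumes the gene sequence is listed in coding orientation (it begins with ATG and ends with TAA) and uses the standard codon assignments, which translation table 11 shares with the standard code for internal codons; the two tables differ in which codons may act as start codons. The `translate` function is a hypothetical helper written for this example.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence above.
    first_line = "ATGAAAAACA CACGTTCATT CACAACATCA GCGGTATTAC TGGCCGGCTG TTTGCTACTG"
    print(translate(first_line))
```

Applied to the first line of the gene sequence, this yields MKNTRSFTTSAVLLAGCLLL, which matches the first 20 residues of the protein sequence.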