Gene SeHA_C3419 details

Gene Information

Locus tag: SeHA_C3419
Symbol:
ID: 6491352
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Salmonella enterica subsp. enterica serovar Heidelberg str. SL476
Kingdom: Bacteria
Replicon accession: NC_011083
Strand:
Start bp: 3323756
End bp: 3324739
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 52%
IMG OID: 642743550
Product: trap transporter solute receptor
Protein accession: YP_002047165
Protein GI: 194449769
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
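
The start, end, gene length and protein length fields above are consistent with one another, which can be checked with simple arithmetic. The following is a minimal Python sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends with a single stop codon. The function names are illustrative and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(3323756, 3324739)
print(length_bp, protein_length(length_bp))  # 984 327
```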


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.670449
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 81
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAACA CACGTTCATT CACAACATCA GCGGTATTAC TGGCCGGCTG TTTGCTACTG 
GCATTTCCAG CGCTCGCCAA AACCACGCTG AAACTGAGCC ACAATCAGGA TAAAAGCCAC
GCCGTTCACA AAGCGATGAG CTATCTGGCC GATAAAGCGA AAGCCTATTC GGACGGCGAA
TTAAATATTC GTATTTACCC CAACGCCACG CTGGGCAACG AACGTGAATC GCTGGAATTG
ATGAACTCCG GCGCTCTGCA AATGGTGAAA GTCAATGCGG CATCGCTGGA GTCTTTTGCG
CCGGAATATA GCGTGTTTAG CCTGCCGTTT TTATTCCGCG ACCGCGATCA CTACTACAAC
GTACTGAAAA GCGACTTAGG GAAACGCATT CTCGCGTCCT CCGAAAGCAA AGGCTTCGTC
GGCTTAACCT GGTACGACGG CGGCGCCCGC AGTTTTTACG CTGGTAAGCC CATCACTCAA
CCCGACGATT TAGCCGGTAT GAAAATCAGA GTGCAGCAAA GCCCCAGCGC TATCGCGATG
GTGAAAGCGC TCGGCGGTGT GCCGACGCCG ATGGCGCAAG GCGAACTCTA TACCGCGCTC
CAGCAAGGCG TGGTCGATGG CGGCGAAAAC AACCCCGTGG TTTATGCCGA TATGCGTCAT
GCGGAGGTGG CGAAATTCTA TTCCCGCGAC GAGCACACGA TGGTGCCGGA TGTCCTGGTC
ATCAGTACCA AAGTACTTAA CAAATTGAGC GATAAAGAGC GGAAAGCGTT ATATAAAGCC
GCAGATGAAT CCATGCAGCA AATGAAAGAT GTCATCTGGC CCGCCGCGGA AAAAGAGGCT
TATGAGAGCA TGAAGGCCAT GAACGCGACT GTTGTTGATA TTGATAAATC CGCGTTCAAA
CAGCGTGTTA AGCCCTTGTT TGATGAGTTC CGCGCAAAAG ACGCTCAGTC AGCGAAGGAT
CTGGAATACA TCGAGAATAT GTAA
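
The gene sequence above is printed in space-separated blocks of ten bases, so it has to be cleaned before quantities such as the GC content reported in the gene information can be recomputed. The sketch below shows one way to do that in Python. The helper names are hypothetical, and only the first two blocks of the sequence are used to keep the example short.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


first_blocks = "ATGAAAAACA CACGTTCATT"  # first two blocks of the gene sequence
seq = clean_sequence(first_blocks)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full gene sequence to `clean_sequence` and then `gc_fraction` gives the value to compare against the reported GC content.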
 
Protein sequence
MKNTRSFTTS AVLLAGCLLL AFPALAKTTL KLSHNQDKSH AVHKAMSYLA DKAKAYSDGE 
LNIRIYPNAT LGNERESLEL MNSGALQMVK VNAASLESFA PEYSVFSLPF LFRDRDHYYN
VLKSDLGKRI LASSESKGFV GLTWYDGGAR SFYAGKPITQ PDDLAGMKIR VQQSPSAIAM
VKALGGVPTP MAQGELYTAL QQGVVDGGEN NPVVYADMRH AEVAKFYSRD EHTMVPDVLV
ISTKVLNKLS DKERKALYKA ADESMQQMKD VIWPAAEKEA YESMKAMNAT VVDIDKSAFK
QRVKPLFDEF RAKDAQSAKD LEYIENM
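
The protein sequence can be derived from the gene sequence by codon-wise translation. The following is a minimal sketch, assuming the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1. It does not handle the alternative start codons that table 11 allows, which is sufficient here because this gene starts with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base T, C, A, G, then second base, then third.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS codon by codon, dropping one terminal stop codon if present."""
    seq = "".join(cds.split()).upper()  # remove block spacing and line breaks
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = [CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq), 3)]
    if protein and protein[-1] == "*":
        protein.pop()
    return "".join(protein)


# First three blocks and last three blocks of the gene sequence.
print(translate("ATGAAAAACA CACGTTCATT CACAACATCA"))  # MKNTRSFTTS
print(translate("CTGGAATACA TCGAGAATAT GTAA"))        # LEYIENM
```

Both fragments match the start and end of the protein sequence listed above.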