Gene SeD_A3966 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSeD_A3966 
Symbol 
ID6874896 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSalmonella enterica subsp. enterica serovar Dublin str. CT_02021853 
KingdomBacteria 
Replicon accessionNC_011205 
Strand
Start bp3801386 
End bp3802858 
Gene Length1473 bp 
Protein Length490 aa 
Translation table11 
GC content51% 
IMG OID642786923 
Productinner membrane transporter YhiP 
Protein accessionYP_002217551 
Protein GI198246072 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3104] Dipeptide/tripeptide permease 
TIGRFAM ID[TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.826151 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones85 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATACAA CTGCACCTAC GGGCTTGCTG CAGCAACCTC GTCCATTTTT CATGATCTTT 
TTTGTAGAAT TATGGGAACG ATTTGGCTAT TACGGCGTCC AGGGCATCCT GGCGGTCTTT
TTCGTTAAAC AATTGGGTTT TTCTCAGGAA CAGGCCTTTA TTACCTTTGG CGCTTTTGCG
GCGCTGGTTT ATGGCCTGAT CTCCATCGGC GGCTATGTTG GCGACCATCT GTTAGGGACT
AAACGCACCC TGGTCCTGGG CGCGATTGTG CTGGCGATTG GCTATTTTAT GACCGGCATG
TCGTTATTAA ATCCCGATCT GATTTTTATC GCACTGGGTA CGATTGCCGT GGGCAACGGG
TTATTTAAAG CCAATCCCGC CAGCCTGCTC TCTAAATGCT ATCAGCCTAA AGATCCCCGG
CTGGATGGCG CTTTCACCCT GTTTTATATG TCGATTAACA TCGGTTCTTT GTTATCGCTA
TCGCTGGCGC CGGTGATTGC CGATAAATTT GGCTATACGG TGACCTATAA TCTGTGCGGC
GCTGGTTTAA TTGTTGCGCT TCTGGTGTAC TTCGCCTGCC GTGGCATGGT GAAAAATATC
GGTTCTGAAC CGGATCATAA ACCGCTACGT TTTCGCAATT TGCTGCTGGT ACTACTCGGC
ACCGTCGTCA TGATTTTCCT CTGCGCCTGG CTGATGCACA ACGTTAAGAT TGCCAATCTG
GTGCTCATCG TCCTTTCTAT CGTCGTCACT ATTTTCTTCT TTCGCGAAGC GTTTCGTCTG
GATAAAACCG GCCGCAATAA AATGTTCGTG GCGTTTATTC TGATGATTGA AGCCGTGCTG
TTTTACATTC TGTATGCGCA GATGCCTACC TCGCTGAACT TCTTTGCGAT TAATAACGTG
CATCATGAAA TTCTTGGATT CGCCATTAAC CCGGTGAGTT TTCAGGCGCT GAACCCATTC
TGGGTGGTCG TCGCCAGTCC GGTACTGGCA GCGATTTACA CCCGACTGGG TAGCAAAGGC
AAAGATCTGA CTATGCCGAT GAAGTTTACG CTCGGTATGT TCCTCTGCGC GCTGGGTTTT
CTGACCGCCG CCGCCGCCGG GATGTGGTTT GCCGATGCGC AAGGACTGAC GTCGCCGTGG
TTTATCGTGC TGGTGTATCT GTTCCAGAGT CTGGGCGAGT TGCTGATTAG CGCGCTGGGA
CTGGCAATGG TCGCCGCTCT GGTGCCGCAG CATCTGATGG GCTTTATTCT GGGAATGTGG
TTCCTGACCC AGGCCGCCGC CTTCCTGCTC GGCGGTTATG TGGCGACCTT CACTGCCGTA
CCGGAAAACA TCACCGATCC GTTACAGACG CTGCCCATTT ATACCGGCGT CTTTAGCAAA
ATTGGTCTGG TAACACTGGC GGTCACCGTG GTGATGGCCA TTATGGTGCC GTGGTTAAAC
CGGATGATTA ATACGCCAGG TACCGAACAG TAA
 
Protein sequence
MNTTAPTGLL QQPRPFFMIF FVELWERFGY YGVQGILAVF FVKQLGFSQE QAFITFGAFA 
ALVYGLISIG GYVGDHLLGT KRTLVLGAIV LAIGYFMTGM SLLNPDLIFI ALGTIAVGNG
LFKANPASLL SKCYQPKDPR LDGAFTLFYM SINIGSLLSL SLAPVIADKF GYTVTYNLCG
AGLIVALLVY FACRGMVKNI GSEPDHKPLR FRNLLLVLLG TVVMIFLCAW LMHNVKIANL
VLIVLSIVVT IFFFREAFRL DKTGRNKMFV AFILMIEAVL FYILYAQMPT SLNFFAINNV
HHEILGFAIN PVSFQALNPF WVVVASPVLA AIYTRLGSKG KDLTMPMKFT LGMFLCALGF
LTAAAAGMWF ADAQGLTSPW FIVLVYLFQS LGELLISALG LAMVAALVPQ HLMGFILGMW
FLTQAAAFLL GGYVATFTAV PENITDPLQT LPIYTGVFSK IGLVTLAVTV VMAIMVPWLN
RMINTPGTEQ