Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeD_A3966 |
Symbol | |
ID | 6874896 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 3801386 |
End bp | 3802858 |
Gene Length | 1473 bp |
Protein Length | 490 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 642786923 |
Product | inner membrane transporter YhiP |
Protein accession | YP_002217551 |
Protein GI | 198246072 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.826151 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 85 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATACAA CTGCACCTAC GGGCTTGCTG CAGCAACCTC GTCCATTTTT CATGATCTTT TTTGTAGAAT TATGGGAACG ATTTGGCTAT TACGGCGTCC AGGGCATCCT GGCGGTCTTT TTCGTTAAAC AATTGGGTTT TTCTCAGGAA CAGGCCTTTA TTACCTTTGG CGCTTTTGCG GCGCTGGTTT ATGGCCTGAT CTCCATCGGC GGCTATGTTG GCGACCATCT GTTAGGGACT AAACGCACCC TGGTCCTGGG CGCGATTGTG CTGGCGATTG GCTATTTTAT GACCGGCATG TCGTTATTAA ATCCCGATCT GATTTTTATC GCACTGGGTA CGATTGCCGT GGGCAACGGG TTATTTAAAG CCAATCCCGC CAGCCTGCTC TCTAAATGCT ATCAGCCTAA AGATCCCCGG CTGGATGGCG CTTTCACCCT GTTTTATATG TCGATTAACA TCGGTTCTTT GTTATCGCTA TCGCTGGCGC CGGTGATTGC CGATAAATTT GGCTATACGG TGACCTATAA TCTGTGCGGC GCTGGTTTAA TTGTTGCGCT TCTGGTGTAC TTCGCCTGCC GTGGCATGGT GAAAAATATC GGTTCTGAAC CGGATCATAA ACCGCTACGT TTTCGCAATT TGCTGCTGGT ACTACTCGGC ACCGTCGTCA TGATTTTCCT CTGCGCCTGG CTGATGCACA ACGTTAAGAT TGCCAATCTG GTGCTCATCG TCCTTTCTAT CGTCGTCACT ATTTTCTTCT TTCGCGAAGC GTTTCGTCTG GATAAAACCG GCCGCAATAA AATGTTCGTG GCGTTTATTC TGATGATTGA AGCCGTGCTG TTTTACATTC TGTATGCGCA GATGCCTACC TCGCTGAACT TCTTTGCGAT TAATAACGTG CATCATGAAA TTCTTGGATT CGCCATTAAC CCGGTGAGTT TTCAGGCGCT GAACCCATTC TGGGTGGTCG TCGCCAGTCC GGTACTGGCA GCGATTTACA CCCGACTGGG TAGCAAAGGC AAAGATCTGA CTATGCCGAT GAAGTTTACG CTCGGTATGT TCCTCTGCGC GCTGGGTTTT CTGACCGCCG CCGCCGCCGG GATGTGGTTT GCCGATGCGC AAGGACTGAC GTCGCCGTGG TTTATCGTGC TGGTGTATCT GTTCCAGAGT CTGGGCGAGT TGCTGATTAG CGCGCTGGGA CTGGCAATGG TCGCCGCTCT GGTGCCGCAG CATCTGATGG GCTTTATTCT GGGAATGTGG TTCCTGACCC AGGCCGCCGC CTTCCTGCTC GGCGGTTATG TGGCGACCTT CACTGCCGTA CCGGAAAACA TCACCGATCC GTTACAGACG CTGCCCATTT ATACCGGCGT CTTTAGCAAA ATTGGTCTGG TAACACTGGC GGTCACCGTG GTGATGGCCA TTATGGTGCC GTGGTTAAAC CGGATGATTA ATACGCCAGG TACCGAACAG TAA
|
Protein sequence | MNTTAPTGLL QQPRPFFMIF FVELWERFGY YGVQGILAVF FVKQLGFSQE QAFITFGAFA ALVYGLISIG GYVGDHLLGT KRTLVLGAIV LAIGYFMTGM SLLNPDLIFI ALGTIAVGNG LFKANPASLL SKCYQPKDPR LDGAFTLFYM SINIGSLLSL SLAPVIADKF GYTVTYNLCG AGLIVALLVY FACRGMVKNI GSEPDHKPLR FRNLLLVLLG TVVMIFLCAW LMHNVKIANL VLIVLSIVVT IFFFREAFRL DKTGRNKMFV AFILMIEAVL FYILYAQMPT SLNFFAINNV HHEILGFAIN PVSFQALNPF WVVVASPVLA AIYTRLGSKG KDLTMPMKFT LGMFLCALGF LTAAAAGMWF ADAQGLTSPW FIVLVYLFQS LGELLISALG LAMVAALVPQ HLMGFILGMW FLTQAAAFLL GGYVATFTAV PENITDPLQT LPIYTGVFSK IGLVTLAVTV VMAIMVPWLN RMINTPGTEQ
|
| |