Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SeHA_C3996 |
Symbol | |
ID | 6491223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Heidelberg str. SL476 |
Kingdom | Bacteria |
Replicon accession | NC_011083 |
Strand | + |
Start bp | 3873064 |
End bp | 3874050 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 50% |
IMG OID | 642744097 |
Product | trap transporter solute receptor |
Protein accession | YP_002047702 |
Protein GI | 194451165 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 79 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAATTAC ACGTTATTGC TCGTTCATTA TTGATAGCTG GTCTGACGGT TTTCAGCGTG TCGTCTCTGG CGGCGCAATC TTTACGTTTT GGTTATGAAA CACCGCAAAC TGACTCCCAA CATATTGCCG CGAAAAAATT TAACGAACTA TTAAAAGAAA AAACTAACGG CGAATTAACG CTAAAACTCT TTCCTGACAG CACATTAGGT AACGCCCAGG CAATGATCAG CGGGGTGCGC GGCGGAACCA TTGATATGGA AATGTCCGGT TCGAACAATT TCACCGGCCT GGCCCCTGTA TTCAACTTAC TTGATGTCCC CTTCCTGTTT CGCGATACCG CGCATGCGCA TAAAACGCTC GACGGCAAAG TCGGCGATGA ACTGAAAAAA TCACTCGATT CAAAAGGGTT AAAAGTCCTC GCCTACTGGG AAAACGGCTG GCGCGACGTC ACCAACTCCC GCGCGCCGGT AAAAACGCCG GGCGATTTGA AAGGCTTAAA AATCCGCACT AACAACAGCC CAATGAATAT CGCGGCCTTT AAAATCTTCG GCGCGAACCC TATTCCGATG CCGTTCTCCG AAGTCTATAC CGGCCTCGAA ACCCGTACGA TTGATGCCCA GGAACACCCT ATCAACGTCG TGTGGTCAGC GAAATTCTAT GAGGTACAGA AATACCTCTC CCTCACTCAT CACGCCTATT CGCCTCTGCT GCTGGTGATC AATAAAGCCA AATTCGACGC TTTAAGCCCG CAGCTCCAGG AGGCACTGCT GAGTTCCGCT AAAGAAGCGG GTGACTATCA GCGCAAACTG GTCGCCGAAG ATCAGCAAAA AATTATCGAT GGCATGAAAG AAGCCGGAGT TGAAGTCCTG ACCGATATCG ACCGTAAAGC CTTCAGCGAT GCGCTGGGCA GCCAGGTGCG CGATATGTTC CTGAAAGACA ACCCGCAGGG CGCTGACCTC CTGAAAGCCG TGGACGAGGT GCAATAA
|
Protein sequence | MKLHVIARSL LIAGLTVFSV SSLAAQSLRF GYETPQTDSQ HIAAKKFNEL LKEKTNGELT LKLFPDSTLG NAQAMISGVR GGTIDMEMSG SNNFTGLAPV FNLLDVPFLF RDTAHAHKTL DGKVGDELKK SLDSKGLKVL AYWENGWRDV TNSRAPVKTP GDLKGLKIRT NNSPMNIAAF KIFGANPIPM PFSEVYTGLE TRTIDAQEHP INVVWSAKFY EVQKYLSLTH HAYSPLLLVI NKAKFDALSP QLQEALLSSA KEAGDYQRKL VAEDQQKIID GMKEAGVEVL TDIDRKAFSD ALGSQVRDMF LKDNPQGADL LKAVDEVQ
|
| |