Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | SNSL254_A3743 |
Symbol | tsgA |
ID | 6483677 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Newport str. SL254 |
Kingdom | Bacteria |
Replicon accession | NC_011080 |
Strand | + |
Start bp | 3604089 |
End bp | 3605270 |
Gene Length | 1182 bp |
Protein Length | 393 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 642739012 |
Product | hypothetical protein |
Protein accession | YP_002042723 |
Protein GI | 194446244 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG0738] Fucose permease |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 11 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 0.0396479 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTAACA GCAACCGCAT CAAGCTCACA TGGATCAGCT TTCTTTCCTA CGCCCTGACC GGGGCGCTGG TGATTGTCAC CGGGATGGTG ATGGGAAATA TCGCAGACTA TTTTCAGCTG CCCGTTTCCA GCATGAGTAA CACCTTTACT TTCCTGAATG CCGGGATTTT GATCTCGATC TTCCTCAATG CGTGGCTGAT GGAAATCATC CCGCTGAAAA CACAGCTACG CTTTGGTTTT ATCCTGATGG TGCTGGCGGT GGCCGGGCTG ATGTTCAGCC ATAGCCTGGC GTTGTTCTCA GCGGCGATGT TTGTGCTGGG GCTGGTCAGC GGGATCACCA TGTCGATTGG CACCTTCCTG ATTACGCAAC TGTATGAAGG GCGTCAGCGC GGTTCCCGAC TGCTGTTTAC CGACTCCTTC TTCAGCATGG CGGGGATGAT TTTTCCTATG GTCGCCGCCT TCCTGCTGGC GCGTAGTATT GAGTGGTACT GGGTCTACGC CTGCATCGGC CTGGTCTACC TGGCGATTTT CATCCTGACC TTCGGCTGTG AATTTCCGGC GCTGGGTAAA CATGCGCAGC ACTCTCAGGC ACCTGTCGTC AAAGAAAAAT GGGGCATTGG CGTACTGTTT CTCGCCGTCG CCGCGCTGTG CTATATCCTC GGTCAATTGG GCTTTATCTC CTGGGTGCCG GAATACGCCA AAGGCCTCGG CATGAGCCTG AATGACGCCG GGGCGCTGGT GAGTGATTTC TGGATGTCCT ATATGTTTGG CATGTGGGCG TTCAGCTTTA TCCTGCGCTT TTTCGATCTG CAACGCATTC TGACCGTACT GGCGGGTATG GCGGCGGTAC TGATGTATTT GTTTATTACC GGCACGCAGG CGCATATGCC GTGGTTTATT CTGACGCTGG GCTTCTTCTC CAGCGCCATT TATACCTCCA TCATTACGCT GGGATCGCAG CAAACGAAAG TGGCCTCGCC TAAGCTGGTT AACTTTATTC TGACCTGCGG CACTATCGGA ACGATGCTGA CCTTCGTCGT CACCGGCCCG ATTGTGGCGC ACAGCGGCCC ACAGGCGGCG TTACTCACCG CGAATGGTCT GTATGCGGTG GTCTTTGTGA TGTGCTTTGC GCTCGGTTTT GTATCCCGTC ATCGTCAGCA TAGCGCGCCG GCTACGCATT GA
|
Protein sequence | MTNSNRIKLT WISFLSYALT GALVIVTGMV MGNIADYFQL PVSSMSNTFT FLNAGILISI FLNAWLMEII PLKTQLRFGF ILMVLAVAGL MFSHSLALFS AAMFVLGLVS GITMSIGTFL ITQLYEGRQR GSRLLFTDSF FSMAGMIFPM VAAFLLARSI EWYWVYACIG LVYLAIFILT FGCEFPALGK HAQHSQAPVV KEKWGIGVLF LAVAALCYIL GQLGFISWVP EYAKGLGMSL NDAGALVSDF WMSYMFGMWA FSFILRFFDL QRILTVLAGM AAVLMYLFIT GTQAHMPWFI LTLGFFSSAI YTSIITLGSQ QTKVASPKLV NFILTCGTIG TMLTFVVTGP IVAHSGPQAA LLTANGLYAV VFVMCFALGF VSRHRQHSAP ATH
|
| |