Gene Information |
Locus tag | SeD_A4517 |
Symbol | |
ID | 6873798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salmonella enterica subsp. enterica serovar Dublin str. CT_02021853 |
Kingdom | Bacteria |
Replicon accession | NC_011205 |
Strand | + |
Start bp | 4358374 |
End bp | 4359453 |
Gene Length | 1080 bp |
Protein Length | 359 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 642787430 |
Product | putative fructose-like permease EIIC subunit 2 |
Protein accession | YP_002218041 |
Protein GI | 198243942 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1299] Phosphotransferase system, fructose-specific IIC component |
TIGRFAM ID | [TIGR01427] PTS system, fructose subfamily, IIC component |
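The coordinate and length fields above are consistent with one another, which a short sketch can check. It assumes the start and end positions are 1-based and inclusive (an assumption, though it matches the listed gene length) and that the final codon is a stop codon not counted in the protein length. The function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (assumption)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is an untranslated stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4358374, 4359453)
print(length)                  # 1080
print(protein_length(length))  # 359
```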
| |
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 40 |
Fosmid unclonability p-value | 0.000540443 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAAAGAGT TGGTACAGAT CCTGAAAAAT ACCCGCCAGC ACCTGATGAC CGGTGTTTCG CATATGATCC CCTTTGTGGT GGCTGGCGGA ATTTTGCTGG CAGTCTCCGT CATGCTATAT GGCAAGGGCG CCGTACCCGA TGCCGCCACC GATCCGAATC TTAAAAAACT GTTTGATATC GGTGTCGCCG GGCTGACGCT GATGGTGCCT TTCCTCGCCG CATACATTGG CTACTCCATT GCCGAACGCT CCGCGCTGGC TCCTTGCGCG ATTGGTGCCT GGGTGGGTAA CAGCTTCGGC GCGGGCTTTT TCGGGGCACT TATCGCCGGA CTTATCGGCG GGATCGTGGT GCATTACTTG AAGAAAATCC CGGTGCATAA GGTGCTGCGT TCTGTGATGC CTATTTTTGT GATTCCCATC GTTGGCACTT TTATCACCGC GGGCATCATG ATGTGGGGGC TGGGCGAACC GATCGGTGCG CTGACAAGCA GCCTGACCCA ATGGCTGCAA GGGATGCAGC AGGGCAGCAT CGTGCTGCTG GCGGTGATCA TGGGGCTGAT GCTGGCTTTT GATATGGGCG GCCCGGTTAA TAAAGTCGCT TATGCGTTCA TGCTGATTTG CGTGGCGCAG GGCGTATATA CCGTGGTGGC TATCGCCGCG GTGAGCATCT GCGTACCGCC GCTGGGACTG GGGCTGGCGA CGCTGATTGG CCGCAAGAAT TTTTCTGTTG AAGAGCGCGA AGCCGGTAAA GCCGCGCTGG TCATGGGCTG CGTGGGCGTA ACGGAAGGGG CGATTCCTTT CGCCGCTGCC GATCCGCTGC GCGTGATCCC ATCCATTATG GTGGGCTCCG CTTGCGGTGC GGTAATGGCC GCGCTGTTTG GCGCGCAGTG TTATGCCGGT TGGGGCGGTT TAATTGTTCT GCCAGTCGTG GAAGGCAAGC TGGGTTATGT CGCGGCAGTC GCCGTGGGCG CGGTGGTAAC GGCAGTCTGC GTTAACGTGC TGAAAAGCCT GACGCGTAAG AATGTGTCGC AAGTTGACGA AAAAGAAGAC GACCTGGATT TAGATTTTGA GATGAATTAA
|
Protein sequence | MKELVQILKN TRQHLMTGVS HMIPFVVAGG ILLAVSVMLY GKGAVPDAAT DPNLKKLFDI GVAGLTLMVP FLAAYIGYSI AERSALAPCA IGAWVGNSFG AGFFGALIAG LIGGIVVHYL KKIPVHKVLR SVMPIFVIPI VGTFITAGIM MWGLGEPIGA LTSSLTQWLQ GMQQGSIVLL AVIMGLMLAF DMGGPVNKVA YAFMLICVAQ GVYTVVAIAA VSICVPPLGL GLATLIGRKN FSVEEREAGK AALVMGCVGV TEGAIPFAAA DPLRVIPSIM VGSACGAVMA ALFGAQCYAG WGGLIVLPVV EGKLGYVAAV AVGAVVTAVC VNVLKSLTRK NVSQVDEKED DLDLDFEMN
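The sequences above are printed in space-separated blocks of ten characters. A minimal sketch of how such a string might be cleaned and sanity-checked follows. The helper names are hypothetical, and the completeness check only tests for an ATG start, although translation table 11 also permits alternative start codons.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(raw: str) -> str:
    """Remove whitespace between the 10-character blocks and uppercase."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def looks_like_complete_cds(seq: str) -> bool:
    """True if the length is a multiple of 3, the sequence starts with ATG
    (alternative starts are not checked), and it ends with a stop codon."""
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First two blocks of the gene sequence, used here only as a short demo input.
demo = clean_sequence("ATGAAAGAGT TGGTACAGAT")
print(gc_percent(demo))                      # 35.0
print(looks_like_complete_cds("ATGAAATAA"))  # True
```

Applied to the full gene sequence, `gc_percent` gives a value that can be compared with the listed GC content, and `len(clean_sequence(...))` can be compared with the listed gene length.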