Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A3868 |
Symbol | |
ID | 5592798 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 3864130 |
End bp | 3865569 |
Gene Length | 1440 bp |
Protein Length | 479 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 640922978 |
Product | putative transporter |
Protein accession | YP_001460456 |
Protein GI | 157163138 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2211] Na+/melibiose symporter and related transporters |
TIGRFAM ID | [TIGR00792] sugar (Glycoside-Pentoside-Hexuronide) transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 56 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGTTTACA AAATGCGTTA TCGGTCTCAG CACCCCTATA GCATCAAGGA AAAGCAGATG AAGAGTGAAG TGTTGTCCGT TAAAGAGAAA ATTGGTTATG GCATGGGAGA CGCCGCCAGC CACATTATTT TCGATAACGT AATGTTATAT ATGATGTTCT TTTATACCGA TATTTTTGGC ATTCCTGCCG GTTTTGTCGG AACCATGTTT TTGGTCGCTC GTGCACTGGA TGCGATTTCC GATCCTTGCA TGGGGTTGTT GGCCGATCGA ACGCGCTCTC GCTGGGGTAA ATTTCGTCCG TGGGTACTGT TTGGCGCACT GCCATTCGGG ATCGTCTGTG TACTGGCCTA TAGCACGCCA GATCTCAGTA TGAACGGCAA AATGATCTAT GCAGCAATTA CTTACACCCT ACTTACCTTA CTTTATACCG TCGTCAATAT CCCTTACTGC GCATTGGGTG GTGTAATCAC CAATGACCCG ACTCAGCGTA TCTCGCTGCA ATCCTGGCGT TTTGTGCTGG CGACCGCGGG AGGCATGCTT TCTACTGTTC TGATGATGCC ACTGGTTAAT TTAATTGGCG GTGATAATAA ACCACTCGGT TTCCAGGGCG GTATCGCGGT CCTTTCCGTG GTGGCATTCA TGATGCTGGC ATTTTGTTTC TTCACCACTA AAGAACGCGT TGAAGCACCA CCTACAACAA CGTCTATGCG GGAAGATTTA CGTGATATCT GGCAAAACGA CCAGTGGCGG ATTGTCGGTT TACTAACCAT TTTCAATATC CTGGCGGTGT GCGTACGCGG TGGGGCGATG ATGTATTACG TCACATGGAT TTTGGGCACG CCGGAAGTGT TTGTCGCTTT TCTCACCACT TATTGCGTGG GTAACCTGAT TGGTTCCGCA CTGGCAAAAC CTCTGACCGA CTGGAAATGT AAAGTCACTA TCTTCTGGTG GACGAACGCC CTGCTGGCAG TGATTAGCCT CGCGATGTTC TTTGTTCCCA TGCAGGCCAG CATCACTATG TTTGTCTTCA TCTTCGTGAT TGGTGTGTTG CATCAACTGG TGACACCTAT CCAGTGGGTA ATGATGTCCG ATACCGTCGA CTACGGCGAG TGGTGCAATG GTAAACGCCT GACCGGGATC AGTTTTGCTG GCACGCTGTT TGTGCTCAAA CTGGGGTTGG CCTTCGGCGG CGCTCTTATC GGCTGGATGC TGGCTTATGG CGGATATGAT GCGGCAGAAA AAGCGCAGAA CAGCGCCACG ATTAGCATCA TTATTGCGCT ATTCACGATT GTTCCGGCGA TCTGTTATTT GCTGAGCGCG ATTATCGCTA AACGCTACTA CTCACTCACG ACGCACAATC TGAAAACCGT TATGGAACAG CTGGCTCAGG GTAAACGCCG TTGCCAGCAA CAATTCACCT CTCAAGAAGT GCAGAACTAA
|
Protein sequence | MVYKMRYRSQ HPYSIKEKQM KSEVLSVKEK IGYGMGDAAS HIIFDNVMLY MMFFYTDIFG IPAGFVGTMF LVARALDAIS DPCMGLLADR TRSRWGKFRP WVLFGALPFG IVCVLAYSTP DLSMNGKMIY AAITYTLLTL LYTVVNIPYC ALGGVITNDP TQRISLQSWR FVLATAGGML STVLMMPLVN LIGGDNKPLG FQGGIAVLSV VAFMMLAFCF FTTKERVEAP PTTTSMREDL RDIWQNDQWR IVGLLTIFNI LAVCVRGGAM MYYVTWILGT PEVFVAFLTT YCVGNLIGSA LAKPLTDWKC KVTIFWWTNA LLAVISLAMF FVPMQASITM FVFIFVIGVL HQLVTPIQWV MMSDTVDYGE WCNGKRLTGI SFAGTLFVLK LGLAFGGALI GWMLAYGGYD AAEKAQNSAT ISIIIALFTI VPAICYLLSA IIAKRYYSLT THNLKTVMEQ LAQGKRRCQQ QFTSQEVQN
|
| |