Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4055 |
Symbol | |
ID | 6145193 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4147203 |
End bp | 4148495 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641618881 |
Product | d-galactonate transporter |
Protein accession | YP_001746019 |
Protein GI | 170682764 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | [TIGR00881] phosphoglycerate transporter family protein [TIGR00893] d-galactonate transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATTC CCGTTAATGC AGCAAAGCCG GGGCGTCGGC GTTATCTGAC GCTGGTGATG ATCTTTATTA CGGTGGTCAT TTGTTATGTT GACCGCGCTA ACCTGGCCGT GGCTTCCGCC CATATTCAGG AAGAGTTCGG TATTACGAAA GCGGAAATGG GCTATGTATT TTCGGCCTTC GCCTGGCTTT ATACGCTATG TCAGATCCCC GGCGGTTGGT TTTTAGATCG CGTAGGTTCA CGCGTGACTT ATTTTATTGC GATATTTGGC TGGTCAGTGG CGACGTTATT CCAGGGCTTT GCCACGGGAT TAATGTCATT AATTGGTCTG CGCGCAATAA CCGGTATTTT CGAAGCGCCC GCTTTCCCGA CCAATAACCG GATGGTGACC AGCTGGTTCC CGGAACATGA ACGCGCTTCT GCCGTTGGTT TTTATACGTC TGGTCAGTTT GTCGGTCTGG CGTTTCTGAC TCCGCTGCTG ATCTGGATTC AGGAGATGTT GAGCTGGCAC TGGGTGTTCA TTGTCACCGG TGGTATCGGC ATTATCTGGT CGCTGATTTG GTTTAAGGTT TATCAGCCGC CGCGTCTGAC CAAAGGCATC AGCAAAGCTG AACTGGATTA CATTCGTGAT GGCGGCGGTC TGGTGGATGG CGATGCGCCG GTGAAGAAAG AGGCGCGTCA GCCGTTAACA GCCAAAGACT GGAAACTGGT GTTCCATCGT AAACTGATCG GCGTCTATCT TGGGCAATTT GCGGTGGCTT CTACACTGTG GTTTTTCTTA ACCTGGTTCC CGAACTATTT AACCCAGGAA AAAGGAATCA CGGCGCTGAA AGCAGGCTTT ATGACCACGG TGCCATTCCT CGCGGCGTTT GTCGGCGTCC TGCTCTCTGG CTGGGTCGCG GATCTGCTGG TACGTAAGGG CTTTTCACTG GGCTTTGCGC GTAAAACGCC GATTATCTGC GGCTTGCTGA TCTCCACCTG CATTATGGGC GCTAACTACA CTAACGATCC GATGATGATT ATGTGCCTGA TGGCGCTGGC ATTCTTCGGC AACGGTTTTG CTTCGATTAC CTGGTCGCTG GTTTCTTCTC TGGCACCGAT GCGCCTGATT GGTTTAACCG GCGGCGTGTT TAACTTTGCC GGTGGTTTGG GCGGCATCAC CGTTCCGCTG GTGGTGGGGT ACCTGGCGCA GGGTTACGGT TTCGCGCCTG CACTGGTTTA TATCTCCGCC GTCGCGTTGA TTGGCGCGCT CTCTTACATC CTGCTGGTGG GCGATGTGAA GCGCGTTGGC TAA
|
Protein sequence | MDIPVNAAKP GRRRYLTLVM IFITVVICYV DRANLAVASA HIQEEFGITK AEMGYVFSAF AWLYTLCQIP GGWFLDRVGS RVTYFIAIFG WSVATLFQGF ATGLMSLIGL RAITGIFEAP AFPTNNRMVT SWFPEHERAS AVGFYTSGQF VGLAFLTPLL IWIQEMLSWH WVFIVTGGIG IIWSLIWFKV YQPPRLTKGI SKAELDYIRD GGGLVDGDAP VKKEARQPLT AKDWKLVFHR KLIGVYLGQF AVASTLWFFL TWFPNYLTQE KGITALKAGF MTTVPFLAAF VGVLLSGWVA DLLVRKGFSL GFARKTPIIC GLLISTCIMG ANYTNDPMMI MCLMALAFFG NGFASITWSL VSSLAPMRLI GLTGGVFNFA GGLGGITVPL VVGYLAQGYG FAPALVYISA VALIGALSYI LLVGDVKRVG
|
| |