Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_1094 |
Symbol | |
ID | 6146749 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 1109201 |
End bp | 1110493 |
Gene Length | 1293 bp |
Protein Length | 430 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 641615978 |
Product | d-galactonate transporter |
Protein accession | YP_001743170 |
Protein GI | 170680666 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2271] Sugar phosphate permease |
TIGRFAM ID | [TIGR00881] phosphoglycerate transporter family protein [TIGR00893] d-galactonate transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATATTC CCGTTAATGC AGCAAAGCCG GGGCGTCGGC GTTATCTGAC GCTGGTGATG ATCTTTATTA CGGTGGTCAT TTGTTATGTT GACCGCGCTA ACCTGGCCGT GGCTGCCGCC CATATTCAGG AAGAGTTCGG CATTACCAAA GCGGAAATGG GCTATGTATT TTCGGCCTTC GCCTGGCTTT ATACGCTATG TCAGATCCCC GGCGGTTGGT TTTTAGATCG CGTAGGTTCA CGCGTGACTT ATTTTATTGC GATATTTGGC TGGTCAGTGG CGACGTTATT CCAGGGCTTT GCCACGGGAT TAATGTCATT AATTGGTCTG CGCGCGATAA CCGGTATTTT CGAAGCGCCC GCTTTCCCGA CCAATAACCG GATGGTGACC AGCTGGTTCC CGGAACATGA ACGCGCTTCT GCCGTTGGTT TTTATACGTC TGGTCAGTTT GTCGGTCTGG CGTTTCTGAC TCCGCTGCTG ATCTGGATTC AGGAGATGTT GAGCTGGCAC TGGGTGTTCA TTGTCACCGG TGGTATCGGC ATTATCTGGT CGCTGATTTG GTTTAAGGTT TATCAGCCGC CGCGCCTGAC CAAAGGCATC AGCAAAGCTG AACTGGATTA CATTCGTGAT GGCGGCGGTC TGGTGGATGG TGATGCGCCG GTGAAAAAAG AGGCACGTCA GCCGTTAACA GCCAAAGACT GGAAACTGGT GTTCCATCGT AAACTGATCG GCGTCTATCT TGGACAATTT GCGGTGACTT CTACACTGTG GTTTTTTTTG ACCTGGTTCC CGAACTATTT AACCCAGGAA AAAGGAATCA CGGCGCTGAA AGCAGGCTTT ATGACCTCGG TACCATTCCT CGCGGCGTTT GTCGGCGTCC TGCTCTCTGG CTGGGTCGCG GATCTGCTGG TACGTAAGGG CTTTTCACTG GGCTTTGCGC GTAAAACGCC GATTATCTGC GGCTTGCTGA TCTCCACCTG CATTATGGGC GCTAACTACA CTAACGATCC GATGATGATT ATGTGCCTGA TGGCGCTGGC ATTCTTCGGC AACGGTTTTG CTTCGATTAC CTGGTCGCTG GTCTCTTCTC TGGCACCGAT GCGCCTGATT GGTTTAACCG GTGGCGTATT TAACTTTGTT GGCGGTCTGG GCGGCATCAC CGTTCCGCTG GTGGTGGGGT ACCTGGCGCA GGGTTACGGT TTCGCACCTG CACTGGTTTA TATCTCCGCC GTCGCGTTGA TTGGCGCGCT CTCTTACATT CTGATGGTGG GCGATGTGAA GCGCGTTGGA TAA
|
Protein sequence | MDIPVNAAKP GRRRYLTLVM IFITVVICYV DRANLAVAAA HIQEEFGITK AEMGYVFSAF AWLYTLCQIP GGWFLDRVGS RVTYFIAIFG WSVATLFQGF ATGLMSLIGL RAITGIFEAP AFPTNNRMVT SWFPEHERAS AVGFYTSGQF VGLAFLTPLL IWIQEMLSWH WVFIVTGGIG IIWSLIWFKV YQPPRLTKGI SKAELDYIRD GGGLVDGDAP VKKEARQPLT AKDWKLVFHR KLIGVYLGQF AVTSTLWFFL TWFPNYLTQE KGITALKAGF MTSVPFLAAF VGVLLSGWVA DLLVRKGFSL GFARKTPIIC GLLISTCIMG ANYTNDPMMI MCLMALAFFG NGFASITWSL VSSLAPMRLI GLTGGVFNFV GGLGGITVPL VVGYLAQGYG FAPALVYISA VALIGALSYI LMVGDVKRVG
|
| |