Gene Information |
Locus tag | EcSMS35_3305 |
Symbol | |
ID | 6144329 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3381142 |
End bp | 3382125 |
Gene Length | 984 bp |
Protein Length | 327 aa |
Translation table | 11 |
GC content | 42% |
IMG OID | 641618135 |
Product | TRAP transporter solute receptor DctP family protein |
Protein accession | YP_001745285 |
Protein GI | 170683750 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
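The coordinate and length fields above are mutually consistent, which the following minimal sketch checks. It assumes the coordinates are 1-based and inclusive (an assumption that matches the reported Gene Length) and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative only and are not part of any database API.

```python
# Coordinates copied from the record above.
# Assumption: 1-based, inclusive positions on replicon NC_010498.
START_BP = 3381142
END_BP = 3382125


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature given inclusive start/end coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose final codon is a stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1  # drop the terminal stop codon


length = gene_length(START_BP, END_BP)
print(length)                  # 984
print(protein_length(length))  # 327
```

Both values agree with the Gene Length (984 bp) and Protein Length (327 aa) fields of the record.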
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.829241 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 32 |
Fosmid unclonability p-value | 0.0101989 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAGCAC TTCGTCCTCT GACGGCATCG CTTATGCTGT TAACCAGTTG TTTATTAATA TCTAATACCA CACTGGCAAA AACAACATTA AAACTGAGTC ATAATCAGGA TAAAAGCCAT GCTGTCCATA AAGCATTAAG CTATTTAGCG GATAAAACCA AAGAGTATTC TAATGGTGAA TTAGTTATTC GCATTTACCC AAATGCAACA TTAGGTAATG AGCGTGAATC ACTGGAATTA ATGAACTCCG GTGCATTACA GATGGTTAAG GTTAATGCCG CATCACTAGA ATCATTTGCT CCAGATTACA GCCTTTTCAG TCTGCCGTTC TTATTCCGCG ACCGTGATCA CTATTATCGT GTTCTGCAAA GTGATTTAGG TAAAAAAATA CTTAATTCAT CAGAAAGCAA AGGTTTTGTT GGTATTACGT ACTATGACGG CGGTGCGAGA AGTTTTTATT CCAATAAACC GATTACAAAA CCTGAAGATT TAGCGGGAAT GAAAATCAGG GTTCAACAAA GCCCCAGCGC CATTGCAATG ATGAAAGCAC TCGGTGGTGT CGCTACCCCG ATGGCGCAAG GCGAACTCTA CACCGCACTT CAGCAAGGAG TGGTTGATGG CGGAGAGAAC AATACCGTTG TCTATTCCGA TATGCGTCAC GCCGAAGTCG CAAAAGTCTA TTCACGTGAT GAACACACCA TGGTACCTGA TGTGTTAATT ATCAGCACCA ACGTGTTGAA TAAACTTGGT GATAAAGAGC GCACGGCATT ATTAAAAGCA GCCGATGAGT CCATGATGCA GATGAAGGAC GTCATCTGGC CTGCGGCTGA AAAAGAAGCC TACGACAAAA TGAAAGGGAT GAACGCAACA GTTGTTGATG TTGATAAATC CGCTTTCAAA GAACGCGTCA AACCGTTATA CGATGAATTC AAGGCGAAAG ATGCACAATC AGCCAAAAAC CTTGAGCTAG TAGAAAGTAT GTAA
|
Protein sequence | MKALRPLTAS LMLLTSCLLI SNTTLAKTTL KLSHNQDKSH AVHKALSYLA DKTKEYSNGE LVIRIYPNAT LGNERESLEL MNSGALQMVK VNAASLESFA PDYSLFSLPF LFRDRDHYYR VLQSDLGKKI LNSSESKGFV GITYYDGGAR SFYSNKPITK PEDLAGMKIR VQQSPSAIAM MKALGGVATP MAQGELYTAL QQGVVDGGEN NTVVYSDMRH AEVAKVYSRD EHTMVPDVLI ISTNVLNKLG DKERTALLKA ADESMMQMKD VIWPAAEKEA YDKMKGMNAT VVDVDKSAFK ERVKPLYDEF KAKDAQSAKN LELVESM
|
| |
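The sequences above are displayed in space-separated groups of ten characters. A minimal sketch of how such a display string might be cleaned, its GC content computed, and the gene translated is given below. The helper names are hypothetical. The codon table is built from the standard amino-acid assignment, which translation table 11 shares with the standard code (table 11 differs only in the start codons it permits). Only the first three groups of the gene sequence are used as sample input; the full sequence can be pasted in the same way.

```python
from itertools import product

# First three 10-base groups of the gene sequence, as displayed in the record.
FRAGMENT = "ATGAAAGCAC TTCGTCCTCT GACGGCATCG"

# Amino acids for codons enumerated in TCAG order (third base varies fastest).
# Table 11 uses the same assignments as the standard genetic code.
_AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), _AMINO_ACIDS)
}


def clean_sequence(display: str) -> str:
    """Remove all whitespace from a grouped sequence and upper-case it."""
    return "".join(display.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    A trailing partial codon (1 or 2 bases) is ignored.
    """
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


dna = clean_sequence(FRAGMENT)
print(translate(dna))  # MKALRPLTAS
print(round(gc_percent(dna), 1))
```

The translated fragment matches the first ten residues of the protein sequence in the record. Running `gc_percent` on the complete gene sequence allows a comparison against the GC content field.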