Gene EcSMS35_3305 details

Gene Information

Locus tag: EcSMS35_3305
Symbol:
ID: 6144329
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3381142
End bp: 3382125
Gene Length: 984 bp
Protein Length: 327 aa
Translation table: 11
GC content: 42%
IMG OID: 641618135
Product: TRAP transporter solute receptor DctP family protein
Protein accession: YP_001745285
Protein GI: 170683750
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component
TIGRFAM ID: [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family
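
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch in Python, assuming the start and end positions are 1-based and inclusive and that the protein length excludes the stop codon; the function names are illustrative and are not part of the database.

```python
# Values copied from the Gene Information record.
START_BP = 3381142
END_BP = 3382125


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 984
print(protein_length(length_bp))  # 327
```

Both results agree with the Gene Length (984 bp) and Protein Length (327 aa) fields in the record.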


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.829241
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 32
Fosmid unclonability p-value: 0.0101989
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAGCAC TTCGTCCTCT GACGGCATCG CTTATGCTGT TAACCAGTTG TTTATTAATA 
TCTAATACCA CACTGGCAAA AACAACATTA AAACTGAGTC ATAATCAGGA TAAAAGCCAT
GCTGTCCATA AAGCATTAAG CTATTTAGCG GATAAAACCA AAGAGTATTC TAATGGTGAA
TTAGTTATTC GCATTTACCC AAATGCAACA TTAGGTAATG AGCGTGAATC ACTGGAATTA
ATGAACTCCG GTGCATTACA GATGGTTAAG GTTAATGCCG CATCACTAGA ATCATTTGCT
CCAGATTACA GCCTTTTCAG TCTGCCGTTC TTATTCCGCG ACCGTGATCA CTATTATCGT
GTTCTGCAAA GTGATTTAGG TAAAAAAATA CTTAATTCAT CAGAAAGCAA AGGTTTTGTT
GGTATTACGT ACTATGACGG CGGTGCGAGA AGTTTTTATT CCAATAAACC GATTACAAAA
CCTGAAGATT TAGCGGGAAT GAAAATCAGG GTTCAACAAA GCCCCAGCGC CATTGCAATG
ATGAAAGCAC TCGGTGGTGT CGCTACCCCG ATGGCGCAAG GCGAACTCTA CACCGCACTT
CAGCAAGGAG TGGTTGATGG CGGAGAGAAC AATACCGTTG TCTATTCCGA TATGCGTCAC
GCCGAAGTCG CAAAAGTCTA TTCACGTGAT GAACACACCA TGGTACCTGA TGTGTTAATT
ATCAGCACCA ACGTGTTGAA TAAACTTGGT GATAAAGAGC GCACGGCATT ATTAAAAGCA
GCCGATGAGT CCATGATGCA GATGAAGGAC GTCATCTGGC CTGCGGCTGA AAAAGAAGCC
TACGACAAAA TGAAAGGGAT GAACGCAACA GTTGTTGATG TTGATAAATC CGCTTTCAAA
GAACGCGTCA AACCGTTATA CGATGAATTC AAGGCGAAAG ATGCACAATC AGCCAAAAAC
CTTGAGCTAG TAGAAAGTAT GTAA
 
Protein sequence
MKALRPLTAS LMLLTSCLLI SNTTLAKTTL KLSHNQDKSH AVHKALSYLA DKTKEYSNGE 
LVIRIYPNAT LGNERESLEL MNSGALQMVK VNAASLESFA PDYSLFSLPF LFRDRDHYYR
VLQSDLGKKI LNSSESKGFV GITYYDGGAR SFYSNKPITK PEDLAGMKIR VQQSPSAIAM
MKALGGVATP MAQGELYTAL QQGVVDGGEN NTVVYSDMRH AEVAKVYSRD EHTMVPDVLI
ISTNVLNKLG DKERTALLKA ADESMMQMKD VIWPAAEKEA YDKMKGMNAT VVDVDKSAFK
ERVKPLYDEF KAKDAQSAKN LELVESM
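
The gene and protein sequences can be compared by translating the nucleotide sequence. The sketch below is one way to do this in plain Python, under these assumptions: the sequence is given in the coding direction, the start codon is ATG (as it is here), and the amino acid assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs in the start codons it permits, which this sketch does not model). The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
# Standard-code amino acid assignments, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of 10."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence shown above.
first_line = "ATGAAAGCAC TTCGTCCTCT GACGGCATCG CTTATGCTGT TAACCAGTTG TTTATTAATA"
print(translate(first_line))  # MKALRPLTASLMLLTSCLLI
```

The 20 residues printed match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the GC content field.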