Gene EcSMS35_3311 details

Gene Information

Locus tag: EcSMS35_3311
Symbol:
ID: 6145653
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Escherichia coli SMS-3-5
Kingdom: Bacteria
Replicon accession: NC_010498
Strand:
Start bp: 3388889
End bp: 3390496
Gene Length: 1608 bp
Protein Length: 535 aa
Translation table: 11
GC content: 53%
IMG OID: 641618140
Product: putative binding protein
Protein accession: YP_001745290
Protein GI: 170680624
COG category: [E] Amino acid transport and metabolism
COG ID: [COG4166] ABC-type oligopeptide transport system, periplasmic component
TIGRFAM ID:
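
The coordinates and lengths in this record are mutually consistent, which can be checked with a little arithmetic. The sketch below is a minimal illustration in Python. It assumes 1-based inclusive coordinates (as the reported gene length implies) and that the stop codon counts toward the gene length but not the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length_bp = gene_length(3388889, 3390496)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1608 535
```

Both values agree with the Gene Length and Protein Length fields of the record.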


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.43332
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 38
Fosmid unclonability p-value: 0.120975
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTATACGC GAAATTTATT ATGGCTGGTC AGCCTGGTAA GTGCGGCTCC TCTCTACGCT 
GCTGACGTTC CCGCCAACAC ACCGCTCGCC CCGCAACAAG TCTTTCGTTA CAACAATCAT
AGCGACCCAG GCACGCTCGA CCCGCAAAAG GTGGAGGAGA ATACTGCCGC GCAGATTGTG
CTGGATCTGT TTGAAGGTCT GGTATGGATG GACGGTGAAG GCCAGGTGCA GCCCGCTCAG
GCTGAACGCT GGGAGATACT GGACGGCGGC AAGCGCTATA TTTTCCATCT GCGTAGCGGT
TTGCAGTGGT CAGACGGTCA GCCTCTGACG GCAGAGGATT TTGTCCTCGG CTGGCAGCGC
GCGGTTGACC CGAAAACGGC AAGCCCTTTT GCTGGCTATC TGGCACAGGC GCACATTAAC
AATGCCGCGG CTATTGTTGC GGGTAAAGCA GATGTTACAT CGCTGGGTGT CAAAGCGACG
GATGATCGTA CTCTTGAAGT TACGCTTGAG CAGCCAGTTC CTTGGTTCAC GACGATGCTC
GCCTGGCCGA CGCTGTTCCC GGTTCCTCAT CATGTCATCG CTAAACATGG CGATAGCTGG
AGTAAGCCAG AGAACATGGT TTACAACGGT GCCTTTGTGC TTGATCAGTG GGTAGTTAAC
GAAAAGATTA CTGCACGCAA AAATCCAAAG TACCGCGATG CGCAACATAC AGTATTGCAA
CAGGTTGAGT ATCTGGCGCT GGATAATTCG GTCACCGGCT ATAACCGCTA TCGCGCGGGA
GAGGTCGATC TCACCTGGGT TCCGGCGCAG CAAATTCCCG CCATTGAAAA ATCACTGCCT
GGCGAGCTAC GAATTATTCC GCGTCTGAAC AGCGAATATT ACAACTTCAA CCTTGAGAAA
CCGCCATTTA ACGATGTGCG GGTGCGTCGG GCGCTATATC TTACGGTTGA TCGACAGCTT
ATTGCGCAAA AGGTACTGGG GTTGAGAACG CCCGCAACTA CGCTGACGCC GCCAGAGGTA
AAAGGCTTTA GCGCGACGAC GTTCGATGAA CTGCAAAAGC CGCTGAGTGA GCGCGTCGCG
ATGGCAAAAG CCTTGTTGAA ACAGGCGGGA TACGACGCCT CTCATCCGCT TCGCTTTGAG
CTGTTCTACA ACAAGTACGA TCTGCATGAA AAGACCGCGA TAGCGTTGTC TTCCGAATGG
AAAAAATGGC TGGGTGCACA GGTGACGCTG CGCACAATGG AGTGGAAAAC TTATCTTGAT
GCCCGACGAG CCGGTGATTT CATGTTGTCT CGGCAGTCGT GGGATGCGAC GTACAATGAT
GCCTCCACCT TTCTGAACAC GCTTAAGAGC GATAGCGAGG AAAATGTCGG CCACTGGAAA
AACGCGCAGT ATGACGCCTT ACTAAACCAG GCGGCACAAA CGGCTGATGC AACAAAGCGT
AATGCGTTGT ATCAGCAGGC AGAAGTGATC ATCAACCAGC AGGCACCGCT CATTCCTGTC
TATTATCAGC CGTTAATCAA ACTGCTTAAA CCCTACGTTG GCGGTTTTCC GCTGCATAAT
CCCCAGGATT ATGTCTACAG CAAAGAGTTG TATATCAAGG CACATTGA
 
Protein sequence
MYTRNLLWLV SLVSAAPLYA ADVPANTPLA PQQVFRYNNH SDPGTLDPQK VEENTAAQIV 
LDLFEGLVWM DGEGQVQPAQ AERWEILDGG KRYIFHLRSG LQWSDGQPLT AEDFVLGWQR
AVDPKTASPF AGYLAQAHIN NAAAIVAGKA DVTSLGVKAT DDRTLEVTLE QPVPWFTTML
AWPTLFPVPH HVIAKHGDSW SKPENMVYNG AFVLDQWVVN EKITARKNPK YRDAQHTVLQ
QVEYLALDNS VTGYNRYRAG EVDLTWVPAQ QIPAIEKSLP GELRIIPRLN SEYYNFNLEK
PPFNDVRVRR ALYLTVDRQL IAQKVLGLRT PATTLTPPEV KGFSATTFDE LQKPLSERVA
MAKALLKQAG YDASHPLRFE LFYNKYDLHE KTAIALSSEW KKWLGAQVTL RTMEWKTYLD
ARRAGDFMLS RQSWDATYND ASTFLNTLKS DSEENVGHWK NAQYDALLNQ AAQTADATKR
NALYQQAEVI INQQAPLIPV YYQPLIKLLK PYVGGFPLHN PQDYVYSKEL YIKAH
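
The sequences above are printed in space-separated blocks of ten with line breaks, so they need normalizing before any computation. As a rough sketch, the following Python strips the whitespace and provides two basic checks. The helper names are hypothetical, and GC content is assumed to mean the percentage of G and C bases over the whole gene sequence. For brevity the example runs only on the first and last lines of the gene sequence.

```python
# Stop codons of NCBI translation table 11 (bacterial, archaeal and plant plastid code).
STOP_CODONS = {"TAA", "TAG", "TGA"}


def normalize(seq_text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(seq_text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a normalized nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def looks_like_cds(seq: str) -> bool:
    """True if the length is a multiple of 3 and the last codon is a stop codon."""
    return len(seq) % 3 == 0 and seq[-3:] in STOP_CODONS


first_line = "ATGTATACGC GAAATTTATT ATGGCTGGTC AGCCTGGTAA GTGCGGCTCC TCTCTACGCT"
last_line = "CCCCAGGATT ATGTCTACAG CAAAGAGTTG TATATCAAGG CACATTGA"

print(len(normalize(first_line)))                # 60
print(normalize(last_line)[-3:] in STOP_CODONS)  # True
```

Passing the complete gene sequence through `normalize` and then `gc_percent` gives a value that can be compared with the GC content field of the record, and `len` of the normalized sequence can be compared with the Gene Length field.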