Gene Information |
Locus tag | EcSMS35_2936 |
Symbol | |
ID | 6144549 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3011479 |
End bp | 3012768 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641617805 |
Product | serine transporter family protein |
Protein accession | YP_001744960 |
Protein GI | 170683290 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | [TIGR00814] serine transporter |
|
|
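The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length_bp(3011479, 3012768)
print(length)                     # 1290
print(protein_length_aa(length))  # 429
```

Both values match the Gene Length (1290 bp) and Protein Length (429 aa) fields of this record.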
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.0834253 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGAAACGA CTCAAACCAG CACGATTGCG TCGAAAGACT CTCGTAGTGC CTGGCGCAAG ACAGACACCA TGTGGATGCT GGGCCTTTAC GGCACGGCAA TCGGCGCGGG CGTGCTGTTC CTGCCAATCA ACGCCGGTGT TGGCGGTATG ATCCCGCTGA TCATCATGGC TATCCTTGCG TTCCCGATGA CATTTTTTGC TCACCGCGGC CTGACTCGCT TCGTACTGTC TGGTAAAAAC CCAGGCGAAG ACATCACCGA GGTTGTAGAA GAACACTTTG GTATTGGCGC AGGTAAACTG ATTACCCTGC TCTACTTCTT CGCTATCTAC CCGATCCTGC TGGTTTATAG CGTGGCAATC ACCAATACAG TTGAAAGCTT CATGTCTCAC CAGCTGGGTA TGACACCACC GCCGCGTGCG ATTCTGTCGC TGATCCTGAT CGTGGGTATG ATGACCATCG TTCGCTTCGG TGAGCAGATG ATCGTTAAAG CGATGAGCAT TCTGGTATTC CCGTTTGTTG GCGTACTGAT GCTGCTGGCT CTGTACCTGA TCCCGCAGTG GAACGGCGCA GCACTGGAAA CACTGTCTCT GGACACTGCA TCTGCAACCG GAAACGGTCT GTGGATGACC CTGTGGCTGG CAATTCCGGT AATGGTGTTC TCGTTCAACC ACTCTCCGAT CATCTCTTCT TTCGCCGTTG CGAAGCGTGA AGAGTACGGC GATATGGCAG AACAGAAATG CTCTAAGATC CTGGCATTCG CACACATCAT GATGGTGCTG ACCGTAATGT TCTTCGTCTT CAGCTGCGTA CTGAGCCTGA CTCCGGCAGA CCTGGCTGCG GCTAAAGAGC AGAACATCTC GATTCTGTCT TACCTGGCTA ACCACTTTAA CGCGCCGATC ATCGCGTGGA TGGCTCCGAT TATCGCGATT ATCGCTATCA CCAAATCCTT CCTCGGCCAC TACCTGGGCG CACGTGAAGG CTTCAACGGT ATGGTGATTA AATCTCTGCG TGGTAAAGGT AAGTCTATCG AAATCAACAA GCTGAACCGT ATCACTGCGC TGTTCATGCT GGTAACGACC TGGATTGTTG CCACCCTGAA CCCGAGCATC CTGGGTATGA TTGAAACCCT GGGCGGCCCA ATCATCGCGA TGATCCTGTT CCTGATGCCG ATGTACGCAA TTCAGAAAGT ACCGGCAATG CGTAAGTACA GCGGTCACAT CAGCAACGTA TTCGTTGTCG TGATGGGTCT GATTGCAATC TCCGCAATCT TCTACTCTCT GTTCAGCTAA
|
Protein sequence | METTQTSTIA SKDSRSAWRK TDTMWMLGLY GTAIGAGVLF LPINAGVGGM IPLIIMAILA FPMTFFAHRG LTRFVLSGKN PGEDITEVVE EHFGIGAGKL ITLLYFFAIY PILLVYSVAI TNTVESFMSH QLGMTPPPRA ILSLILIVGM MTIVRFGEQM IVKAMSILVF PFVGVLMLLA LYLIPQWNGA ALETLSLDTA SATGNGLWMT LWLAIPVMVF SFNHSPIISS FAVAKREEYG DMAEQKCSKI LAFAHIMMVL TVMFFVFSCV LSLTPADLAA AKEQNISILS YLANHFNAPI IAWMAPIIAI IAITKSFLGH YLGAREGFNG MVIKSLRGKG KSIEINKLNR ITALFMLVTT WIVATLNPSI LGMIETLGGP IIAMILFLMP MYAIQKVPAM RKYSGHISNV FVVVMGLIAI SAIFYSLFS
|
| |
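The relation between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It relies on translation table 11 using the same codon-to-amino-acid assignments as the standard genetic code (the two differ in permitted start codons, which this sketch does not handle), and it translates only the first 21 bases of the gene sequence above. The helper name `translate` is hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering; amino-acid assignments
# are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()  # the record prints bases in blocks of 10
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 21 bases of the gene sequence in this record.
print(translate("ATGGAAACGA CTCAAACCAG C"))  # METTQTS
```

The output agrees with the first seven residues of the protein sequence listed in the record.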