Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3858 |
Symbol | |
ID | 6147299 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 3927035 |
End bp | 3928306 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | 641618684 |
Product | serine transporter family protein |
Protein accession | YP_001745824 |
Protein GI | 170682329 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | [TIGR00814] serine transporter |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 57 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGCACA ACACACTACC GAAACACGAT CAGAAATTGC CGTTTACACG CTACGACTTC GGCTGGGTTT TATTATGCAT AGGCATGGCG ATTGGTGCCG GAACCGTGCT GATGCCAGTA CAAATTGGCT TGAAGGGAAT TTGGGTATTT ATTACCGCAG CGATCATTGC TTATCCTGCC ACCTGGGTAG TGCAGGACAT TTATTTAAAA ACCCTTTCTG AAAGCGATTC CTGTAATGAC TACACCGATA TTATCAGTCA TTACCTGGGG AAGAACTGGG GAATTTTCCT CGGGGTTATC TACTTTTTGA TGATTATCCA CGGGATTTTT ATCTACTCTC TCTCCGTGGT TTTCGACAGC GCCTCGTACC TGAAAACCTT CGGTTTAACC GATGCCGATC TTTCACAATC TCTACTTTAT AAAGTCGCCA TTTTCGCCGT ACTGGTGGCA ATTGCGTCTG GTGGTGAACG ATTACTGTTT AAGATTTCCG GGCCAATGGT GGTGGTCAAA GTAGGGATTA TCGTCGTGTT CGGTTTTGCG ATGATCCCGC ACTGGAATTT CGCCAATATA ACCGCCTTCC CGCAAGCCTC CGTCTTTTTC CGCGATGTCC TGCTTACCAT TCCCTTTTGC TTCTTTTCTG CGGTATTTAT TCAGGTACTT AACCCAATGA ATATTGCCTA TCGTAAACGG GAAGCGGATA AAGTACTGGC AACCCGGCTC GCGCTGCGTA CCCACCGAAT TAGTTATGTC ACGCTCATCG CGGTGATCCT GTTTTTTGCC TTTTCGTTTA CCTTCTCAAT TAGCCACGAA GAAGCCGTTT CTGCCTTTGA ACAAAATATC TCGGCACTGG CACTGGCCGC GCAGGTGATC CCTGGGCATA TCATTCATAT CACCTCTACG GTGCTGAATA TCTTTGCCGT ACTGACCGCA TTCTTTGGCA TTTATCTCGG TTTCCACGAG GCCATTAAAG GCATTATTCT CAATCTGTTA AGCCGAATTA TTGATACCAA GAAAATTAAT TCACGCGTGC TGACTCTGGC GATCTGCGCT TTTATCGTCA TTACGTTGAC GATTTGGGTT TCGTTTCGTG TATCGGTGCT GGTGTTCTTT CAGTTGGGAA GCCCGTTATA TGGCATTGTG TCGTGCCTCA TTCCGTTTTT CCTGATCTAT AAAGTCGCAC AACTGGAAAA ACTTCGCGGA TTTAAAGCCT GGCTGATTCT GCTTTACGGC ATTTTGCTAT GCTTGTCGCC ACTGTTGAAG CTGATTGAGT AA
|
Protein sequence | MQHNTLPKHD QKLPFTRYDF GWVLLCIGMA IGAGTVLMPV QIGLKGIWVF ITAAIIAYPA TWVVQDIYLK TLSESDSCND YTDIISHYLG KNWGIFLGVI YFLMIIHGIF IYSLSVVFDS ASYLKTFGLT DADLSQSLLY KVAIFAVLVA IASGGERLLF KISGPMVVVK VGIIVVFGFA MIPHWNFANI TAFPQASVFF RDVLLTIPFC FFSAVFIQVL NPMNIAYRKR EADKVLATRL ALRTHRISYV TLIAVILFFA FSFTFSISHE EAVSAFEQNI SALALAAQVI PGHIIHITST VLNIFAVLTA FFGIYLGFHE AIKGIILNLL SRIIDTKKIN SRVLTLAICA FIVITLTIWV SFRVSVLVFF QLGSPLYGIV SCLIPFFLIY KVAQLEKLRG FKAWLILLYG ILLCLSPLLK LIE
|
| |