Gene Information |
Locus tag | EcSMS35_1826 |
Symbol | |
ID | 6146424 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 1846548 |
End bp | 1847933 |
Gene Length | 1386 bp |
Protein Length | 461 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616702 |
Product | amino acid permease |
Protein accession | YP_001743880 |
Protein GI | 170682974 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
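The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both are assumptions about this record's conventions, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the product, assuming an unspliced CDS whose final
    codon is a stop codon that is not part of the protein."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for locus tag EcSMS35_1826.
length_bp = gene_length(1846548, 1847933)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

With the values in this record, the sketch yields 1386 bp and 461 aa, matching the Gene Length and Protein Length fields.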
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 33 |
Fosmid unclonability p-value | 0.0234215 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTATTA ATTCACCACT GACTATTGCT GCGCAATCTG GCAAAACCCG TCTGCGAAAA TCACTGAAAT TGTGGCAGGT GGTGATGATG GGTCTGGCCT ATCTCACGCC GATGACCGTA TTTGATACTT TTGGTATTGT CTCCGGCATT AGCGACGGTC ACGTTCCGGC GTCCTATTTG CTGGCGCTGG CGGGCGTGCT GTTTACCGCT ATCAGCTACG GCAAACTGGT TCGCCAGTTT CCGGAAGCCG GTTCGGCCTA TACCTACGCG CAAAAGTCGA TTAACCCGCA CGTCGGATTT ATGGTCGGCT GGTCATCGCT GCTGGATTAT CTCTTTTTGC CGATGATCAA CGTCTTGTTG GCGAAAATCT ATCTCTCCGC CCTCTTCCCG GAAGTGCCGC CGTGGGTGTG GGTAGTAACC TTCGTCGCCA TTTTAACCGC CGCGAATCTG AAGAGCGTCA ACCTGGTTGC TAACTTCAAT ACCCTGTTTG TGTTGGTGCA AATCTCCATC ATGGTGGTGT TTATCTTCCT GGTGGTTCAG GGACTGCATA AAGGAGAAGG CGTTGGCACC GTCTGGTCAC TTCAGCCGTT TATCAGCGAG AACGCGCACC TGATCCCGAT TATTACCGGG GCGACGATTG TCTGTTTCTC CTTCCTCGGT TTCGATGCGG TGACCACGCT TTCGGAAGAG ACGCCGGACG CCGCACGCGT GATCCCGAAA GCCATCTTCC TGACGGCGGT CTACGGTGGC GTTATCTTTA TCGCTGCGTC GTTCTTTATG CAGCTGTTCT TTCCTGATAT CAGCCGCTTT AAAGACCCGG ACGCCGCACT GCCTGAAATT GCGCTGTACG TCGGCGGCAA GCTCTTCCAG TCGATTTTCC TCTGTACCAC GTTTGTGAAC ACGTTAGCGT CTGGCCTGGC GTCGCACGCC AGCGTGTCGC GTCTGCTGTA TGTGATGGGG CGTGACAATG TGTTTCCGGA GCGCGTGTTT GGCTATGTGC ACCCGAAATG GCGCACGCCA GCGTTGAACG TCATTATGGT CGGGATTGTT GCGCTGTCGG CGCTGTTCTT TGATTTAGTC ACCGCGACAG CATTGATTAA CTTCGGTGCT CTGGTGGCGT TTACCTTCGT GAATCTGTCG GTGTTTAATC ATTTCTGGCG GCGCAAAGGA ATGAATAAAA GCTGGAAGGA TCACTTCCAC TATTTGCTGA TGCCGCTGGT TGGCGCGCTG ACGGTGGGTG TGCTGTGGGT TAACCTCGAG TCAACGTCAC TGACACTCGG TCTGGTATGG GCTTCATTAG GCGGCGCATA TTTGTGGTAT CTGATCCGCC GCTATCGCAA AGTGCCGTTG TATGATGGTG ACAGAACGCC GGTGAGCGAA ACGTAA
Protein sequence | MAINSPLTIA AQSGKTRLRK SLKLWQVVMM GLAYLTPMTV FDTFGIVSGI SDGHVPASYL LALAGVLFTA ISYGKLVRQF PEAGSAYTYA QKSINPHVGF MVGWSSLLDY LFLPMINVLL AKIYLSALFP EVPPWVWVVT FVAILTAANL KSVNLVANFN TLFVLVQISI MVVFIFLVVQ GLHKGEGVGT VWSLQPFISE NAHLIPIITG ATIVCFSFLG FDAVTTLSEE TPDAARVIPK AIFLTAVYGG VIFIAASFFM QLFFPDISRF KDPDAALPEI ALYVGGKLFQ SIFLCTTFVN TLASGLASHA SVSRLLYVMG RDNVFPERVF GYVHPKWRTP ALNVIMVGIV ALSALFFDLV TATALINFGA LVAFTFVNLS VFNHFWRRKG MNKSWKDHFH YLLMPLVGAL TVGVLWVNLE STSLTLGLVW ASLGGAYLWY LIRRYRKVPL YDGDRTPVSE T