Gene Information |
Locus tag | EcSMS35_4627 |
Symbol | |
ID | 6143335 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4728446 |
End bp | 4729990 |
Gene Length | 1545 bp |
Protein Length | 514 aa |
Translation table | 11 |
GC content | 47% |
IMG OID | 641619443 |
Product | amino acid permease family protein |
Protein accession | YP_001746554 |
Protein GI | 170680322 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0531] Amino acid transporters |
TIGRFAM ID | |
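The coordinates and lengths listed above are mutually consistent, which a short calculation can confirm. The sketch below assumes the start and end positions are 1-based and inclusive, and that the gene length counts the stop codon while the protein length does not; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information table.
start_bp = 4728446
end_bp = 4729990

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS includes the stop codon, which is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(f"Gene length: {gene_length} bp")
print(f"Protein length: {protein_length} aa")
```

Under these assumptions the result agrees with the reported 1545 bp and 514 aa.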
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.592733 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 35 |
Fosmid unclonability p-value | 0.0531629 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGGGCGCGT TTTATACACG CGCTGAAATG AAGGATGGTT TCATGCCTCA CACGATAAAA AAGATGAGTC TGATAGGACT CATATTGATG ATCTTTACTT CCGTATTTGG ATTTGCCAAT AGCCCATCGG CTTATTACTT AATGGGTTAT AGTGCGATTC CCTTTTATAT ATTTTCTGCA TTGTTATTCT TTATTCCATT CGCCTTAATG ATGGCTGAAA TGGGAGCGGC TTATCGCAAA GAAGAAGGTG GTATCTATTC CTGGATGAAT AATAGTGTCG GACCACGTTT TGCCTTCATT GGTACATTTA TGTGGTTTTC CTCTTATATC ATCTGGATGG TGAGTACCTC AGCGAAAGTT TGGGTACCGT TCTCAACATT CCTCTATGGT AGCGACATGA CCCAGCACTG GCGTATTGCC GGACTGGAGC CTACGCAGGT GGTTGGTCTG CTGGCAGTGG CATGGATGAT TCTGGTCACC GTCGTTGCTT CAAAGGGGAT TAATAAAATT GCCCGCATTA CTGCGGTGGG CGGTATTGCA GTAATGTGTC TGAATTTAGT ATTGCTGTTA GTAAGCATTA CTATTTTGTT ATTAAATGGT GGGCATTTCG CGCAGGATAT TAATTTCCTT GCATCACCGA ACCCAGGTTA TCAGTCCGGT CTGGCAATGC TATCGTTTGT GGTATTTGCT ATTTTTGCCT ATGGCGGAAT TGAAGCGGTG GGTGGTCTGG TCGATAAAAC GGAAAATCCA GAAAAGAACT TTGCCAAAGG TATTGTTTTT GCCGCTATTG TTATTTCAAT CGGTTATTCG CTGGCAATAT TTTTATGGGG CGTCAGCACA AACTGGCAGC AGGTATTAAG TAATGGTTCC GTTAACCTCG GCAATATTAC CTATGTGCTG ATGAAGAGCC TCGGGGTGAC GCTGGGTAAC GCACTGCATT TTTCACCTGA AGCGTCATTG TCGCTGGGCG TATGGTTTGC GCGTATTACC GGACTGTCGA TGTTCCTCGC TTATACCGGT GCGTTCTTTA CGCTTTGCTA TTCACCGCTG AAAGCCATCA TCCAGGGAAC GCCGAAAGCA TTGTGGCCGG AACCGATGAC GCGCCTGAAT ACGATGGGGA TGCCGTCTAT TGCCATGTGG ATGCAGTGCG GGTTGGTTAC TATCTTCATT CTGCTGGTTT CGTTTGGTGG CGGTACCGCA TCGGCGTTCT TTAACAAGCT GACGCTGATG GCGAACGTGT CTATGACGCT TCCTTACCTG TTCCTCGCGC TGGCTTTCCC ATTTTTTAAA GCACGTCAGG ATCTCGACAG ACCGTTTGTG ATTTTCAAAA CGCATATGTC GGCAATGATT GCAACAGTGG TTGTCGTACT GGTGGTGACA TTTGCGAACG TCTTCACCAT CATTCAACCT GTGGTTGAAG CTGGAGACTG GGACAGCACA TTGTGGATGA TTGGCGGCCC TGTCTTCTTC TCGCTGTTAG CGATGGCGAT TTACCAGAAC TATTGCAGTC GCATGGCGAA CAAACCTGAG TTAGCTCTCG ACTGA
Protein sequence | MGAFYTRAEM KDGFMPHTIK KMSLIGLILM IFTSVFGFAN SPSAYYLMGY SAIPFYIFSA LLFFIPFALM MAEMGAAYRK EEGGIYSWMN NSVGPRFAFI GTFMWFSSYI IWMVSTSAKV WVPFSTFLYG SDMTQHWRIA GLEPTQVVGL LAVAWMILVT VVASKGINKI ARITAVGGIA VMCLNLVLLL VSITILLLNG GHFAQDINFL ASPNPGYQSG LAMLSFVVFA IFAYGGIEAV GGLVDKTENP EKNFAKGIVF AAIVISIGYS LAIFLWGVST NWQQVLSNGS VNLGNITYVL MKSLGVTLGN ALHFSPEASL SLGVWFARIT GLSMFLAYTG AFFTLCYSPL KAIIQGTPKA LWPEPMTRLN TMGMPSIAMW MQCGLVTIFI LLVSFGGGTA SAFFNKLTLM ANVSMTLPYL FLALAFPFFK ARQDLDRPFV IFKTHMSAMI ATVVVVLVVT FANVFTIIQP VVEAGDWDST LWMIGGPVFF SLLAMAIYQN YCSRMANKPE LALD
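The gene sequence begins with GTG while the protein begins with M, which is what translation table 11 (the bacterial, archaeal and plant plastid code) gives when GTG is used as an initiation codon. The sketch below is a minimal, dependency-free illustration of that rule, not the method used to produce this record. The function name `translate_cds` and the constant names are hypothetical, and the codon table is built from the compact amino acid string NCBI publishes for tables 1 and 11.

```python
from itertools import product

# NCBI compact codon table layout: bases in the order T, C, A, G,
# with the first base varying slowest and the third fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Initiation codons listed by NCBI for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading an alternative start codon as M.

    Hypothetical helper: whitespace (such as the 10-base grouping used in
    this record) is ignored, and translation stops at the first stop codon.
    """
    dna = "".join(dna.split()).upper()
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGGCGCGT TTTATACACG CGCTGAAATG"))
```

Applied to the first 30 bases, this reproduces the first ten residues of the protein sequence above (MGAFYTRAEM). Passing the full gene sequence should work the same way, since the function ignores the spaces between the 10-base groups.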