Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3457 |
Symbol | mtr |
ID | 6143895 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3531329 |
End bp | 3532573 |
Gene Length | 1245 bp |
Protein Length | 414 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618286 |
Product | tryptophan permease |
Protein accession | YP_001745435 |
Protein GI | 170683709 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0814] Amino acid permeases |
TIGRFAM ID | [TIGR00837] aromatic amino acid transport protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.813995 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAACAC TAACCACCAC CCAAACGTCA CCGTCGCTGC TTGGCGGCGT GGTGATTATC GGCGGCACCA TTATTGGCGC AGGGATGTTT TCTCTGCCAG TGGTCATGTC CGGGGCGTGG TTCTTCTGGT CAATGGCGGC GCTGATCTTT ACCTGGTTCT GTATGCTGCA TTCCGGCTTG ATGATTCTGG AAGCTAACCT GAATTATCGA ATCGGTTCGA GCTTTGACAC CATCACCAAA GACCTGCTCG GCAAAGGCTG GAACGTAGTC AACGGCATTT CCATTGCCTT TGTGCTCTAT ATCCTGACTT ACGCCTATAT TTCTGCCAGC GGTTCGATTC TGCATCACAC CTTCGCGGAG ATGTCGCTGA ACGTCCCTGC ACGGGCAGCT GGATTTGGTT TTGCACTACT GGTGGCGTTT GTGGTGTGGT TGAGCACCAA AGCGGTCAGC CGGATGACGG CGATTGTGCT GGGGGCGAAA GTCATTACGT TTTTCCTCAC CTTCGGTAGC CTGCTGGGAC ATGTGCAGCC AACGACCTTG TTCAACGTTG CCGAAAGCAA TGCGTCTTAT GCGCCGTATC TGCTGATGAC ACTGCCATTC TGTCTGGCAT CTTTTGGTTA TCACGGTAAC GTGCCGAGCC TGATGAAGTA TTACGGCAAA GATCCGAAAA CCATCGTGAA ATGCCTGGTG TACGGTACGC TGATGGCGCT GGCGCTGTAT ACCATCTGGT TGCTGGCGAC GATGGGCAAC ATCCCTCGTC CGGAGTTTAT CGGCATCGCC GAGAAGGGCG GTAATATTGA TGTGCTGGTA CAGGCGTTAA GCGGCGTGCT GAACAGCCGT AGCCTGGATC TGCTGCTGGT TGTGTTCTCA AACTTTGCGG TAGCGAGTTC GTTCCTCGGC GTTACGCTGG GTTTGTTTGA CTATCTGGCA GATCTGTTTG GTTTCGATGA CTCGGCTATG GGCCGCTTGA AAACAGCGTT GCTGACCTTT GCCCCGCCTG TTGTGGGTGG CCTGCTGTTT CCTAACGGAT TCCTGTACGC CATTGGTTAT GCTGGCTTAG CGGCTACCAT CTGGGCGGCA ATTGTTCCGG CGCTGTTGGC CCGCGCATCG CGTAAACGCT TTGGTAGCCC GAAATTCCGC GTCTGGGGCG GCAAGCCGAT GATTATGCTG ATTCTGGTAT TTGGCGTTGG CAACGCACTG GTCCATATCT TATCGAGCTT TAATTTGCTG CCGGTGTATC AGTAA
|
Protein sequence | MATLTTTQTS PSLLGGVVII GGTIIGAGMF SLPVVMSGAW FFWSMAALIF TWFCMLHSGL MILEANLNYR IGSSFDTITK DLLGKGWNVV NGISIAFVLY ILTYAYISAS GSILHHTFAE MSLNVPARAA GFGFALLVAF VVWLSTKAVS RMTAIVLGAK VITFFLTFGS LLGHVQPTTL FNVAESNASY APYLLMTLPF CLASFGYHGN VPSLMKYYGK DPKTIVKCLV YGTLMALALY TIWLLATMGN IPRPEFIGIA EKGGNIDVLV QALSGVLNSR SLDLLLVVFS NFAVASSFLG VTLGLFDYLA DLFGFDDSAM GRLKTALLTF APPVVGGLLF PNGFLYAIGY AGLAATIWAA IVPALLARAS RKRFGSPKFR VWGGKPMIML ILVFGVGNAL VHILSSFNLL PVYQ
|
| |