Gene Information |
Locus tag | EcSMS35_4015 |
Symbol | |
ID | 6143293 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | + |
Start bp | 4094606 |
End bp | 4095529 |
Gene Length | 924 bp |
Protein Length | 307 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 641618840 |
Product | carboxylate/amino acid/amine transporter |
Protein accession | YP_001745978 |
Protein GI | 170682291 |
COG category | [R] General function prediction only |
COG ID | [COG5006] Predicted permease, DMT superfamily |
TIGRFAM ID | [TIGR00950] Carboxylate/Amino Acid/Amine Transporter |
|
|
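The coordinate and length fields in the Gene Information table can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 4094606
END_BP = 4095529

def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1

def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon

length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 924
print(protein_length(length_bp))  # 307
```

Under these assumptions the results agree with the reported Gene Length (924 bp) and Protein Length (307 aa).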
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.972202 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 56 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGTTCCA CCAGAAAGGG GATGCTGAAC GTTCTGATTG CCGCCGTGTT GTGGGGAAGT TCAGGGGTCT GCGCGCAATA CATCATGGAG CAAAGCCAGA TGTCGTCGCA GTTTTTGACT ATGACGCGTT TGATATTCGC CGGTTTGATT CTGCTGACGC TGTCATTTGT TCATGGCGAT AAAATCTTTT CTATTATTAA CAATCACAAA GATGCCATTA GTCTGCTGAT TTTTTCCGTG GTTGGCGCGC TAACCGTACA GCTCACTTTT TTGCTAACCA TCGAAAAATC GAACGCAGCC ACGGCAACGG TGCTGCAATT CCTCTCACCG ACGATTATCG TCGCCTGGTT CTCACTGGTA CGTAAATCGC GCCCGGGCAT TCTGGTTTTT TGCGCTATTT TGACATCGCT GATCGGGACT TTTTTATTGG TGACACACGG TAATCCGACG TCATTATCGA TCTCTCCTGC CGCGTTATTC TGGGGCATTG CCTCGGCATT TGCTGCTGCA TTCTATACCA CCTATCCCTC AACGCTGATT GCCCGCTATG GCACGTTACC GGTCGTCGGC TGGAGTATGC TCATTGGTGG TCTGATCCTG CTGCCATTTT ATGCCAGACA AGGGACAAAT TTTGTGGTTA ATGGCAGTTT GATTCTGGCG TTTTTTTATT TGGTGGTCAT TGGTACGTCC CTGACATTTA GTCTGTACCT GAAAGGAGCA CAATTAATTG GCGGTCCAAA AGCCAGCATT TTGAGCTGTG CAGAACCATT AAGTAGCGCG CTGCTCTCTT TGCTGTTGCT GGGGATCACA TTCACATTAC CGGACTGGCT GGGAACGCTT CTGATTCTGT CATCAGTGAT TTTGATTTCA ATGGATTCCC GTCGCCGCGC CAGAAAAATA AATCGTCCGG CGCGGCATGA GTGA
|
Protein sequence | MGSTRKGMLN VLIAAVLWGS SGVCAQYIME QSQMSSQFLT MTRLIFAGLI LLTLSFVHGD KIFSIINNHK DAISLLIFSV VGALTVQLTF LLTIEKSNAA TATVLQFLSP TIIVAWFSLV RKSRPGILVF CAILTSLIGT FLLVTHGNPT SLSISPAALF WGIASAFAAA FYTTYPSTLI ARYGTLPVVG WSMLIGGLIL LPFYARQGTN FVVNGSLILA FFYLVVIGTS LTFSLYLKGA QLIGGPKASI LSCAEPLSSA LLSLLLLGIT FTLPDWLGTL LILSSVILIS MDSRRRARKI NRPARHE
|
| |
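The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It builds the codon table from the NCBI base order (T, C, A, G) and uses the amino acid assignments of translation table 11, which match the standard code. The `translate` helper is hypothetical, and it deliberately ignores the alternative start codons that table 11 permits, so treat it as a simplified check rather than a full implementation. Only the first 30 bases of the gene are used here.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G at each position.
BASES = "TCAG"
# Amino acid assignments for translation table 11 ('*' marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    dna = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First three 10-base groups of the gene sequence from the record.
GENE_PREFIX = "ATGGGTTCCA CCAGAAAGGG GATGCTGAAC"
print(translate(GENE_PREFIX))  # MGSTRKGMLN
```

The output matches the first ten residues of the listed protein sequence. Passing the full gene sequence to the same helper would be one way to compare against all 307 residues.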