Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_0724 |
Symbol | |
ID | 6145701 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 730643 |
End bp | 732124 |
Gene Length | 1482 bp |
Protein Length | 493 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641615614 |
Product | amino acid/peptide transporter |
Protein accession | YP_001742813 |
Protein GI | 170682417 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.470702 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.430255 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAAAC ACGCATCACA GCCGCGCGCT ATTTACTATG TCGTTGCGCT GCAAATCTGG GAATATTTTA GCTTTTACGG CATGCGTGCC CTGCTGATTC TCTATCTCAC CAATCAACTA AAATACAACG ATACTCACGC CTACGAGTTA TTTAGCGCCT ACTGTTCGCT GGTGTATGTC ACGCCAATCC TCGGTGGCTT TTTGGCGGAT AAAGTTCTCG GCAATCGCAT GGCGGTGATG CTGGGGGCGT TGTTGATGGC GATCGGTCAT GTGGTGCTGG GTGCCAGTGA GATCCATCCG TCATTCCTCT ATCTGTCACT GGCGATTATC GTCTGCGGCT ATGGTCTGTT TAAATCCAAC GTCAGTTGCC TGCTCGGTGA GCTGTATGAG CCAACCGATC CACGTCGTGA TGGCGGTTTC TCGCTGATGT ATGCGGCGGG TAACGTGGGG TCTATTATCG CACCTATCGC CTGTGGTTTC GCCCAGGAAG AATACAGTTG GGCGATGGGC TTTGGCCTGG CGGCGGTGGG TATGATCGCG GGTCTGGTCA TTTTCTTATG TGGCAATCGT CATTTCTCTC ATACCCGCGG CGTTAACAAA AAGGTACTGC GTGCGACAAA CTTCCTCCTG CCCAACTGGG GATGGCTGCT GGTTCTGCTG GTGGCAACGC CTGCGCTGAT TACCGTGTTG TTCTGGAAAG AGTGGTCGGT ATACGCCTTG ATTGTCGCCA CGATCATCGG CCTTGGCGTT CTGGCAAAAA TTTATCGCAA AGCAGAAAAC CAAAAACAGC GTAAGGAGCT GGGGCTGATT GTTACCCTCA CCTTCTTCAG CATGTTGTTC TGGGCCTTCG CACAACAGGG CGGCAGCTCG ATTAGCCTTT ATATCGACCG CTTCGTTAAC CGCGATATGT TTGGTTATAC CGTTCCGACT GCTATGTTCC AGTCGATTAA TGCCTTCGCA GTTATGCTGT GCGGTGTGTT CCTGGCGTGG GTGGTTAAAG AGAGCGTCGC AGGTAATCGT ACCGTGCGCA TCTGGGGAAA ATTTGCTCTT GGTCTCGGCC TGATGAGCGC CGGATTCTGC ATTCTGACCT TAAGCGCCCG CTGGTCCGCA ATGTATGGTC ATTCTTCTCT GCCACTGATG GTGTTAGGCC TGGCGGTGAT GGGCTTTGCG GAACTGTTTA TCGACCCGGT TGCTATGTCG CAGATTACGC GCATTGAAAT CCCTGGCGTG ACTGGCGTAT TAACTGGCAT TTATATGCTG CTTTCCGGCG CGATTGCGAA CTATCTGGCT GGCGTTATTG CCGATCAGAC ATCGCAAGCT TCGTTTGATG CTTCCGGGGC GATCAACTAT TCCATCAATG CATATATTGA AGTGTTTGAT CAAATTACCT GGGGCGCACT GGCGTGTGTA GGAGTGGTGT TGATGATTTG GCTGTATCAG GCGCTGAAAT TCAGAAACCG CGCGCTGGCG CTGGAGTCTT AA
|
Protein sequence | MNKHASQPRA IYYVVALQIW EYFSFYGMRA LLILYLTNQL KYNDTHAYEL FSAYCSLVYV TPILGGFLAD KVLGNRMAVM LGALLMAIGH VVLGASEIHP SFLYLSLAII VCGYGLFKSN VSCLLGELYE PTDPRRDGGF SLMYAAGNVG SIIAPIACGF AQEEYSWAMG FGLAAVGMIA GLVIFLCGNR HFSHTRGVNK KVLRATNFLL PNWGWLLVLL VATPALITVL FWKEWSVYAL IVATIIGLGV LAKIYRKAEN QKQRKELGLI VTLTFFSMLF WAFAQQGGSS ISLYIDRFVN RDMFGYTVPT AMFQSINAFA VMLCGVFLAW VVKESVAGNR TVRIWGKFAL GLGLMSAGFC ILTLSARWSA MYGHSSLPLM VLGLAVMGFA ELFIDPVAMS QITRIEIPGV TGVLTGIYML LSGAIANYLA GVIADQTSQA SFDASGAINY SINAYIEVFD QITWGALACV GVVLMIWLYQ ALKFRNRALA LES
|
| |