Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_4599 |
Symbol | |
ID | 6146213 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4699920 |
End bp | 4701377 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641619415 |
Product | amino acid/peptide transporter |
Protein accession | YP_001746527 |
Protein GI | 170682504 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.148198 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 50 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAACAC CCTCACAGCC GCGCGCGATA TACTATATCG TGGCGATCCA AATCTGGGAG TACTTCAGTT TTTACGGCAT GCGTGCCTTA CTCATTCTCT ATCTCACCCA TCAGCTTGGT TTTGATGATA ACCATGCCAT CAGCCTGTTC AGCGCATATG CTTCTCTGGT TTACGTTACC CCTATTCTCG GCGGCTGGCT TGCCGACCGC CTGCTCGGCA ACCGCACTGC AGTGATTGCC GGCGCGCTGT TAATGACCCT TGGCCATGTG GTGCTGGGCA TTGATACAAA TTCAACCTTT AGCCTGTATC TGGCGCTGGC AATCATTATT TGTGGCTACG GTTTATTCAA ATCAAACATC AGCTGTTTGC TTGGCGAGCT CTACGACGAG AACGATCATC GACGTGATGG CGGTTTTTCG CTGCTGTATG CTGCGGGCAA TATCGGTTCT ATCGCAGCCC CCATCGCCTG CGGCCTGGCT GCTCAGTGGT ATGGATGGCA TGTTGGCTTT GCCCTTGCGG GAGGCGGCAT GTTTATCGGT TTGTTGATTT TCTTAAGTGG TCATCGTCAT TTCCAGTCCA CGCGTAGTAT GGATAAAAAA GCGCTCACTA GCGTCAAATT TGCCTTACCA GTATGGACCT GGTTAGTGGT GATGCTCTGT TTAGCCCCAG TATTTTTTAC CCTGCTGCTG GAGAACGACT GGTCCGGATA TTTGCTGGCG ATCGTTTGCC TCATTGCCGC ACAAATCATT GCCCGCATGA TGATCAAATT CCCCGAACAC CGCCGTGCTC TTTGGCAAAT TGTATTGTTG ATGTTTGTCG GGACGTTGTT CTGGGTACTG GCACAACAGG GCGGCAGTAC CATCAGCTTG TTTATCGATC GCTTTGTGAA TCGTCAGGCA TTCAATATTG AAGTACCTAC AGCACTATTC CAGTCGGTGA ATGCTATTGC GGTGATGCTC GCTGGGGTTG TTCTGGCCTG GCTGGCGTCG CCAGAAAGCC GCGGCAATTC AGCATTGCGC GTCTGGCTGA AGTTTGCCTT TGGCTTACTG CTGATGGCTT GTGGCTTTAT GTTGCTGGCA TTTGATGCCC GACATGCAGC GGCTGGCGGT CAAGCGTCAA TGGGCGTGAT GATATCCGGG CTGGCGCTAA TGGGCTTTGC TGAACTCTTT ATTGACCCAG TGGCGATTGC GCAAATCACG CGTCTGAAAA TGTCTGGCGT ATTAACCGGT ATTTATATGC TGGCAACAGG CGCGGTCGCC AACTGGCTGG CAGGCGTCGT GGCACAGCAG ACGACAGAGT CGCAAATTAG CGGTATGGCA ATTGCCGCTT ACCAGCGATT CTTTTCTCAG ATGGGAGAGT GGACGTTGGC TTGTGTCGCG ATCATCGTGG TATTGGCCTT TGCTACCCGT TTTCTGTTTA GCACGCCGAC GAATATGATA CAGGAGAGCA ACGATTAA
|
Protein sequence | MKTPSQPRAI YYIVAIQIWE YFSFYGMRAL LILYLTHQLG FDDNHAISLF SAYASLVYVT PILGGWLADR LLGNRTAVIA GALLMTLGHV VLGIDTNSTF SLYLALAIII CGYGLFKSNI SCLLGELYDE NDHRRDGGFS LLYAAGNIGS IAAPIACGLA AQWYGWHVGF ALAGGGMFIG LLIFLSGHRH FQSTRSMDKK ALTSVKFALP VWTWLVVMLC LAPVFFTLLL ENDWSGYLLA IVCLIAAQII ARMMIKFPEH RRALWQIVLL MFVGTLFWVL AQQGGSTISL FIDRFVNRQA FNIEVPTALF QSVNAIAVML AGVVLAWLAS PESRGNSALR VWLKFAFGLL LMACGFMLLA FDARHAAAGG QASMGVMISG LALMGFAELF IDPVAIAQIT RLKMSGVLTG IYMLATGAVA NWLAGVVAQQ TTESQISGMA IAAYQRFFSQ MGEWTLACVA IIVVLAFATR FLFSTPTNMI QESND
|
| |