Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcHS_A4371 |
Symbol | |
ID | 5593370 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli HS |
Kingdom | Bacteria |
Replicon accession | NC_009800 |
Strand | - |
Start bp | 4378976 |
End bp | 4380433 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640923469 |
Product | amino acid/peptide transporter |
Protein accession | YP_001460914 |
Protein GI | 157163596 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG3104] Dipeptide/tripeptide permease |
TIGRFAM ID | [TIGR00924] amino acid/peptide transporter (Peptide:H+ symporter), bacterial |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 33 |
Plasmid unclonability p-value | 0.113976 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAC CCTCACAGCC GCGCGCGATA TACTATATCG TGGCGATCCA AATCTGGGAG TACTTCAGTT TTTACGGCAT GCGTGCCTTA CTCATTCTCT ATCTCACCCA TCAGCTTGGT TTTGATGATA ACCATGCCAT CAGCCTGTTC AGCGCATATG CTTCTCTGGT TTACGTTACC CCTATTCTCG GCGGCTGGCT TGCCGACCGC CTGCTCGGCA ACCGCACTGC AGTGATTGCC GGCGCGCTGT TAATGACCCT TGGCCATGTG GTGCTGGGCA TTGATACAAA TTCAACCTTT AGCCTGTATC TGGCGCTGGC AATCATTATT TGTGGCTACG GTTTATTCAA ATCAAACATC AGCTGTTTGC TTGGCGAGCT CTACGACGAG AACGATCATC GACGTGATGG CGGTTTTTCG CTGCTGTATG CTGCGGGCAA TATCGGTTCT ATCGCCGCCC CAATCGCCTG CGGCCTGGCT GCTCAGTGGT ATGGATGGCA TGTTGGCTTT GCCCTTGCGG GTGGCGGCAT GTTTATCGGT TTGTTGATTT TCTTAAGCGG GCATCGTCAT TTCCAGTCCA CGCGTAGTAT GGATAAAAAA GCGCTCACTA GCGTGAAATT TGCCTTACCA GTATGGAGCT GGTTAGTGGT GATGCTCTGT TTAGCCCCAG TATTTTTTAC CCTGCTGCTG GAGAACGACT GGTCCGGATA TTTGCTGGCG ATCGTTTGCC TCATTGCCGC ACAAATCATT GCCCGCATGA TGATCAAATT CCCCGAACAC CGCCGTGCTC TTTGGCAAAT TGTATTGTTG ATGTTTGTCG GGACGTTGTT CTGGGTACTG GCACAACAGG GCGGCAGTAC CATCAGCTTG TTTATCGATC GCTTTGTGAA TCGTCAGGCA TTCAATATTG AAGTACCTAC AGCACTATTC CAGTCGGTGA ATGCAATTGC AGTGATGCTC GCTGGGGTTG TGCTGGCCTG GCTGGCATCG CCAGAAAGCC GCGGCAACTC AACATTGCGC GTCTGGCTGA AGTTTGCCTT TGGCTTACTG CTGATGGCTT GTGGCTTTAT GTTGTTGGCA TTTGATGCCC GACATGCAGC GGCTGACGGT CAAGCGTCAA TGGGCGTGAT GATATCCGGG TTGGCGCTAA TGGGCTTTGC CGAACTCTTT ATTGACCCGG TGGCGATTGC GCAAATCACG CGTCTGAAAA TGTCTGGCGT ATTAACCGGT ATTTATATGC TGGCAACAGG CGCGGTCGCC AACTGGCTGG CAGGCGTCGT GGCACAGCAG ACGACAGAGT CGCAAATTAG CGGTATGGCA ATTGCAGCTT ACCAGCGATT CTTTTCTCAG ATGGGAGAGT GGACGTTGGC TTGTGTCGCG ATCATCGTCG TATTGGCCTT TGCTACCCGT TTTCTGTTTA GCACGCCGAC GAATATGATA CAGGAGAGCA ACGATTAA
|
Protein sequence | MKTPSQPRAI YYIVAIQIWE YFSFYGMRAL LILYLTHQLG FDDNHAISLF SAYASLVYVT PILGGWLADR LLGNRTAVIA GALLMTLGHV VLGIDTNSTF SLYLALAIII CGYGLFKSNI SCLLGELYDE NDHRRDGGFS LLYAAGNIGS IAAPIACGLA AQWYGWHVGF ALAGGGMFIG LLIFLSGHRH FQSTRSMDKK ALTSVKFALP VWSWLVVMLC LAPVFFTLLL ENDWSGYLLA IVCLIAAQII ARMMIKFPEH RRALWQIVLL MFVGTLFWVL AQQGGSTISL FIDRFVNRQA FNIEVPTALF QSVNAIAVML AGVVLAWLAS PESRGNSTLR VWLKFAFGLL LMACGFMLLA FDARHAAADG QASMGVMISG LALMGFAELF IDPVAIAQIT RLKMSGVLTG IYMLATGAVA NWLAGVVAQQ TTESQISGMA IAAYQRFFSQ MGEWTLACVA IIVVLAFATR FLFSTPTNMI QESND
|
| |