Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_3861 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | + |
Start bp | 4158170 |
End bp | 4159627 |
Gene Length | 1458 bp |
Protein Length | 485 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | |
Product | amino acid/peptide transporter |
Protein accession | ACX41462 |
Protein GI | 260451040 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 56 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAAACAC CCTCACAGCC GCGCGCGATA TACTATATCG TGGCGATCCA AATCTGGGAG TACTTCAGTT TTTACGGCAT GCGTGCCTTA CTCATTCTCT ATCTCACCCA TCAGCTTGGT TTTGATGATA ACCATGCCAT CAGCCTGTTC AGCGCATATG CTTCTCTGGT TTACGTTACC CCTATTCTCG GCGGCTGGCT TGCCGACCGC CTGCTCGGCA ACCGCACTGC AGTGATTGCC GGCGCGCTGT TAATGACCCT TGGCCATGTG GTGCTGGGTA TTGATACAAA TTCAACCTTT AGCCTGTATC TGGCGCTGGC AATCATTATT TGTGGCTACG GTTTATTCAA ATCAAACATC AGCTGTTTGC TTGGCGAGCT CTACGACGAG AACGATCATC GACGTGATGG CGGTTTTTCG CTGCTGTATG CTGCGGGCAA TATCGGTTCT ATCGCAGCCC CAATCGCCTG CGGCCTGGCT GCTCAGTGGT ATGGATGGCA TGTTGGCTTT GCCCTTGCGG GTGGCGGCAT GTTTATCGGT TTGTTGATTT TCTTAAGCGG TCATCGTCAT TTCCAGTCCA CACGTAGTAT GGATAAAAAA GCGCTCACTA GTGTCAAATT TGCCTTACCA GTATGGAGCT GGTTAGTGGT GATGCTCTGT TTAGCCCCAG TATTTTTTAC TCTGCTGCTG GAGAACGACT GGTCCGGATA TTTGCTGGCG ATCGTTTGCC TCATTGCCGC ACAAATCATT GCCCGCATGA TGATCAAATT CCCCGAACAT CGCCGTGCTC TTTGGCAAAT TGTATTGTTG ATGTTTGTCG GGACATTGTT CTGGGTACTG GCACAACAGG GCGGCAGTAC CATCAGCTTG TTTATCGATC GCTTTGTGAA TCGTCAGGCA TTCAATATTG AAGTACCTAC AGCACTATTC CAGTCGGTGA ATGCAATTGC GGTGATGCTC GCTGGGGTTG TACTGGCCTG GCTGGCGTCG CCAGAAAGCC GCGGCAACTC AACATTGCGC GTCTGGCTGA AGTTTGCCTT TGGCTTACTG CTGATGGCTT GTGGCTTTAT GTTGTTGGCA TTTGATGCCC GACATGCAGC GGCTGACGGT CAAGCGTCAA TGGGCGTGAT GATATCCGGG CTGGCGCTAA TGGGCTTTGC CGAACTCTTT ATTGACCCGG TGGCGATTGC GCAAATCACG CGTCTGAAAA TGTCTGGCGT ATTAACCGGG ATTTATATGC TGGCAACAGG CGCGGTCGCC AACTGGCTGG CAGGCGTCGT GGCACAGCAG ACGACAGAGT CGCAAATTAG CGGTATGGCA ATTGCAGCTT ACCAGCGATT CTTTTCTCAG ATGGGAGAGT GGACGTTGGC TTGTGTCGCG ATCATCGTCG TATTGGCCTT TGCTACCCGT TTTCTGTTTA GCACGCCGAC GAATATGATA CAGGAGAGCA ACGATTAA
|
Protein sequence | MKTPSQPRAI YYIVAIQIWE YFSFYGMRAL LILYLTHQLG FDDNHAISLF SAYASLVYVT PILGGWLADR LLGNRTAVIA GALLMTLGHV VLGIDTNSTF SLYLALAIII CGYGLFKSNI SCLLGELYDE NDHRRDGGFS LLYAAGNIGS IAAPIACGLA AQWYGWHVGF ALAGGGMFIG LLIFLSGHRH FQSTRSMDKK ALTSVKFALP VWSWLVVMLC LAPVFFTLLL ENDWSGYLLA IVCLIAAQII ARMMIKFPEH RRALWQIVLL MFVGTLFWVL AQQGGSTISL FIDRFVNRQA FNIEVPTALF QSVNAIAVML AGVVLAWLAS PESRGNSTLR VWLKFAFGLL LMACGFMLLA FDARHAAADG QASMGVMISG LALMGFAELF IDPVAIAQIT RLKMSGVLTG IYMLATGAVA NWLAGVVAQQ TTESQISGMA IAAYQRFFSQ MGEWTLACVA IIVVLAFATR FLFSTPTNMI QESND
|
| |