Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcDH1_4258 |
Symbol | |
ID | 0 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli DH1 |
Kingdom | Bacteria |
Replicon accession | CP001637 |
Strand | - |
Start bp | 4622986 |
End bp | 4624233 |
Gene Length | 1248 bp |
Protein Length | 415 aa |
Translation table | 11 |
GC content | 46% |
IMG OID | |
Product | aromatic amino acid transporter |
Protein accession | ACX41856 |
Protein GI | 260451434 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 57 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACTGATC AAGCTGAAAA AAAGCACTCT GCATTTTGGG GTGTTATGGT TATAGCAGGT ACAGTAATTG GTGGAGGTAT GTTTGCTTTA CCTGTTGATC TTGCCGGTGC CTGGTTTTTC TGGGGTGCCT TTATCCTTAT CATTGCCTGG TTTTCAATGC TTCATTCCGG GTTATTGTTA TTAGAAGCAA ATTTAAATTA TCCCGTCGGC TCCAGTTTTA ACACCATCAC CAAAGATTTA ATCGGTAACA CCTGGAACAT TATCAGCGGT ATTACCGTTG CCTTCGTTCT CTATATCCTC ACTTATGCCT ATATCTCTGC TAATGGTGCG ATCATTAGTG AAACGATATC AATGAATTTG GGTTATCACG CTAATCCACG TATTGTCGGG ATCTGCACAG CCATTTTCGT TGCCAGCGTA TTGTGGTTAA GTTCGTTAGC CGCCAGTCGT ATTACCTCAT TGTTCCTCGG GCTGAAGATT ATCTCCTTTG TGATCGTGTT TGGTTCTTTT TTCTTCCAGG TCGATTACTC CATTCTGCGC GACGCCACCA GCTCCACTGC GGGAACGTCT TACTTCCCGT ATATCTTTAT GGCTTTGCCG GTGTGTCTGG CGTCATTTGG TTTCCACGGC AATATTCCCA GCCTGATTAT TTGCTATGGA AAACGCAAAG ATAAGTTAAT CAAAAGCGTG GTATTTGGTT CGCTGCTGGC GCTGGTGATT TATCTCTTCT GGCTCTATTG CACCATGGGG AATATTCCGC GAGAAAGCTT TAAGGCGATT ATCTCCTCAG GCGGCAACGT TGATTCGCTG GTGAAATCGT TCCTCGGCAC CAAACAGCAC GGCATTATCG AGTTTTGCCT GCTGGTGTTC TCTAACTTAG CTGTTGCCAG TTCGTTCTTT GGTGTCACGC TGGGGTTGTT CGATTATCTG GCGGACCTGT TTAAGATTGA TAACTCCCAC GGCGGGCGTT TCAAAACCGT GCTGTTAACC TTCCTGCCAC CTGCGTTGTT GTATCTGATC TTCCCGAACG GCTTTATTTA CGGGATCGGC GGTGCCGGGC TGTGCGCCAC CATCTGGGCG GTCATTATTC CCGCAGTGCT TGCAATCAAA GCTCGCAAGA AGTTTCCCAA TCAGATGTTC ACGGTCTGGG GCGGCAATCT TATTCCGGCG ATTGTCATTC TCTTTGGTAT AACCGTGATT TTGTGCTGGT TCGGCAACGT CTTTAACGTG TTACCTAAAT TTGGCTAA
|
Protein sequence | MTDQAEKKHS AFWGVMVIAG TVIGGGMFAL PVDLAGAWFF WGAFILIIAW FSMLHSGLLL LEANLNYPVG SSFNTITKDL IGNTWNIISG ITVAFVLYIL TYAYISANGA IISETISMNL GYHANPRIVG ICTAIFVASV LWLSSLAASR ITSLFLGLKI ISFVIVFGSF FFQVDYSILR DATSSTAGTS YFPYIFMALP VCLASFGFHG NIPSLIICYG KRKDKLIKSV VFGSLLALVI YLFWLYCTMG NIPRESFKAI ISSGGNVDSL VKSFLGTKQH GIIEFCLLVF SNLAVASSFF GVTLGLFDYL ADLFKIDNSH GGRFKTVLLT FLPPALLYLI FPNGFIYGIG GAGLCATIWA VIIPAVLAIK ARKKFPNQMF TVWGGNLIPA IVILFGITVI LCWFGNVFNV LPKFG
|
| |