Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcE24377A_4076 |
Symbol | yiaO |
ID | 5588362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli E24377A |
Kingdom | Bacteria |
Replicon accession | NC_009801 |
Strand | + |
Start bp | 4058715 |
End bp | 4059701 |
Gene Length | 987 bp |
Protein Length | 328 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 640927695 |
Product | TRAP transporter solute receptor DctP family protein |
Protein accession | YP_001465055 |
Protein GI | 157157642 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1638] TRAP-type C4-dicarboxylate transport system, periplasmic component |
TIGRFAM ID | [TIGR00787] tripartite ATP-independent periplasmic transporter solute receptor, DctP family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 45 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAATTAC GCTCTGTAAC CTACGCATTA TTCATTGCTG GCCTGGCTGC ATTCAGCACA TCTTCTCTGG CGGCGCAATC TTTACGTTTC GGTTATGAAA CCTCACAAAC CGACTCGCAA CATATTGCGG CGAAAAAATT CAATGATTTA TTGCAGGAGA GAACCAAAGG CGAGCTGAAA TTAAAATTGT TTCCGGATAG CACCCTCGGT AACGCGCAGG CGATGATCAG CGGCGTACGT GGCGGCACCA TCGATATGGA AATGTCCGGC TCGAATAACT TTGCCGGGTT ATCACCAGTG ATGAACTTGC TTGATGTCCC TTTCCTGTTC CGCGATACCG CTCACGCGCA TAAAACGCTC GACGGCAAAG TCGGTGATGA TCTGAAAGCC TCACTTGAAG GTAAAGGACT AAAAGTACTG GCCTACTGGG AAAACGGCTG GCGCGATGTC ACCAACTCGC GCGCACCGGT TAAAACCCCC GCCGACCTGA AAGGGCTGAA AATTCGCACC AACAATAGCC CGATGAATAT CGCCGCATTC AAAGTCTTTG GCGCTAACCC GATCCCGATG CCGTTTGCCG AAGTCTATAC CGGGCTGGAA ACCCGCACTA TCGACGCTCA GGAACACCCG ATCAACGTCG TCTGGTCAGC AAAATTTTTC GAAGTGCAGA AGTACCTTTC TCTGACGCAC CACGCCTATT CCCCGCTTCT GGTGGTGATC AACAAAGCGA AGTTTGATGG CTTAACCCCG GAGTTCCAGC AAGCGCTAAT TTCATCTGCG CAGGAAGCGG GTAACTACCA GCGCAAACTG GTTGCCGAAG ATCAGCAAAA AATCATCGAC GGCATGAAAG AAGCGGGCGT GGAAGTCATC ACCGATCTCG ACCGCAAAGC CTTTAGCGAC GCACTGGGTA CTCAGGTCCG CGACATGTTT GTAAAAGATG TTCCGCAGGG TGCTGATCTA CTGAAAGCCG TGGATGAGGT GCAATAA
|
Protein sequence | MKLRSVTYAL FIAGLAAFST SSLAAQSLRF GYETSQTDSQ HIAAKKFNDL LQERTKGELK LKLFPDSTLG NAQAMISGVR GGTIDMEMSG SNNFAGLSPV MNLLDVPFLF RDTAHAHKTL DGKVGDDLKA SLEGKGLKVL AYWENGWRDV TNSRAPVKTP ADLKGLKIRT NNSPMNIAAF KVFGANPIPM PFAEVYTGLE TRTIDAQEHP INVVWSAKFF EVQKYLSLTH HAYSPLLVVI NKAKFDGLTP EFQQALISSA QEAGNYQRKL VAEDQQKIID GMKEAGVEVI TDLDRKAFSD ALGTQVRDMF VKDVPQGADL LKAVDEVQ
|
| |