Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcolC_2814 |
Symbol | |
ID | 6065034 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli ATCC 8739 |
Kingdom | Bacteria |
Replicon accession | NC_010468 |
Strand | - |
Start bp | 3076082 |
End bp | 3077620 |
Gene Length | 1539 bp |
Protein Length | 512 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641602220 |
Product | extracellular solute-binding protein |
Protein accession | YP_001725769 |
Protein GI | 170020815 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0747] ABC-type dipeptide transport system, periplasmic component |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.993061 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 26 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCAAGAG CTGTACACCG TAGTGGGTTA GTGGCGCTGG GCATTGCGAC AGCGTTGATG GCATCTTGTG CATTCGCTGC CAAAGATGTG GTGGTGGCGG TAGGATCGAA CTTCACCACG CTCGATCCGT ATGACGCAAA TGACACGTTA TCTCAGGCCG TAGCGAAATC GTTTTACCAG GGGCTGTTTG GTCTGGATAA AGAGATGAAA CTGAAAAACG TGCTGGCGGA GAGTTATACC GTTTCCGATG ACGGCCTTAC ATACACCGTG AAATTGCGGG AAGGCATTAA ATTCCAGGAT GGCACCGATT TCAACGCAGC GGCAGTAAAA GCGAATCTGG ACCGGGCCAG CGATCCGGCG AATCATCTTA AACGCTATAA CCTGTATAAG AATATTGCCA AAACGGAGGC GATCGATCCG ACAACGGTAA AGATTACTCT CAAACAGCCG TTTTCAGCGT TTATTAATAT TCTTGCCCAT CCGGCGACCG CGATGATTTC ACCGGCAGCG CTGGAAAAAT ATGGCAAGGA GATTGGTTTT CATCCGGTGG GAACCGGACC GTATGAACTG GATACCTGGA ATCAGACCGA TTTTGTGAAG GTGAAAAAAT TCGCGGGTTA CTGGCAGCCA GGATTGCCCA AACTGGACAG CATAACCTGG CGTCCGGTGG CGGATAACAA CACCCGCGCG GCAATGCTGC AAACCGGTGA AGCGCAGTTT GCTTTCCCCA TTCCTTACGA GCAGGCCGCA CTGCTGGAGA AAAACAAAAA TATCGAGTTG ATGGCCAGTC CGTCAATTAT GCAGCGTTAT ATCAGTATGA ACGTGACGCA AAAACCGTTC GATAACCCGA AGGTCCGTGA GGCGCTGAAT TACGCCATTA ACCGCCCGGC GCTGGTGAAA GTAGCCTTCG CGGGCTATGC AACGCCAGCT ACTGGTGTGG TACCGCCGAG TATCGCCTAC GCGCAAAGTT ATAAACCGTG GCCTTACGAT CCAGTGAAAG CGCGCGAATT ACTGAAAGAG GCGAGATATC CCAACGGTTT CAGTACCACG CTGTGGTCGT CACATAACCA CAGCACCGCG CAGAAAGTGC TGCAATTTAC CCAGCAGCAG TTAGCGCAGG TCGGGATTAA AGCCCAGGTG ACTGCGATGG ATGCCGGACA GCGGGCGGCA GAAGTTGAAG GTAAAGGGCA AAAAGAGAGC GGCGTGCGGA TGTTCTACAC TGGCTGGTCG GCTTCAACCG GCGAAGCTGA CTGGGCACTA TCGCCGCTGT TTGCATCGCA AAACTGGCCA CCGACGCTGT TTAATACCGC GTTTTACAGC AATAAACAGG TGGATGACTT CCTGGATCAG GCACTGAAAA CGAATGATCC GGTGGAAAAG ACCCGCTTAT ATAAGGCGGC GCAGGATATC ATCTGGCAAG AATCGCCGTG GATCCCGCTG GTGGTAGAAA AACTGGTGTC GGCACACAGT AAAAACCTGA CCGGTTTTTG GATCATGCCA GACACCGGCT TCAGCTTTGA AGACGCGGAT TTGCAATAA
|
Protein sequence | MARAVHRSGL VALGIATALM ASCAFAAKDV VVAVGSNFTT LDPYDANDTL SQAVAKSFYQ GLFGLDKEMK LKNVLAESYT VSDDGLTYTV KLREGIKFQD GTDFNAAAVK ANLDRASDPA NHLKRYNLYK NIAKTEAIDP TTVKITLKQP FSAFINILAH PATAMISPAA LEKYGKEIGF HPVGTGPYEL DTWNQTDFVK VKKFAGYWQP GLPKLDSITW RPVADNNTRA AMLQTGEAQF AFPIPYEQAA LLEKNKNIEL MASPSIMQRY ISMNVTQKPF DNPKVREALN YAINRPALVK VAFAGYATPA TGVVPPSIAY AQSYKPWPYD PVKARELLKE ARYPNGFSTT LWSSHNHSTA QKVLQFTQQQ LAQVGIKAQV TAMDAGQRAA EVEGKGQKES GVRMFYTGWS ASTGEADWAL SPLFASQNWP PTLFNTAFYS NKQVDDFLDQ ALKTNDPVEK TRLYKAAQDI IWQESPWIPL VVEKLVSAHS KNLTGFWIMP DTGFSFEDAD LQ
|
| |