Gene Information |
Locus tag | ECH74115_4738 |
Symbol | |
ID | 6972170 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 4381381 |
End bp | 4382796 |
Gene Length | 1416 bp |
Protein Length | 471 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 643388439 |
Product | hypothetical protein |
Protein accession | YP_002272867 |
Protein GI | 209400201 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG0226] ABC-type phosphate transport system, periplasmic component |
TIGRFAM ID | |
|
|
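The coordinate and length fields above are mutually consistent, which a short check can confirm. This is a minimal sketch, not part of the record itself. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon. The function names `span_length` and `expected_protein_length` are hypothetical helpers introduced for illustration.

```python
# Values copied from the Gene Information table above.
START_BP = 4381381
END_BP = 4382796
GENE_LENGTH_BP = 1416
PROTEIN_LENGTH_AA = 471


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    # Both checks should agree with the table if the assumptions hold.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

The strand field ("-") does not affect either calculation; it only indicates that the gene is encoded on the reverse strand of the replicon.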
Plasmid Coverage information |
Num covering plasmid clones | 26 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 60 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCTGGGTG TCGGGCTTAC CGGTATTATT GAAGTTTGTA ATATCCTTAT CACGCCAACA ATTTATCTTC TACTCAACGT CTTTATGCTG ACGCTGGGGG CGATAATAAT ATTTTTCTCT GGTCGCGTGT GGGCCGGTGA TAGCGCGCCA GAAAACAGAG AAATAGCCGT CTGGCGGCAA TGCTTTTTTC TCTTACCCGC GCTATTAACC CTGGTTGGCT GGATAATCAC GCTACATCTG GCAGATTATC AATTTCGCCA GATGGGAGCT GGTTGGTTGG CAAACCTTAT GCTTCCCTGG TTGGGCGTTT TTTTAGTCTC ATTAGTGGGT GGTGAGTACT GGTGGATGGT CATTATTCCC GTTGGGGCGC ATATCAGTTT TTCGCTGGGA TACGCCTGGC CGACCAGATA TCCTTTATCC GGCACGTCCG GACTACGTTG CCGTAACTTA CTCCTGTTTC TACTTCTCTT ACTTGGTATT GTCGCCGGGT ATCAGGCCCA TTTATATAAG CAGCAAAATC CTGGTGTCGG TGTACGCGAA AATATTGATA TCAGGGCCTG GCGACCCGAT AAACTCAATA ATCGACTGAC GCCGCTGCGT GGCAAACCGC AAATTCAGTT TAGGCAAAAC TGGCCGCGAA TCGATGGCGC CACGGCTGCG TACCCAATTT ATGCTTCTGC ATTTTATGCA TTAAGTGTAA TACCAGAGGA TTTTCACGTT TGGGAATATC TGGAGAACTC TCGTACCCCC GATGCATATA ACCGGATTGT TAAAGGTGAT GCCGATATTA TTTTTGTGGC GCAACCCTCC GGCGGGCAGA AAAAACGCGC TGAGGAATCG GGCGTCACTT TGCTATACAC GCCATTTGCC CGTGAAGCAT TTGTTTTCAT CGTCAATGCG GATAATCCGG TTAATTCCCT GACTGAACAA CAGGTGCGTG ACATTTTCAG TGGTGCAATT ACCAATTGGC GTACTGTTGG CGGTAACGAT CAGGAGATCC AGACCTGGCA GCGCCCGGAA GACTCTGGCA GCCAGACAGT GATGCAATCA CAGGTCATGA AAAAAGTCCG CATGATCTCG CCGCAGGAAA CGGAAGTGGC AAGCGTGATG GAGGGAATGA TTAAAGTCGT TGCCGAATAC CGTAATACAA ACAACGCAAT AGGCTATACC TTCCGCTATT ACGCGACGCA AATGAATGCT GATAAAAATA TAAGATTGCT AGCGATTAAC GGTATTACAC CGACGGCGGA AAACATTCGC AACGGCAAAT ATGCGTACAT CGTCGATGCA TTTATGGTGA CGAGAGAAAA TACAACGTCA GAAACACAAA AACTGGTCGA ATGGTTTTTA ACGCCGCAGG GGCAGAGTCT GGTAGAAGAT GTGGGATATG TGCCGCTGTA TCTAACAATG GAATAA
|
Protein sequence | MLGVGLTGII EVCNILITPT IYLLLNVFML TLGAIIIFFS GRVWAGDSAP ENREIAVWRQ CFFLLPALLT LVGWIITLHL ADYQFRQMGA GWLANLMLPW LGVFLVSLVG GEYWWMVIIP VGAHISFSLG YAWPTRYPLS GTSGLRCRNL LLFLLLLLGI VAGYQAHLYK QQNPGVGVRE NIDIRAWRPD KLNNRLTPLR GKPQIQFRQN WPRIDGATAA YPIYASAFYA LSVIPEDFHV WEYLENSRTP DAYNRIVKGD ADIIFVAQPS GGQKKRAEES GVTLLYTPFA REAFVFIVNA DNPVNSLTEQ QVRDIFSGAI TNWRTVGGND QEIQTWQRPE DSGSQTVMQS QVMKKVRMIS PQETEVASVM EGMIKVVAEY RNTNNAIGYT FRYYATQMNA DKNIRLLAIN GITPTAENIR NGKYAYIVDA FMVTRENTTS ETQKLVEWFL TPQGQSLVED VGYVPLYLTM E
|
| |
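The relationship between the gene sequence and the protein sequence above can be illustrated with a small translation routine. This is a minimal sketch under stated assumptions: it uses the standard codon assignments, which translation table 11 shares with the standard code for elongation codons (the two tables differ in which codons may act as starts), and it assumes the gene sequence is shown in coding orientation. The function `translate` is a hypothetical helper, not a tool used by the database.

```python
# Standard codon table in NCBI order (bases T, C, A, G at each position).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    Trailing bases that do not form a full codon are dropped.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three 10-base groups of the gene sequence from the record.
    print(translate("ATGCTGGGTG TCGGGCTTAC CGGTATTATT"))
```

Applied to the first 30 bases of the gene sequence, the routine yields the first ten residues of the listed protein sequence; the same function can be applied to the full sequence.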