Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0970 |
Symbol | |
ID | 6969905 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 985391 |
End bp | 986656 |
Gene Length | 1266 bp |
Protein Length | 421 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643384990 |
Product | hypothetical protein |
Protein accession | YP_002269490 |
Protein GI | 209397491 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 14 |
Plasmid unclonability p-value | 0.348574 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 61 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCTTCTA CTTTTACCAG CGACACATTG CCTGCCGATC ACAAAGCAGC TATCCGTCAG ATGAAGCACG CGCTGCGGGC GCAGCTTGGC GACGTCCAGC AGATCTTTAA TCAGCTAAGC GATGACATTG CCACGCGAGT GGCTGAAATC AACGCACTCA AAGCACAGGG CGATGCCGTC TGGCCGGTGC TGTCTTATGC CGATATCAAA GCAGGTCATG TTACTGCAGA GCAGCGCGAA CAGATTAAAC GTCGCGGTTG TGCGGTGATA AAAGGCCATT TCCCCCGCGA ACAAGCGCTA GGCTGGGATC AGTCGATGCT GGACTATCTG GACCGCAACC GCTTTGACGA GGTCTACAAA GGCCCCGGCG ATAATTTCTT CGGGACGCTC AGCGCTTCAC GTCCCGAGAT TTACCCCATC TACTGGTCGC AGGCGCAAAT GCAGGCCCGC CAGAGTGAAG AAATGGCGAA TGCGCAGTCG TTTCTCAATC GTCTGTGGAC ATTTGAAAGT GATGGAAAGC AATGGTTTAA CCCGGATGTG AGCGTCATCT ACCCTGACCG TATCCGCCGC CGTCCGCCCG GAACGACCTC CAAAGGTCTT GGAGCGCATA CCGACTCCGG GGCGCTGGAA CGCTGGCTGC TTCCAGCGTA TCAGCACGTT TTCGCTAACG TCTTTAATGG CAATCTGGCG AAATACGATC CCTGGCATGC GGCACATCGT ACGGAAGTTG AAGAGTACAC GGTGGACAAC ACCACCAAAT GTTCCGTGTT TCGGACATTC CAGGGCTGGA CAGCGCTCTC TGATATGCTG CCTAGTCAGG GGTTGCTGCA CGTCGTGCCC ATTCCTGAAG CCATGGCGTA CGTACTGTTA CGTCCGCTGC TTGATGATGT GCCGGAGGAT GAACTGTGCG GCGTAGCGCC CGGAAGAGTG TTGCCGGTAT CAGAGCAATG GCATCCACTG TTAATTGAGG CGTTAACCAG CATTCCAAAA CTCGAGGCCG GAGACTCCGT CTGGTGGCAC TGCGACGTCA TCCATTCCGT TGCCCCCGTT GAAAATCAAC AGGGTTGGGG CAACGTGATG TACATTCCTG CGGCACCGAT GTGCGAGAAA AATCTTGCCT ACGCGCACAA GGTGAAGGCC GCACTGGAAA AAGGCGCATC GCCGGGCGAC TTCCCGCGCG AGGACTATGA AACAAACTGG GAAGGACGCT TTACGCTTGC CGACCTCAAC ATTCACGGTA AGCGAGCGTT GGGCATGGAT GTTTGA
|
Protein sequence | MASTFTSDTL PADHKAAIRQ MKHALRAQLG DVQQIFNQLS DDIATRVAEI NALKAQGDAV WPVLSYADIK AGHVTAEQRE QIKRRGCAVI KGHFPREQAL GWDQSMLDYL DRNRFDEVYK GPGDNFFGTL SASRPEIYPI YWSQAQMQAR QSEEMANAQS FLNRLWTFES DGKQWFNPDV SVIYPDRIRR RPPGTTSKGL GAHTDSGALE RWLLPAYQHV FANVFNGNLA KYDPWHAAHR TEVEEYTVDN TTKCSVFRTF QGWTALSDML PSQGLLHVVP IPEAMAYVLL RPLLDDVPED ELCGVAPGRV LPVSEQWHPL LIEALTSIPK LEAGDSVWWH CDVIHSVAPV ENQQGWGNVM YIPAAPMCEK NLAYAHKVKA ALEKGASPGD FPREDYETNW EGRFTLADLN IHGKRALGMD V
|
| |