Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1852 |
Symbol | |
ID | 6971286 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1754235 |
End bp | 1756181 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643385788 |
Product | hypothetical protein |
Protein accession | YP_002270277 |
Protein GI | 209399249 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0338393 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 0.0000000502034 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACATTTA AACATTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCTGATGCA CTTGCGCAAA AAATTCGTGA AGGATGGCAA CCATATGGTG GGCCGTTTTC TTCGTATACG GATGATGGCG CAGCACTTAT TCAGGCGATT GTCGCAGAAG GTGATGTGAG CACACCTGTT GTGGTGAAGC CGACAGGTGG AGAAGGTGCA GTAATCAGCG CCACCAGCGA CCCCGGGTAT TACTTTGTTG TGGTTCTGGC AGGGCAGTCA AACGGCATGT CGTATGGTGA AGGTCTTCCG CTGCCGGAGA CATATGACCG TCCGGACCCG CGCATTAAGC AGCTGGCGCG TCGCAGTACG GTGACACCGG GCGGTGTCGC CTGTAAATAT AACGACATCA TTCCGGCGGA CCATTGTCTG CATGATGTGC AGGACATGAG CCGCCTTAAC CATCCGAAAG CGGACCTGTC AAAGGGGCAG TACGGAACCG TGGGGCAGGG GCTGCATATC GCCAAAAAAT TGCTGCCGTT TATACCGGCG AATGCGGGCA TTCTGCTGGT TCCGTGCTGT CGTGGTGGTT CAGCGTTCAC CACCGGAGCT GATGGCACAT ACAGTGACGC GAGTGGTGCC TCGGAGAATT CAACCCGCTG GGGTGTGGAC AAGCCGCTGT ATAAGGACCT TATCGGTCGA ACAAAAGCAG CACTGAAGAA GAATCCGAAA AATGTGCTGT TTGCCGTGGT GTGGATGCAG GGGGAATTTG ATTTTGGCGG TACGCCGGCA AATCACGCAG CACAGTTTGG TGCGCTGGTT GATAAATTCC GTGCAGACCT GGCGGATATG GCAGGTCAGT GCGTCGGTGG CTCTGCTGGC GGTGTTCCCT GGATATGTGG AGATACGACG TATTTCTGGA AGCAGAAGAA CGAATCCACG TACCAGACGG TGTACGGCAG CTATAAAAAC AAAACGGAAA AGAATATCCA TTTCGTACCG TTCATGACCG ATGAGAACGG GGTGAATGTG CCGACGAACA AACCGGAAGA AGACCCGGAC ATTCCGGGTA TCGGATATTA CGGTTCGAAA TGGCGTGACA GCTCAGCCAC CTGGACGTCA CAGGACAGGG CGAGCCATTT CAGTTCATGG GCTCGCCGCG GGATTATTTC CGACCGTCTG GCAACGGCGA TTTTGCGCCA TGCGGGAAGA GTGGCGCTAA ACGCGGGGGC ATCATCGACA GTATCAGAGG TGCGCCCGTC ATCGCCTTCC GGTGCAGAAG CCACAGGCGT CACAACACTG CTCTCTTACC TTGCCAGCGA GTCAGAGGGA AGCCTGAAAG TACAGGGATG GTCAGCCAGT GGCGGCAGGG CAGAAGTGGT CAGCGATGCG GAGGGAACCG GAGGTAAGGC AGTGAAGCTG ACCAAGGAAG CCGGTAAAAG CAGCTGGGTG CTGGAGTACG CCGCGGGCAA CGGTGCGGCT CTGTTACAGA AAGGGGGGCA GATTCGCTGC CGCTTTAAGG TTTCGGGAGC GCTGGCTGCG AACCAGTATG TTATGGCGTT TTACTGGCCG GTATCTTCAC TGCCACAGGG CGTTGCCCTG ACCGGAGACG GGGGGAATAA CCTGCTGGCA GCGTTCTACA TCCAGACAGA TGCAAAAGAC CTGAATGTGA TGTACCACAA TGCGAAAGTG GCGACAAACA ACCTGAAACT GGGAACCTTT GGCGCATTTG ATAACGAATG GCATACGCTG GCTTTCCGCT TTGCCGGGAA TAACAGCCTG CAGGTGACGC CGGTTATTGA TGGTCAGGAT GGCACACCGT TCACGCTGAC GCAGTCACCG GTCAGTGCCT TTGCGGCGGA TAAACTGCAT GTGACAGACA TTACCAGAGG TGCGACTTAC CCGGTACTGA TAGACAGCAT TGCGGTGGAA GTGAACAGCA CAGACACTGC GGCATGA
|
Protein sequence | MTFKHYDVVR AASPSDLADA LAQKIREGWQ PYGGPFSSYT DDGAALIQAI VAEGDVSTPV VVKPTGGEGA VISATSDPGY YFVVVLAGQS NGMSYGEGLP LPETYDRPDP RIKQLARRST VTPGGVACKY NDIIPADHCL HDVQDMSRLN HPKADLSKGQ YGTVGQGLHI AKKLLPFIPA NAGILLVPCC RGGSAFTTGA DGTYSDASGA SENSTRWGVD KPLYKDLIGR TKAALKKNPK NVLFAVVWMQ GEFDFGGTPA NHAAQFGALV DKFRADLADM AGQCVGGSAG GVPWICGDTT YFWKQKNEST YQTVYGSYKN KTEKNIHFVP FMTDENGVNV PTNKPEEDPD IPGIGYYGSK WRDSSATWTS QDRASHFSSW ARRGIISDRL ATAILRHAGR VALNAGASST VSEVRPSSPS GAEATGVTTL LSYLASESEG SLKVQGWSAS GGRAEVVSDA EGTGGKAVKL TKEAGKSSWV LEYAAGNGAA LLQKGGQIRC RFKVSGALAA NQYVMAFYWP VSSLPQGVAL TGDGGNNLLA AFYIQTDAKD LNVMYHNAKV ATNNLKLGTF GAFDNEWHTL AFRFAGNNSL QVTPVIDGQD GTPFTLTQSP VSAFAADKLH VTDITRGATY PVLIDSIAVE VNSTDTAA
|
| |