Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3214 |
Symbol | |
ID | 6971663 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2961641 |
End bp | 2963587 |
Gene Length | 1947 bp |
Protein Length | 648 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 643387033 |
Product | hypothetical protein |
Protein accession | YP_002271500 |
Protein GI | 209398441 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 38 |
Fosmid unclonability p-value | 0.0279968 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACATTTA AACATTATGA TGTTGTCAGG GCGGCGTCGC CGTCAGACCT TGCTGATGCA CTTGCGCAAA AAATTCGTGA AGGATGGCAA CCATACGGTG GGCCGTTTTC TTCGTATACG GATGATGGCG CAGCACTTAT TCAGGCGATT GTCGCAGAAG GTGATGTGAG CACACCTGTT GTGGTGAAGC TGACAGGTGG AGAAGGTGCA GTAATCAGCG CCACCAGAGA CCCGGAGTAT TACTTTATTG TGGTTCTGGC GGGGCAGTCA AACAGCATGG CATATGGTGA AGGCCTTCCG CTGCCGGAGA CATATGACCG TCCGGACCCG CGTATTAAGC AGCTGGCGCG CCGCAGTACG GTGACACCGG GCGGTGTCGC CTGTAAATAT AACGACATCA TTCCGGCGGA CCATTGTCTG CATGATGTGC AGGACATGAG CCGCCTTAAC CATCCGAAAG CGGACCTGTC AAAGGGGCAG TACGGAACCG TGGGGCAGGG GCTGCATATC GCCAAAAAAT TGCTGCCGTT TATACCGGCG AATGCGGGCA TTCTGCTGGT TCCGTGCTGT CGTGGTGGTT CAGCGTTCAC CACCGGAGCT GATGGCACAT ACAGTGACGC GAGTGGTGCT TCGGAGAATT CAACCCGCTG GGGTGTGGAC AAGCCGCTGT ATAAGGACCT TATCGGTCGA ACAAAAGCAG CACTGAAGAA GAACCCGAAA AATGTGCTGT TTGCCGTGGT GTGGATGCAG GGGGAATTTG ATTTTGGCGG TACGCCGGCA AATCACGCAG CACAGTTTGG TGCGCTGGTT GATAAATTCC GTGCAGACCT GGCGGATATG GCAGGTCAGT GCGTCGGTGG CTCTGCTGAC GGTGTTCCCT GGATATGCGG GGACACGACG TATTTCTGGA AGCAGAAGAA CGAAGCCACC TACCAGACGG TGTACGGCAG CTACAAAAAC AAAACGGAAA AGAATATCCA TTTCGTACCG TTCATGACCG ATGAGAACGG GGTGAATGTG CCGACGAACA AACCGGAAGA AGACCCGGAC ATTCCGGGTA TCGGATATTA CGGTTCGAAA TGGCGTGACA GCTCAGCCAC CTGGACGTCA CAGGACAGGG CGAGCCATTT CAGTTCATGG GCTCGCCGCG GGATTATTTC CGACCGTCTG GCAACGGCGA TTTTGCGCCA TGCGGGAAGA GTGGCGCTAA ACGCGGGGGC ATCATCGACA GTATCAGAGG TGCGCCCGTC ATCGCCTTCC GGTGCAGAAG CCACAGGCGT CACAACACTG CTCTCTTACC TTGCCAGCGA GTCAGAGGGA AGCCTGAAAG TACAGGGATG GTCAGCCAGT GGCGGCAGGG CAGAAGTGGT CAGCGATGCG GAGGGAACCG GAGGTAAGGC AGTGAAGCTG ACCAAGGAAG CCGGTAAAAG CAGCTGGGTG CTGGAGTACG CCGCGGGCAA CGGTGCGGCT CTGTTACAGA AAGGGGGGCA GATTCGCTGC CGCTTTAAGG TTTCGGGAGC GCTGGCTGCG AACCAGTATG TTATGGCGTT TTACTGGCCG GTATCTTCAC TGCCACAGGG CGTTGCCCTG ACCGGAGACG GGGGGAATAA CCTGCTGGCA GCGTTCTACA TCCAGACAGA TGCAAAAGAC CTGAATGTGA TGTACCACAA TGCGAAAGTG GCGACAAACA ACCTGAAACT GGGAACCTTT GGCGCATTTG ATAACGAATG GCATACGCTG GCTTTCCGCT TTGCCGGGAA TAACAGCCTG CAGGTGACGC CGGTTATTGA TGGTCAGGAT GGCACACCGT TCACGCTGAC GCAGTCACCG GTCAGTGCCT TTGCGGCGGA TAAACTGCAT GTGACAGACA TTACCAGAGG TGCGACTTAC CCGGTACTGA TAGACAGCAT TGCGGTGGAA GTGAACAGCA CAGACACTGC GGCATGA
|
Protein sequence | MTFKHYDVVR AASPSDLADA LAQKIREGWQ PYGGPFSSYT DDGAALIQAI VAEGDVSTPV VVKLTGGEGA VISATRDPEY YFIVVLAGQS NSMAYGEGLP LPETYDRPDP RIKQLARRST VTPGGVACKY NDIIPADHCL HDVQDMSRLN HPKADLSKGQ YGTVGQGLHI AKKLLPFIPA NAGILLVPCC RGGSAFTTGA DGTYSDASGA SENSTRWGVD KPLYKDLIGR TKAALKKNPK NVLFAVVWMQ GEFDFGGTPA NHAAQFGALV DKFRADLADM AGQCVGGSAD GVPWICGDTT YFWKQKNEAT YQTVYGSYKN KTEKNIHFVP FMTDENGVNV PTNKPEEDPD IPGIGYYGSK WRDSSATWTS QDRASHFSSW ARRGIISDRL ATAILRHAGR VALNAGASST VSEVRPSSPS GAEATGVTTL LSYLASESEG SLKVQGWSAS GGRAEVVSDA EGTGGKAVKL TKEAGKSSWV LEYAAGNGAA LLQKGGQIRC RFKVSGALAA NQYVMAFYWP VSSLPQGVAL TGDGGNNLLA AFYIQTDAKD LNVMYHNAKV ATNNLKLGTF GAFDNEWHTL AFRFAGNNSL QVTPVIDGQD GTPFTLTQSP VSAFAADKLH VTDITRGATY PVLIDSIAVE VNSTDTAA
|
| |