Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_2904 |
Symbol | |
ID | 6971326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 2693401 |
End bp | 2695338 |
Gene Length | 1938 bp |
Protein Length | 645 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643386748 |
Product | hypothetical protein |
Protein accession | YP_002271219 |
Protein GI | 209398353 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.148641 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 17 |
Fosmid unclonability p-value | 0.0000000129832 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACATTTA AACACTACGA TGTGGTCAGG GCGGCATCGC CGTCAGACCT TGCTGAACGA CTGACACAAA AACTGAAGGA GGGCTGGCAG CCATTTGGCA GTCCGGTGGC CATCACGCCT TACACCCTGA TGCAGGCCAT TGCGGCGGAA GGTGATGTCA CCACACCAGT GGCGGTGACC GGTAATGAGG GTAAGGCGGT GGCTGTCAGT GCCACCAGAG CCCCGGAGTA TTACTTTGTT GTGGTTCTGG CAGGGCAGTC AAACGGCATG TCGTATGGTG AAGGTCTTCC GCTGCCGGAG ACATATGACC GTCCGGAGCC GCGTATTAAG CAACTGGCGC GTCGCAGTAC GGTGACACCG GGTGGTGCAG CATGCAGATA TAACGACATC ATTCCGGCGG ACCATTGTCT GCATGATGTG CAGGACATGA GCCGCCTTAA CCATCCGAAA GCGGACCTGT CAAAGGGGCA GTACGGAACC GTGGGGCAGG GGCTGCATAT CGCCAAAAAA TTGCTGCCGT TTATACCGGC GAATGCGGGC ATTCTGCTGG TTCCGTGCTG TCGTGGTGGT TCAGCGTTCA CCACCGGAGC TGATGGCACA TACAGTGACG CGAGTGGTGC TTCGGAGAAT TCAACCCGCT GGGGTGTGGA CAAGCCGCTG TATAAGGACC TTATCGGTCG AACAAAAGCA GCACTGAAGA AGAACCCGAA AAATGTGCTG TTTGCCGTGG TGTGGATGCA GGGGGAATTT GATTTTGGCG GTACGCCGGC AAATCACGCA GCACAGTTTG GTGCGCTGGT TGATAAATTC CGTGCAGACC TGGCGGATAT GGCAGGTCAG TGCGTCGGTG GCTCTGCTGA CGGTGTTCCC TGGATATGCG GGGACACGAC GTATTTCTGG AAGCAGAAGA ACGAAGCCAC CTACCAGACG GTGTACGGCA GCTACAAAAA CAAAACGGAA AAAAATATCC ATTTCGTACC GTTCATGACG GATGAGAACG GGGTGAATGT GCCGACGAAC AAACCGGAAG AAGACCCGGA CATTCCGGGT ATCGGATATT ACGGTTCGAA ATGGCGTGAC AGCTCAGCCA CCTGGACGTC ACAGGACAGG GCGAGCCATT TCAGCGCCTG GGCACGCCGT GGGATTATTT CCGACCGTCT GGCAACGGCG ATTTTGCGCC ATGCGGGAAG AGTGGCGCTA AACGCGGGGG CATCATCGAC AGTATCAGAG GTGCGCCCGT CATCGCCTTC CGGTGCAGAA GCCACAGGCA TCACAACACT GCTCTCTTAC CTTGCCAGCG AGTCAGAGGG AAGCCTGAAA GTACAGGGAT GGTCAGCCAG TGGCGGCAGG GCAGAAGTGG TCAGCGATGC GGAGGGAACC GGAGGTAAGG CAGTGAAGCT GACCAAGGAA GCCGGTAAAA GCAGCTGGGT GCTGGAGTAC GCCGCGGGCA ACGGTGCGGC TCTGTTACAG AAAGGGGGGC AGATTCGCTG CCGCTTTAAG GTTTCGGGAG CGCTGGCTGC GAACCAGTAT GTTATGGCGT TTTACTGGCC GGTATCTTCA CTGCCACAGG GCGTTGCCCT GACCGGAGAC GGGGGGAATA ACCTGCTGGC AGCGTTCTAC ATCCAGACAG ATGCAAAAGA CCTGAATGTG ATGTACCACA ATGCGAAAGT GGCGACAAAC AACCTGAAAC TGGGAAGCTT TGGCGCATTT GATAACGAAT GGCATGCGCT GGCTTTCCGC TTTGCCGGGA ATAACAGCCT TCAGGTGACG CCGGTTATTG ATGGTCAGGA TGGTACACCG TTCACGCTGA CGCAGTCACC GGTCAGTGCC TTTGCGGCGG ATAAACTGCA TGTGACAGAC ATTACCAGGA ATGCGACTTA CCCGGTGCTG ATTGACAGCA TTGCGGTGGA AGTGAACAGC ACAGACACTG CGGCATGA
|
Protein sequence | MTFKHYDVVR AASPSDLAER LTQKLKEGWQ PFGSPVAITP YTLMQAIAAE GDVTTPVAVT GNEGKAVAVS ATRAPEYYFV VVLAGQSNGM SYGEGLPLPE TYDRPEPRIK QLARRSTVTP GGAACRYNDI IPADHCLHDV QDMSRLNHPK ADLSKGQYGT VGQGLHIAKK LLPFIPANAG ILLVPCCRGG SAFTTGADGT YSDASGASEN STRWGVDKPL YKDLIGRTKA ALKKNPKNVL FAVVWMQGEF DFGGTPANHA AQFGALVDKF RADLADMAGQ CVGGSADGVP WICGDTTYFW KQKNEATYQT VYGSYKNKTE KNIHFVPFMT DENGVNVPTN KPEEDPDIPG IGYYGSKWRD SSATWTSQDR ASHFSAWARR GIISDRLATA ILRHAGRVAL NAGASSTVSE VRPSSPSGAE ATGITTLLSY LASESEGSLK VQGWSASGGR AEVVSDAEGT GGKAVKLTKE AGKSSWVLEY AAGNGAALLQ KGGQIRCRFK VSGALAANQY VMAFYWPVSS LPQGVALTGD GGNNLLAAFY IQTDAKDLNV MYHNAKVATN NLKLGSFGAF DNEWHALAFR FAGNNSLQVT PVIDGQDGTP FTLTQSPVSA FAADKLHVTD ITRNATYPVL IDSIAVEVNS TDTAA
|
| |