Gene Information |
Locus tag | ECH74115_4713 |
Symbol | |
ID | 6971666 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4352659 |
End bp | 4354980 |
Gene Length | 2322 bp |
Protein Length | 773 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643388414 |
Product | putative transcriptional accessory protein |
Protein accession | YP_002272842 |
Protein GI | 209398543 |
COG category | [K] Transcription |
COG ID | [COG2183] Transcriptional accessory protein |
TIGRFAM ID | [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region |
|
|
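The coordinate and length fields in the table above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. Both assumptions agree with the listed values but are not stated on the record itself. The function names are illustrative only.

```python
# Values copied from the Gene Information table.
START_BP = 4352659
END_BP = 4354980


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print(length, protein_length(length))  # 2322 773
```

The result matches the Gene Length (2322 bp) and Protein Length (773 aa) fields.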
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 68 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATGAATG ATTCGTTCTG CCGCATTATT GCGGGTGAAA TTCAGGCGCG CCCGGAACAG GTTGACGCTG CCGTTCGCCT GCTTGACGAA GGGAATACCG TGCCTTTTAT TGCACGTTAT CGTAAGGAAA TCACCGGCGG TCTGGATGAC ACGCAGCTGC GTAATCTGGA AACGCGTCTG GGATACCTGC GCGAGCTGGA AGAGAGACGT CAGGCGATCC TCAAGTCCAT TTCCGAGCAA GGCAAACTCA CCGATGATCT GGCGAATGCC ATCAACGCCA CCCTAAGCAA AACCGAACTC GAAGACCTCT ACCTGCCCTA CAAACCTAAA CGCCGCACCC GCGGGCAAAT CGCCATTGAA GCAGGGCTTG AGCCGTTGGC TGACCTGCTG TGGAGCGATC CGTCACACAC GCCAGAAGTC GCCGCTGCGC AATATATTGA TGCCGATAAA GGCGTGGCAG ATACCAAAGC CGCGCTGGAC GGCGCGCGCT ATATCCTGAT GGAACGGTTT GCCGAAGATG CCGCGCTGCT GGCGAAAGTG CGTGATTATC TGTGGAAGAA CGCGCATTTG GTTTCTACGG TGGTGAGCGG TAAAGAAGAG GAAGGGGCGA AATTCCGCGA CTATTTCGAT CATCACGAAC CGTTGTCCAC GGTGCCTTCT CACCGCGCGC TGGCGATGTT CCGTGGGCGT AACGAAGGCG TACTTCAGCT TTCGCTGAAT GCCGATCCAC AGTTCGATGA GCCGCCCAAA GAGAGCTATT GCGAGCAAAT CATCACGGAT CACCTTGGCC TGCGCCTGAA CAATGCCCCG GCGGATAGCT GGCGCAAAGG CGTGGTGAGC TGGACGTGGC GCATCAAGGT GCTGATGCAT CTGGAAACCG AACTGATGGG CACCGTGCGC GAACGTGCGG AAAATGAAGC AATCAACGTC TTTGCCCGTA ACCTGCACGA TCTGCTGATG GCTGCCCCTG CCGGACTGCG TGCAACGATG GGCCTCGATC CGGGTCTGCG TACCGGGGTA AAAGTGGCGG TGGTCGATGC CACTGGCAAA CTGGTAGCGA CCGATACCAT TTACCCGCAC ACCGGACAAG CCGCAAAAGC AGCGATGACT GTTGCTGCGC TGTGTGAAAA GCATAACGTT GAACTGGTGG CGATCGGCAA CGGTACAGCT TCCCGCGAAA CCGAACGTTT CTATCTCGAC GTGCAGAAGC AGTTCCCGAA AGTGACCGCA CAGAAAGTGA TCGTCAGCGA AGCTGGCGCG TCGGTTTACT CGGCTTCCGA GCTGGCAGCG CAGGAGTTCC CGGATCTCGA CGTTTCGCTG CGTGGCGCGG TTTCTATCGC CCGCCGTTTG CAGGATCCGC TGGCGGAGCT GGTGAAAATC GATCCGAAAT CTATCGGCGT AGGTCAGTAT CAGCATGACG TCAGCCAGAC GCAACTGGCC CGCAAACTGG ATGCAGTAGT AGAAGACTGC GTAAACGCCG TTGGCGTCGA TCTCAACACC GCTTCTGTTC CGCTATTAAC TCGCGTGGCG GGCCTGACGC GAATGATGGC GCAAAACATC GTGGCCTGGC GCGATGAGAA CGGCCAGTTC CAGAACCGTC AGCAACTGCT GAAAGTCAGC CGTCTGGGGC CGAAAGCCTT CGAGCAGTGC GCGGGCTTCC TGCGCATTAA CCACGGTGAC AACCCGCTGG ACGCCTCTAC CGTTCACCCG GAAGCCTATC CGGTGGTGGA ACGCATTCTG GCAGCAACAC AGCAGGCACT GAAAGATCTG ATGGGTAACA GCAGCGAACT GCGTAACCTG 
AAAGCGTCTG ACTTTACTGA TGAAAAATTC GGTGTGCCGA CAGTAACTGA CATCATCAAA GAGCTGGAAA AACCGGGTCG CGATCCGCGT CCGGAATTTA AAACCGCTCA GTTTGCCGAT GGCGTCGAGA CAATGAACGA CCTGCAACCG GGTATGATCC TCGAAGGCGC TGTCACCAAC GTCACCAACT TTGGCGCGTT TGTCGATATC GGCGTGCATC AGGACGGCCT GGTTCACATC TCTTCATTGT CGAACAAGTT TGTGGAAGAT CCGCATACCG TGGTGAAAGC GGGCGACATT GTGAAGGTGA AAGTGCTGGA AGTGGATCTT CAGCGTAAAC GTATCGCCCT GACTATGCGT CTGGACGAGC AGCCTGGCGA AACCAACTCT CGTCGCGGCG GCGGTAACGA TCGCCCGCAA AACAACCGCC CGGCAGCCAA ACCACGCGGT CGTGAAGCGC AGCCTGCCGG TAACAGCGCG ATGATGGACG CGCTGGCGGC AGCAATGGGC AAAAAACGTT AA
|
Protein sequence | MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL GYLRELEERR QAILKSISEQ GKLTDDLANA INATLSKTEL EDLYLPYKPK RRTRGQIAIE AGLEPLADLL WSDPSHTPEV AAAQYIDADK GVADTKAALD GARYILMERF AEDAALLAKV RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGVLQLSLN ADPQFDEPPK ESYCEQIITD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR ERAENEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR LDEQPGETNS RRGGGNDRPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR
|
| |
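The gene sequence above is displayed in space-separated blocks of 10 bases, so it has to be cleaned before any computation. The following is a minimal sketch of how the blocks could be joined, the GC fraction computed, and the sequence translated with the amino acid assignments of translation table 11. The helper names are hypothetical. The sketch maps every codon through the table and does not handle the alternative start codons that table 11 permits, which is sufficient here because this gene starts with ATG.

```python
from itertools import product

# Codon-to-residue assignments in the NCBI base order T, C, A, G.
# Table 11 uses the same assignments as the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used for display."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from this record.
prefix = "ATGATGAATG ATTCGTTCTG CCGCATTATT"
print(translate(prefix))  # MMNDSFCRII
```

The translated prefix matches the first block of the protein sequence listed above. Passing the full gene sequence to `gc_fraction` would give the value to compare against the GC content field.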