Gene ECH74115_4713 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagECH74115_4713 
Symbol 
ID6971666 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli O157:H7 str. EC4115 
KingdomBacteria 
Replicon accessionNC_011353 
Strand
Start bp4352659 
End bp4354980 
Gene Length2322 bp 
Protein Length773 aa 
Translation table11 
GC content57% 
IMG OID643388414 
Productputative transcriptional accessory protein 
Protein accessionYP_002272842 
Protein GI209398543 
COG category[K] Transcription 
COG ID[COG2183] Transcriptional accessory protein 
TIGRFAM ID[TIGR00426] competence protein ComEA helix-hairpin-helix repeat region 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones68 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGAATG ATTCGTTCTG CCGCATTATT GCGGGTGAAA TTCAGGCGCG CCCGGAACAG 
GTTGACGCTG CCGTTCGCCT GCTTGACGAA GGGAATACCG TGCCTTTTAT TGCACGTTAT
CGTAAGGAAA TCACCGGCGG TCTGGATGAC ACGCAGCTGC GTAATCTGGA AACGCGTCTG
GGATACCTGC GCGAGCTGGA AGAGAGACGT CAGGCGATCC TCAAGTCCAT TTCCGAGCAA
GGCAAACTCA CCGATGATCT GGCGAATGCC ATCAACGCCA CCCTAAGCAA AACCGAACTC
GAAGACCTCT ACCTGCCCTA CAAACCTAAA CGCCGCACCC GCGGGCAAAT CGCCATTGAA
GCAGGGCTTG AGCCGTTGGC TGACCTGCTG TGGAGCGATC CGTCACACAC GCCAGAAGTC
GCCGCTGCGC AATATATTGA TGCCGATAAA GGCGTGGCAG ATACCAAAGC CGCGCTGGAC
GGCGCGCGCT ATATCCTGAT GGAACGGTTT GCCGAAGATG CCGCGCTGCT GGCGAAAGTG
CGTGATTATC TGTGGAAGAA CGCGCATTTG GTTTCTACGG TGGTGAGCGG TAAAGAAGAG
GAAGGGGCGA AATTCCGCGA CTATTTCGAT CATCACGAAC CGTTGTCCAC GGTGCCTTCT
CACCGCGCGC TGGCGATGTT CCGTGGGCGT AACGAAGGCG TACTTCAGCT TTCGCTGAAT
GCCGATCCAC AGTTCGATGA GCCGCCCAAA GAGAGCTATT GCGAGCAAAT CATCACGGAT
CACCTTGGCC TGCGCCTGAA CAATGCCCCG GCGGATAGCT GGCGCAAAGG CGTGGTGAGC
TGGACGTGGC GCATCAAGGT GCTGATGCAT CTGGAAACCG AACTGATGGG CACCGTGCGC
GAACGTGCGG AAAATGAAGC AATCAACGTC TTTGCCCGTA ACCTGCACGA TCTGCTGATG
GCTGCCCCTG CCGGACTGCG TGCAACGATG GGCCTCGATC CGGGTCTGCG TACCGGGGTA
AAAGTGGCGG TGGTCGATGC CACTGGCAAA CTGGTAGCGA CCGATACCAT TTACCCGCAC
ACCGGACAAG CCGCAAAAGC AGCGATGACT GTTGCTGCGC TGTGTGAAAA GCATAACGTT
GAACTGGTGG CGATCGGCAA CGGTACAGCT TCCCGCGAAA CCGAACGTTT CTATCTCGAC
GTGCAGAAGC AGTTCCCGAA AGTGACCGCA CAGAAAGTGA TCGTCAGCGA AGCTGGCGCG
TCGGTTTACT CGGCTTCCGA GCTGGCAGCG CAGGAGTTCC CGGATCTCGA CGTTTCGCTG
CGTGGCGCGG TTTCTATCGC CCGCCGTTTG CAGGATCCGC TGGCGGAGCT GGTGAAAATC
GATCCGAAAT CTATCGGCGT AGGTCAGTAT CAGCATGACG TCAGCCAGAC GCAACTGGCC
CGCAAACTGG ATGCAGTAGT AGAAGACTGC GTAAACGCCG TTGGCGTCGA TCTCAACACC
GCTTCTGTTC CGCTATTAAC TCGCGTGGCG GGCCTGACGC GAATGATGGC GCAAAACATC
GTGGCCTGGC GCGATGAGAA CGGCCAGTTC CAGAACCGTC AGCAACTGCT GAAAGTCAGC
CGTCTGGGGC CGAAAGCCTT CGAGCAGTGC GCGGGCTTCC TGCGCATTAA CCACGGTGAC
AACCCGCTGG ACGCCTCTAC CGTTCACCCG GAAGCCTATC CGGTGGTGGA ACGCATTCTG
GCAGCAACAC AGCAGGCACT GAAAGATCTG ATGGGTAACA GCAGCGAACT GCGTAACCTG
AAAGCGTCTG ACTTTACTGA TGAAAAATTC GGTGTGCCGA CAGTAACTGA CATCATCAAA
GAGCTGGAAA AACCGGGTCG CGATCCGCGT CCGGAATTTA AAACCGCTCA GTTTGCCGAT
GGCGTCGAGA CAATGAACGA CCTGCAACCG GGTATGATCC TCGAAGGCGC TGTCACCAAC
GTCACCAACT TTGGCGCGTT TGTCGATATC GGCGTGCATC AGGACGGCCT GGTTCACATC
TCTTCATTGT CGAACAAGTT TGTGGAAGAT CCGCATACCG TGGTGAAAGC GGGCGACATT
GTGAAGGTGA AAGTGCTGGA AGTGGATCTT CAGCGTAAAC GTATCGCCCT GACTATGCGT
CTGGACGAGC AGCCTGGCGA AACCAACTCT CGTCGCGGCG GCGGTAACGA TCGCCCGCAA
AACAACCGCC CGGCAGCCAA ACCACGCGGT CGTGAAGCGC AGCCTGCCGG TAACAGCGCG
ATGATGGACG CGCTGGCGGC AGCAATGGGC AAAAAACGTT AA
 
Protein sequence
MMNDSFCRII AGEIQARPEQ VDAAVRLLDE GNTVPFIARY RKEITGGLDD TQLRNLETRL 
GYLRELEERR QAILKSISEQ GKLTDDLANA INATLSKTEL EDLYLPYKPK RRTRGQIAIE
AGLEPLADLL WSDPSHTPEV AAAQYIDADK GVADTKAALD GARYILMERF AEDAALLAKV
RDYLWKNAHL VSTVVSGKEE EGAKFRDYFD HHEPLSTVPS HRALAMFRGR NEGVLQLSLN
ADPQFDEPPK ESYCEQIITD HLGLRLNNAP ADSWRKGVVS WTWRIKVLMH LETELMGTVR
ERAENEAINV FARNLHDLLM AAPAGLRATM GLDPGLRTGV KVAVVDATGK LVATDTIYPH
TGQAAKAAMT VAALCEKHNV ELVAIGNGTA SRETERFYLD VQKQFPKVTA QKVIVSEAGA
SVYSASELAA QEFPDLDVSL RGAVSIARRL QDPLAELVKI DPKSIGVGQY QHDVSQTQLA
RKLDAVVEDC VNAVGVDLNT ASVPLLTRVA GLTRMMAQNI VAWRDENGQF QNRQQLLKVS
RLGPKAFEQC AGFLRINHGD NPLDASTVHP EAYPVVERIL AATQQALKDL MGNSSELRNL
KASDFTDEKF GVPTVTDIIK ELEKPGRDPR PEFKTAQFAD GVETMNDLQP GMILEGAVTN
VTNFGAFVDI GVHQDGLVHI SSLSNKFVED PHTVVKAGDI VKVKVLEVDL QRKRIALTMR
LDEQPGETNS RRGGGNDRPQ NNRPAAKPRG REAQPAGNSA MMDALAAAMG KKR