Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_5603 |
Symbol | |
ID | 6967248 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 5241654 |
End bp | 5243924 |
Gene Length | 2271 bp |
Protein Length | 756 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 643389239 |
Product | hypothetical protein |
Protein accession | YP_002273636 |
Protein GI | 209400726 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG4191] Signal transduction histidine kinase regulating C4-dicarboxylate transport system |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 55 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGCGCACC CCGGACGTCA CTTTTTTGCC AGCGCGCGCG GGCGATTGCT GCTTTTGAAT TTGCTGGTGG TGGCGGTGAC GTTGATGGTC AGCGGCGTGG CGGTAATGGG CTTTCGTCAT GCCAGCCAGA TGCAGGAGCT GGTGCAGCAG CAAACGGTGG ATGATATGAC CGGCAGCCTG AATCTGGCGC GCGACACGGC AAACGTGGCA ACGGCGGCGG TGCGATTGTC GCAGGTGGTG GGAGCGCTGG AATATAAAGG CGAAGCCGAG CGGTTGCAGG AGACACAGCG GGCATTAAAA AGCTCGCTTG CACAACTGGC GAACGCGCCT TTAGCGCAGC AGGAAGCCGG GCTGGTGACG CGAATCATCA CGCGTAGCAA CGAACTGCAA ACCAGCGTCG GCGGTATGCT GGAGCGCGGG CAAAGGCGGC ATCTGGAGCG CAACGCGCTG TTAAGTTCGC TGTATCAAAA CTTAAGTTAT CTGCGGCATC TGCAAAAAGT CACTCACGCG CAAGATGATA TTTTGCTTAA CGAGATGAAC CGACTGATTG TTGCCGCTAT TGCCACCCCC GCGCCGCAGG CGATTATTCA TCAACTGGTC GGGGTAATGT CTGCGTTGCC GACCCACAGC GATACGCCAT TGGTTAATAC GTTACTCAAT GATTTCAATG ATTTCAATGA TGAAATGCGT AAACTCGCGC CGCTTTCTGC CGCTCTGGAG CAAAGCGATC TGGCAATCAG CTGGTATATG TTTCACATCA AAGCGCTGGT GGCGATCCTC AATGGCGACA TCAACCGTTA CGCCGAACAG GTGGCGACGC TCTCCGGTCA GCGTGTGGCG CAAAGTTACC AGGATTTGCG CTCCGGCGAG AGGGTGATCA TGATTTTCGC CCTGTTAGCG GTGGTGATCA CCGCGTTTGC CGGCTGGTAT ATCTATCGCA ACCTCGGCAC TAACCTGACG GCGATTTCCC GCGCCATGAG CCGCCTGGCG CACGGCGAAG CGGACGTCAC TTTTCCGGCG TTACAGCGGC GTGATGAGTT GGGTGAACTG GCGCGCGCCT TTAACGTTTT TGCCCGCAAT ACCGCGTCAC TCGAACGAAC CTCACGGCTG CTAAAAGAGA AGACCACGCA GATGGAGATT GCCCAAACTG AGCGGCAGGG GTTGGAAGAG GCGCTATTAC ACAGCCAGAA ACTGAAAGCT GTCGGGCAAC TGACGGGCGG GCTGGCCCAT GATTTTAATA ATCTGCTGGC GGTGATTATT GGCAGCCTGG AGCTGGTGAA TCCGGACTCG CCAGACGCGC CGCGAATACA GCGGGCGCTG CAAGCTGCCG AACGTGGGGC GCTACTGACA CAACGACTGC TGGCGTTTTC CCGCAAGCAG TCCTTACATC CGCAAGCCGT TGAACTGAAA ACATTGCTGG AAGAGTTAAG CGAATTAATG CGCCACTCCT TACCGGCAAC GCTGGCGCTG GAGATTGAAG CGCAGTCTCC GGCGTGGCCC GCGTGGATTG ACATCAGCCA ACTGGAAAAT GCCATCATCA ACCTGGTGAT GAACGCCCGT GACGCGATGG AAGGGCAGAC GGGAATCATC AAAATTCGTA CCTGGAATCA GCGCGTTACT CGCAGCAGTG GGCAGCGTCA GGACATGGTG GCACTGGAGG TCATCGATCA TGGCTGCGGT ATGTCGCAGG CGGTGAAAAG CCAGGTATTT GAGCCATTCT TTACCACCAA ACAGACGGGC AGCGGCAGCG GGCTGGGTTT GTCGATGGTC TATGGTTTTG TGCGCCAGTC CGGTGGGCGC GTTGAAATTG AAAGCGCGCC GGGGCAGGGG ACAACGGTCA GATTACAGCT TCCCCGCGCG CTGACGGCGG CGAAAAACCT ATCGCCAGCG GCAGTAGAAC AGGCTTCTGT CAGCGGCGAT AAACTGGTGT TAGTGCTGGA AGATGAAGCC GCGGTGCGCC AGACCATCTG CGAACAGTTG CACCTGCTGG GCTATTTGAC GCTGGAAGCA AGCAGCGGCG AACAGGCGCT GGATCTGCTG GCGGCATCAG CAGAAATCGA TATTTTTATC AGCGATTTAA TGCTCCCCGG CGGCATGAGC GGCGCAGAAG TGGTCAATGC GGCCCGCAAA CTTTACCCGC ACCTGACGCT GCTACTCATC AGCGGTCAGG ATTTACGCCC CAGCCATAAC CCCGCGCTGC CAGACGTGGC GCTGTTGCGC AAACCCTTTA CCCGTGCGCA GTTGGCGCAG GCGCTGGGGC AAGAGAATTG A
|
Protein sequence | MAHPGRHFFA SARGRLLLLN LLVVAVTLMV SGVAVMGFRH ASQMQELVQQ QTVDDMTGSL NLARDTANVA TAAVRLSQVV GALEYKGEAE RLQETQRALK SSLAQLANAP LAQQEAGLVT RIITRSNELQ TSVGGMLERG QRRHLERNAL LSSLYQNLSY LRHLQKVTHA QDDILLNEMN RLIVAAIATP APQAIIHQLV GVMSALPTHS DTPLVNTLLN DFNDFNDEMR KLAPLSAALE QSDLAISWYM FHIKALVAIL NGDINRYAEQ VATLSGQRVA QSYQDLRSGE RVIMIFALLA VVITAFAGWY IYRNLGTNLT AISRAMSRLA HGEADVTFPA LQRRDELGEL ARAFNVFARN TASLERTSRL LKEKTTQMEI AQTERQGLEE ALLHSQKLKA VGQLTGGLAH DFNNLLAVII GSLELVNPDS PDAPRIQRAL QAAERGALLT QRLLAFSRKQ SLHPQAVELK TLLEELSELM RHSLPATLAL EIEAQSPAWP AWIDISQLEN AIINLVMNAR DAMEGQTGII KIRTWNQRVT RSSGQRQDMV ALEVIDHGCG MSQAVKSQVF EPFFTTKQTG SGSGLGLSMV YGFVRQSGGR VEIESAPGQG TTVRLQLPRA LTAAKNLSPA AVEQASVSGD KLVLVLEDEA AVRQTICEQL HLLGYLTLEA SSGEQALDLL AASAEIDIFI SDLMLPGGMS GAEVVNAARK LYPHLTLLLI SGQDLRPSHN PALPDVALLR KPFTRAQLAQ ALGQEN
|
| |