Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_3843 |
Symbol | |
ID | 6968842 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 3566614 |
End bp | 3567837 |
Gene Length | 1224 bp |
Protein Length | 407 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643387626 |
Product | hypothetical protein |
Protein accession | YP_002272075 |
Protein GI | 209397410 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG2199] FOG: GGDEF domain |
TIGRFAM ID | [TIGR00254] diguanylate cyclase (GGDEF) domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 9 |
Plasmid unclonability p-value | 0.0156897 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 62 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATAACG ATAATTCTCT TAATAAGCGC CCCACGTTTA AAAGAGCATT ACGCAACATC AGTATGACCA GCATATTTAT CACTATGATG CTGATCTGGT TGCTGCTTTC CGTGACCTCG GTGCTGCCCC TGAAACAGTA CGCGCAAAAA AACCTGGCAC TGACAGCAGC AACAATGACT TACAGTCTGG AAGCAGCTGT CGTTTTTGCC GATGGCCCTG CAGCAACTGA AACACTGGCA GCGCTGGGCC AGCAAGGGCA ATTTTCAACT GCAGAAGTAC GTGATAAGCA GCAAAATATT CTGGCATCCT GGCATTACAC CCGTAAGGAT CCAGGCGATA CTTTCAGCAA TTTCATAAGC CACTGGCTCT TCCCTGCCCC CATCATTCAG CCGATTCGTC ACAATGGTGA AACCATTGGC GAAGTACGCT TAACCGCTCG CGACAGTTCA ATCAGCCATT TTATCTGGTT TTCGCTCGCC GTACTGACCG GTTGTATTCT GCTGGCATCA GGAATCGCAA TTACCCTCAC CCGCCATTTG CACAATGGCC TGGTGGAAGC ACTGAAAAAT ATCACCGATG TCGTACATGA TGTGCGTTCC AACCGCAATT TTTCCCGACG AGTTTCGGAA GAACGTATCG CTGAGTTTCA CCGCTTCGCT CTCGACTTCA ACAGTCTGCT GGATGAAATG GAAGAGTGGC AGCTTCGTTT ACAGGCCAAA AATGCGCAGC TTCTACGTAC CGCGCTACAT GACCCATTAA CCGGGCTGGC TAACCGCGCA GCGTTTCGTA GCGGCATCAA TACGTTGATG AACAATTCCG CTGCCCGAAA AACGTCGGCG TTACTATTTC TTGATGGCGA TAATTTCAAA TATATCAATG ATACCTGGGG TCATGCGACG GGCGATAGAG TCTTGATTGA AATCGCAAAA CGGTTAGCTG AATTTGGCGG GCTGCGACAT AAAGCATACC GCCTGGGCGG CGATGAATTC GCTATGGTGC TCTATGGTGT ACAGTCGGAA TCTGAAGTGC AGCAGATATG CTCAGCACTG ACACAAATCT TTAATCTCCC GTTTGATCTT CATAATGGCC ATCAGACCAC CATGACATTA AGCATTGGTT ACGCAATGAC CATTGCGCAC GCTTCTGCGG AAAAATTACA AGAGCTGGCC GATCACAATA TGTATCAGGC CAAACACCAG CGTGCCGAAA AGCTGGTGAG ATAA
|
Protein sequence | MDNDNSLNKR PTFKRALRNI SMTSIFITMM LIWLLLSVTS VLPLKQYAQK NLALTAATMT YSLEAAVVFA DGPAATETLA ALGQQGQFST AEVRDKQQNI LASWHYTRKD PGDTFSNFIS HWLFPAPIIQ PIRHNGETIG EVRLTARDSS ISHFIWFSLA VLTGCILLAS GIAITLTRHL HNGLVEALKN ITDVVHDVRS NRNFSRRVSE ERIAEFHRFA LDFNSLLDEM EEWQLRLQAK NAQLLRTALH DPLTGLANRA AFRSGINTLM NNSAARKTSA LLFLDGDNFK YINDTWGHAT GDRVLIEIAK RLAEFGGLRH KAYRLGGDEF AMVLYGVQSE SEVQQICSAL TQIFNLPFDL HNGHQTTMTL SIGYAMTIAH ASAEKLQELA DHNMYQAKHQ RAEKLVR
|
| |