Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_0241 |
Symbol | |
ID | 6970626 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | - |
Start bp | 256511 |
End bp | 257791 |
Gene Length | 1281 bp |
Protein Length | 426 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 643384312 |
Product | hypothetical protein |
Protein accession | YP_002268828 |
Protein GI | 209400996 |
COG category | [T] Signal transduction mechanisms |
COG ID | [COG3456] Uncharacterized conserved protein, contains FHA domain |
TIGRFAM ID | [TIGR03354] type VI secretion system FHA domain protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 54 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGAAG AGAAACTACA GACGCTTTCA TTGCAGGTCA TTAACGGCAG TGAGCTGGAA AGCGGGCGGG CGGCGCGCTG TCTGTTCACA CAGCAGGGAA ATGTCGGCCA TGCCCCCGAA TGCCACTGGT CGGTACAGGA TCGTCAGCAG AGCATTCCGG CCCAGGCTTT TACCGTTATC CTGCACGATG GCACATTTTG TCTACGCCCG CAGACGGCAC AACTGTGGCT GAATCAGGCA AAAGTCACAG CAACATCAGA CCTGATACAG TTGCGCCAGG GCGATGAGAT CCAGATCGGA CGGCTGATGG TGAGGGTTCA TCTGAACCGG GGAGATATTC CCCATTACGA TGAGGAAATG GCCACTCCCG AAACCATCGT TACCAATCGC GATATGCTCA CGGATACCCT GCTATCAACG GAGGGTGCGC CACACTATCC GGGAATGACT CACCGGCACC AGCTTGCAGA CACCGTGGTA AATGGTTTTT CTGCCGATCC ACTCCAGGCA CTTCAGTCCG AAAGCCTGAT TACCACGGGC GATCCGCTTT CAGGCATTGC GGCTGTCCGG CCATCGGCAC CGCTGTCCGA TCCGGCAAGT AATGGGGGGA TCAATACTCC GTTTATGGAT CTGCCGCCCA TTTATGCCAG CCCTGGCGAT CATAATGATG ACATCTCTGC GGCAGAAATG GCGCAACGCC ACCTTGCGGT CACCCCCTTA CTGCGCGGTC TTGGCGGCTC GCTTACCGTG AGCAATTCCG ACGATGCGGA TGATTTTCTG GAGGAGGCCG GACGAACGTT ACAGGCCGCA ATAAAAGGTC TGCTCGATTT GCAGCAGCAG CGTAACAGCC TCTCAGACAA ACATTTGCGC CCGCTGGAAG ATAACCCGCT GCGCCTGAAC ATGGATTACG CCACCGCGCT CGACGTGATG TTTGCCGAAG GTAAAAGCCC GGTACATCTG GCGGCTCCCG CCGCCGTCAG TGAAAGCCTG CGCAATGTCC GCCACCACGA AGAAGCTAAC CGGGCGGCGA TTGTGGAGTC GCTTCGTGTC CTGCTGGATG CTTTCTCACC ACAAAATCTG CTGCGCCGCT TTGTGCAGTA CCGCCGCAGC CATGAACTGC GCCAGCCGCT GGATGATGCC GGAGCATGGC AAATGTACAG CCATTATTAC GAAGAACTGG CCTCCGATCG CCAGCAGGGG TTTGAGATGC TGTTTAACGA GGTCTACGCC CAGGTCTATG ACCGGGTGCT TCGTGAAAAA CAGCGGGAGC CGGAAGCATG A
|
Protein sequence | MPEEKLQTLS LQVINGSELE SGRAARCLFT QQGNVGHAPE CHWSVQDRQQ SIPAQAFTVI LHDGTFCLRP QTAQLWLNQA KVTATSDLIQ LRQGDEIQIG RLMVRVHLNR GDIPHYDEEM ATPETIVTNR DMLTDTLLST EGAPHYPGMT HRHQLADTVV NGFSADPLQA LQSESLITTG DPLSGIAAVR PSAPLSDPAS NGGINTPFMD LPPIYASPGD HNDDISAAEM AQRHLAVTPL LRGLGGSLTV SNSDDADDFL EEAGRTLQAA IKGLLDLQQQ RNSLSDKHLR PLEDNPLRLN MDYATALDVM FAEGKSPVHL AAPAAVSESL RNVRHHEEAN RAAIVESLRV LLDAFSPQNL LRRFVQYRRS HELRQPLDDA GAWQMYSHYY EELASDRQQG FEMLFNEVYA QVYDRVLREK QREPEA
|
| |