Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_1434 |
Symbol | |
ID | 6968986 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 1419534 |
End bp | 1420586 |
Gene Length | 1053 bp |
Protein Length | 350 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 643385407 |
Product | hypothetical protein |
Protein accession | YP_002269901 |
Protein GI | 209397534 |
COG category | [R] General function prediction only |
COG ID | [COG1054] Predicted sulfurtransferase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.782992 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 47 |
Fosmid unclonability p-value | 0.247689 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCAGTGT TACACAACCG CATTTCCAAC GACGCGTTAA AAGCCAAAAT GTTGGCTGAG AGCGAACCGC GAACCACCAT TTCGTTTTAC AAGTATTTCC ACATCGCCGA TCCTAAGGCG ACCCGTGACG CTTTATATCA GCTGTTTACC GCGCTGAATG TTTTTGGGCG AGTGTATCTG GCGCATGAGG GCATTAACGC GCAAATCAGC GTACCTGCGA GCAATGTTAA AACATTTCGC GCGCAGCTCT ATGCCTTCGA CTCGGCTTTA GATGGCTTAC GCCTGAATAT CGCGTTGGAT GATGACGGGA AATCCTTCTG GGTACTGCGC ATGAAGGTCC GCGATCGTAT CGTTGCCGAC GGTATTGACG ATCCTCACTT TGATGCCAGC AATGTTGGTG AGTATCTGCA AGCGGCGGAA GTGAACGCCA TGCTTGACGA TCCCGATGCA CTGTTTATCG ACATGCGTAA CCACTATGAG TATGAAGTGG GGCACTTTGA AAACGCGCTG GAAATTCCGG CAGATACCTT CCGTGAGCAG CTGCCAAAAG CAGTCGAGAT GATGCAGGCA CATAAAGATA AAAAAATCGT CATGTACTGC ACCGGCGGCA TTCGTTGTGA AAAGGCCAGT GCCTGGATGA AACATAACGG ATTCAATAAA GTCTGGCATA TCGAGGGTGG AATTATTGAA TACGCCCGTA AGGCGCGCGA GCAGGGCTTG CCGGTGCGTT TTATTGGCAA AAATTTTGTT TTTGACGAGC GGATGGGCGA ACGTATATCT GATGAGATTA TCGCGCATTG CCACCAGTGC GGTGCGCCGT GCGACAGCCA TACCAACTGT AAAAATGATG GCTGCCACCT GCTGTTTATT CAGTGTCCAG TATGTGCGGA AAAATACAAA GGTTGTTGTA GTGAAATTTG CTGCGAAGAA AGCGCGTTAC CGCCAGAAGA ACAGCGACGC CGTCGGGCAG GACGTGAAAA TGGCAATAAG ATCTTTAATA AGTCTCGTGG ACGTCTGAAT ACAACACTGG GCATTCCTGA TCCAACAGAA TAA
|
Protein sequence | MPVLHNRISN DALKAKMLAE SEPRTTISFY KYFHIADPKA TRDALYQLFT ALNVFGRVYL AHEGINAQIS VPASNVKTFR AQLYAFDSAL DGLRLNIALD DDGKSFWVLR MKVRDRIVAD GIDDPHFDAS NVGEYLQAAE VNAMLDDPDA LFIDMRNHYE YEVGHFENAL EIPADTFREQ LPKAVEMMQA HKDKKIVMYC TGGIRCEKAS AWMKHNGFNK VWHIEGGIIE YARKAREQGL PVRFIGKNFV FDERMGERIS DEIIAHCHQC GAPCDSHTNC KNDGCHLLFI QCPVCAEKYK GCCSEICCEE SALPPEEQRR RRAGRENGNK IFNKSRGRLN TTLGIPDPTE
|
| |