Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | ECH74115_4479 |
Symbol | |
ID | 6968491 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli O157:H7 str. EC4115 |
Kingdom | Bacteria |
Replicon accession | NC_011353 |
Strand | + |
Start bp | 4147713 |
End bp | 4148720 |
Gene Length | 1008 bp |
Protein Length | 335 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 643388194 |
Product | hypothetical protein |
Protein accession | YP_002272631 |
Protein GI | 209398705 |
COG category | [C] Energy production and conversion |
COG ID | [COG2141] Coenzyme F420-dependent N5,N10-methylene tetrahydromethanopterin reductase and related flavin-dependent oxidoreductases |
TIGRFAM ID | [TIGR03558] luciferase family oxidoreductase, group 1 |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 73 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTGATA AAACCATTGC GTTTTCGCTA CTCGATCTGG CCCCCATTCC CGAAGGTTCT TCAGCGCGAG ATGCATTCTC CCACTCTCTC GATCTCGCCC GTCTGGCTGA AAAGCGCGGC TATCATCGCT ACTGGCTGGC AGAACACCAC AATATGACTG GCATTGCCAG TGCTGCCACG TCGGTATTGA TTGGCTATCT GGCGGCGAAT ACCACCACGC TGCATCTGGG GTCTGGCGGC GTGATGTTGC CTAACCACTC ACCGTTGGTC ATTGCAGAAC AGTTCGGCAC GCTCAATACA CTCTATCCGG GGCGAATCGA TTTGGGGCTG GGTCGTGCTC CGGGTAGTGA CCAGCGAACC ATGATGGCGC TACGTCGTCA TATGAGCGGG GATATTGATA ATTTCCCCCG CGATGTCGCG GAGCTGGTGG CCTGGTTTGA CGCCCGCGAT CCCAATCCGC ATGTGCGCCC GGTACCAGGC TACGGCGAGC AAATCCCCGT GTGGTTGTTA GGCTCCAGCC TTTACAGCGC GCAACTGGCG GCGCAGCTTG GTCTGCCGTT TGCGTTTGCC TCACACTTCG CGCCGGATAT GTTGTTCCAG GCGCTGCATC TTTATCGCAG CAACTTCAAA CCGTCGGCAC GACTGGAAAA ACCATACGCG ATGGTGTGCA TCAATATTAT CGCTGCCGAC AGCAACCGCG ACGCTGAATT TCTGTTTACC TCAATGCAGC AAGCCTTTGT GAAGCTGCGC CGTGGCGAAA CCGGGCAACT GCCGCCGCCG ATTCAAAATA TGGATCAGTT CTGGTCACCG TCTGAGCAGT ATGGCGTGCA ACAGGCGCTG AGTATGTCGC TGGTGGGCGA TAAAGCGAAA GTGCGTCATG GTTTGCAGTC GATCCTGCGC GAAACCGACG CCGATGAGAT TATGGTTAAC GGGCAGATTT TCGACCACCA GGCGCGCCTG CATTCGTTTG AGCTGGCGAT GGATGTTAAG GAAGAGTTGT TGGGATAG
|
Protein sequence | MTDKTIAFSL LDLAPIPEGS SARDAFSHSL DLARLAEKRG YHRYWLAEHH NMTGIASAAT SVLIGYLAAN TTTLHLGSGG VMLPNHSPLV IAEQFGTLNT LYPGRIDLGL GRAPGSDQRT MMALRRHMSG DIDNFPRDVA ELVAWFDARD PNPHVRPVPG YGEQIPVWLL GSSLYSAQLA AQLGLPFAFA SHFAPDMLFQ ALHLYRSNFK PSARLEKPYA MVCINIIAAD SNRDAEFLFT SMQQAFVKLR RGETGQLPPP IQNMDQFWSP SEQYGVQQAL SMSLVGDKAK VRHGLQSILR ETDADEIMVN GQIFDHQARL HSFELAMDVK EELLG
|
| |