Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2586 |
Symbol | |
ID | 6145326 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2640784 |
End bp | 2641683 |
Gene Length | 900 bp |
Protein Length | 299 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 641617457 |
Product | dyp-type peroxidase family protein |
Protein accession | YP_001744622 |
Protein GI | 170680598 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01413] Dyp-type peroxidase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTCTCAGG TTCAGAGTGG CATTTTGCCA GAACATTGCC GCGCGGCGAT TTGGATCGAA GCCAACGTGA AAGGGGAAGT TGACGCCCTG CGTGCGGCCA GTAAAACATT TGCCGACAAA CTGGCAACTT TTGAAGCGAA ATTCCCGGAC GCGCATCTTG GTGCGGTGGT TGCCTTTGGT AACAATACCT GGCGCGCTCT GAGCGGCGGC GTTGGGGCCG AAGAGCTGAA AGATTTTCCG GGCTACGGTA AAGGTCTTGC ACCGACGACC CAGTTCGATG TGTTGATCCA CATTCTTTCT CTGCGTCACG ACGTGAACTT CTCTGTCGCT CAGGCGGCGA TGGAAGCCTT TGGTGACTGC ATTGAAGTGA AAGAAGAGAT CCACGGCTTC CGTTGGGTTG AAGAGCGTGA CCTGAGCGGC TTTGTTGACG GTACAGAAAA CCCGGCGGGT GAAGAGACGC GCCGCGAAGT GGCTGTTATC AAAGACGGCG TGGATGCGGG CGGCAGCTAT GTGTTTGTGC AGCGTTGGGA GCACAACCTG AAACAGCTCA ATCGGATGAG CGTTCACGAT CAAGAGATGA TGATCGGGCG CACCAAAGAG GCCAACGAAG AGATTGACGG CGACGAACGT CCGGAAACCT CTCACCTGAC CCGCGTTGAT CTGAAAGAAG ATGGCAAAGG GCTGAAGATT GTTCGTCAGA GCCTGCCGTA CGGCACCGCC AGTGGCACTC ACGGTCTGTA CTTCTGCGCC TACTGCGCGC GTCTGCATAA CATTGAGCAG CAACTGCTGA GCATGTTTGG CGATACCGAT GGTAAGCGTG ATGCGATGTT GCGTTTCACC AAACCGGTAA CCGGCGGCTA TTACTTTGCG CCGTCGCTGG ACAAGTTGAT GGCGTTGTAA
|
Protein sequence | MSQVQSGILP EHCRAAIWIE ANVKGEVDAL RAASKTFADK LATFEAKFPD AHLGAVVAFG NNTWRALSGG VGAEELKDFP GYGKGLAPTT QFDVLIHILS LRHDVNFSVA QAAMEAFGDC IEVKEEIHGF RWVEERDLSG FVDGTENPAG EETRREVAVI KDGVDAGGSY VFVQRWEHNL KQLNRMSVHD QEMMIGRTKE ANEEIDGDER PETSHLTRVD LKEDGKGLKI VRQSLPYGTA SGTHGLYFCA YCARLHNIEQ QLLSMFGDTD GKRDAMLRFT KPVTGGYYFA PSLDKLMAL
|
| |