Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_3821 |
Symbol | |
ID | 6144046 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 3888427 |
End bp | 3889824 |
Gene Length | 1398 bp |
Protein Length | 465 aa |
Translation table | 11 |
GC content | 48% |
IMG OID | 641618647 |
Product | di-haem cytochrome c peroxidase family protein |
Protein accession | YP_001745787 |
Protein GI | 170680372 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG1858] Cytochrome c peroxidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 36 |
Fosmid unclonability p-value | 0.057161 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATGG TCTCACGTAT TACCGCGATC GGCCTGGCTG GCGTCGCGAT TTGCTATTTA GGGTTATCTG GTTATGTGTG GTACCACGAT AATAAACGCA GTAAGCAGGC CGATGTTCAG GCATCTGCTG TCAGTGAAAA TAATCAGGTT TTAGGTTTTC TTCGCGAAAA AGGATGCGAC TATTGCCATA CGCCTTCGGC AGAATTACCC GCCTATTATT ATATTCCTGG CGCGAAACAG TTGATGGATT ACGACATTAA GCTTGGATAT AAATCTTTTA ACCTCGAGGC CGTGCGTGCG GCTCTGCTGG CTGATAAACC CGTTTCGCAA AGCGATCTGA ATAAGATTGA ATGGGTAATG CAGTATGAAA CCATGCCACC AACGCGTTAT ACCGCGCTAC ACTGGGCGGG TAAGGTGAGT GATGAAGAGC GGGCGGAAAT ACTCGCCTGG ATTGCAAAAC AACGCGCAGA ATATTACGCC AGTAATGATA CTGCTCCGGA GCATCGCAAT GAACCGGTGC AGCCCATCCC GCAAAAACTG CCTACCGATG CGCAAAAAGT GGCGTTGGGC TTTGCGCTGT ATCACGATCC CCGTTTATCG GCTGATAGCA CCATTTCATG CGCTCATTGC CATGCGTTGA ATGCGGGGGG CGTCGATGGC AGAAAAACAT CGATTGGTGT TGGTGGCGCA GTTGGGCCGA TTAACGCGCC GACGGTATTT AACTCAGTAT TTAACGTTGA GCAGTTCTGG GATGGTCGTG CGGCAACATT GCAGGATCAG GCGGGTGGAC CGCCGTTGAA CCCGATTGAA ATGGCGTCGA AATCCTGGGA CGAAATTATT GCTAAGCTGG AAAAAGATCC GCAGCTTAAA GCGCAGTTCC TCGAAGTCTA TCCGCAAGGT TTCAGCGGTG AAAATATTAC TGATGCCATT GCTGAATTTG AAAAAACATT AATTACGCCG GATTCCCCAT TTGATAAATG GTTGCGTGGA GATGAGAATG CGCTGACGGC ACAACAGAAA AAAGGCTATC AATTATTTAA AGATAATAAA TGTGCAACTT GTCATGGTGG TATTATTCTC GGCGGACGTT CCTTCGAACC GTTGGGGCTG AAAAAAGACT TTAACTTTGG TGAAATTACG GCGGCGGATA TTGGTCGTAT GAATGTGACT AAAGAAGAGC GCGATAAATT GCGTCAGAAA GTACCCGGTT TACGTAACGT TGCACTAACG GCACCGTACT TCCATCGCGG TGATGTGCCG ACGCTGGACG GGGCGGTGAA ACTGATGTTG CGCTATCAGG TAGGCAAAGA GCTGCCGCAG GAGGATGTGG ATGATATCGT AGCTTTCCTG CACAGTCTGA ACGGGGTTTA CACGCCGTAT ATGCAGGATA AACAATAA
|
Protein sequence | MKMVSRITAI GLAGVAICYL GLSGYVWYHD NKRSKQADVQ ASAVSENNQV LGFLREKGCD YCHTPSAELP AYYYIPGAKQ LMDYDIKLGY KSFNLEAVRA ALLADKPVSQ SDLNKIEWVM QYETMPPTRY TALHWAGKVS DEERAEILAW IAKQRAEYYA SNDTAPEHRN EPVQPIPQKL PTDAQKVALG FALYHDPRLS ADSTISCAHC HALNAGGVDG RKTSIGVGGA VGPINAPTVF NSVFNVEQFW DGRAATLQDQ AGGPPLNPIE MASKSWDEII AKLEKDPQLK AQFLEVYPQG FSGENITDAI AEFEKTLITP DSPFDKWLRG DENALTAQQK KGYQLFKDNK CATCHGGIIL GGRSFEPLGL KKDFNFGEIT AADIGRMNVT KEERDKLRQK VPGLRNVALT APYFHRGDVP TLDGAVKLML RYQVGKELPQ EDVDDIVAFL HSLNGVYTPY MQDKQ
|
| |