Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2104 |
Symbol | |
ID | 6142769 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2112481 |
End bp | 2113752 |
Gene Length | 1272 bp |
Protein Length | 423 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641616980 |
Product | Tat-translocated enzyme |
Protein accession | YP_001744155 |
Protein GI | 170680570 |
COG category | [P] Inorganic ion transport and metabolism |
COG ID | [COG2837] Predicted iron-dependent peroxidase |
TIGRFAM ID | [TIGR01412] Tat-translocated enzyme [TIGR01413] Dyp-type peroxidase family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 52 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAGTATG AAGATGAAAA CGGCGTGAAT GAACCGTCAC GCCGACGTTT ACTGAAAGGG ATTGGTGCGT TGGCGCTGGC GGGAAGTTGT CCGGTCGCTC ATGCACAAAA AACGCAAAGT GCACCGGGCA CGCTTTCACC GGATGCGCGC AGTGAAGCGC AGCCGTTTTA TGGCGAACAT CAGGCGGGTA TTCTGACGCC GCAGCAGGCG GCAATGATGC TGGTGGCGTT TGATGTGCTT GCCAGCGATA AAACCGATCT TGAACGGTTG TTTCGCTTGT TGACTCAGCG TTTTGCTTTT CTGACTCAGG GCGGAGCGGC ACCAGAAACG CCAAATCCGC GCCTGCCACC ACTCGATTCC GGCATTCTTG GCGGCTACAT TGCGTCCGAT AATCTCACCA TAACGTTATC GGTGGGCCAT TCATTGTTTG ATGAGCGGTT TGGCCTTACA CCGCAGATGC CGAAAAAGTT GCAGAAAATG ACGCGTTTCC CCAACGACTC GCTGGATGCG GCGTTATGTC ATGGTGATGT GTTGCTACAA ATTTGCGCCA ACACCCAGGA CACGGTGATC CATGCGCTGC GCGATATCAT CAAACACACG CCGGATTTGC TCAGCGTGCG CTGGAAGCGG GAAGGGTTTA TTTCCGATCA CGCAGCGCGT AGTAAAGGCA AAGAGACGCC GATAAATTTG CTGGGTTTTA AAGACGGCAC AGCCAATCCC GATAGCCAGA ATGCGAAGTT GATGCAAAAA GTGGTGTGGG TGACGGCAGA TCAGCAGGAG CCTGCGTGGA CAATCGGTGG CAGCTATCAG GCGGTGCGGT TGATTCAGTT TCGGGTGGAG TTTTGGGACA GAACGCCACT GAAAGAACAG CAGACGATTT TTGGTCGCGA CAAACAAACT GGTGCGCCGC TGGGCATGCA GCACGAGCAT GATGTTCCCG ATTACGCCAG CGATCCGGAA GGGAAGGTGA TCGCGCTGGA CAGCCATATT CGGCTGGCGA ATCCCCGCAC GCCAGAGAGT GAGTCAAGTC TGATGCTGCG TCGTGGCTAC AGTTATTCAC TGGGCGTCAC CAACTCCGGG CAACTCGATA TGGGGCTACT GTTTGTCTGC TACCAACACG ATCTGGAAAA GGGCTTCCTG ACAGTACAAA AAAGGTTGAA TGGCGAGGCG CTGGAGGAAT ATGTCAAACC TATCGGTGGC GGCTATTTCT TTGCGCTGCC GGGTGTGAAG GACGCGAACG ATTATCTCGG AAGCGCATTA TTGCGGGTTT AA
|
Protein sequence | MQYEDENGVN EPSRRRLLKG IGALALAGSC PVAHAQKTQS APGTLSPDAR SEAQPFYGEH QAGILTPQQA AMMLVAFDVL ASDKTDLERL FRLLTQRFAF LTQGGAAPET PNPRLPPLDS GILGGYIASD NLTITLSVGH SLFDERFGLT PQMPKKLQKM TRFPNDSLDA ALCHGDVLLQ ICANTQDTVI HALRDIIKHT PDLLSVRWKR EGFISDHAAR SKGKETPINL LGFKDGTANP DSQNAKLMQK VVWVTADQQE PAWTIGGSYQ AVRLIQFRVE FWDRTPLKEQ QTIFGRDKQT GAPLGMQHEH DVPDYASDPE GKVIALDSHI RLANPRTPES ESSLMLRRGY SYSLGVTNSG QLDMGLLFVC YQHDLEKGFL TVQKRLNGEA LEEYVKPIGG GYFFALPGVK DANDYLGSAL LRV
|
| |