Gene Information |
Locus tag | EcSMS35_4168 |
Symbol | |
ID | 6146761 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 4267516 |
End bp | 4268709 |
Gene Length | 1194 bp |
Protein Length | 397 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 641618991 |
Product | putative uroporphyrinogen III C-methyltransferase |
Protein accession | YP_001746119 |
Protein GI | 170682020 |
COG category | [H] Coenzyme transport and metabolism |
COG ID | [COG2959] Uncharacterized enzyme of heme biosynthesis |
TIGRFAM ID | |
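
The coordinate, length, and GC fields above are related by simple arithmetic, which the sketch below checks. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends with a single stop codon that is not counted in the protein length. The function names (`cds_length`, `expected_protein_length`, `gc_percent`) are hypothetical helpers for illustration and not part of any database API.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


def gc_percent(sequence: str) -> float:
    """GC percentage of a nucleotide string; spaces used for grouping are ignored."""
    seq = sequence.replace(" ", "").upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Values taken from the record above.
length = cds_length(4267516, 4268709)
print(length)                           # 1194, matching "Gene Length"
print(expected_protein_length(length))  # 397, matching "Protein Length"
```

`gc_percent` can be applied to the space-grouped "Gene sequence" field as is, and its result compared with the rounded "GC content" value.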
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 0.524316 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 43 |
Fosmid unclonability p-value | 0.397242 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGGAAC AAGAAAAAAC CTCCGCCGTG GTTGAAGAGA CCAGGGAGGC CGTGGACACC ACGTCACAAC CTGTCGCAAC AGAAAAAAAG AGTAAGAACA ATACCGCATT GATTCTCAGC GCGGTGGCTA TCGCTATTGC TCTGGCGGCG GGCGTCGGTT TGTATGGCTG GGGTAAACAA CAGGCCGTCA ATCAGACTGC CACCAGCGAT GCCCTGGCTA ACCAACTTAC TGCACTGCAA AAAGCCCAGG AGAGCCAAAA AGCCGAGCTG GAAGGCATTA TCAAGCAACA AGCTGTACAG CTTGAGCAGG CGAATCGTCA GCAAGAAACG CTGGCAAAAC AGCTGGATGA AGTCCAACAA AAGGTCGCCA CCATTTCCGG CAGCGATGCT AAAACCTGGC TGCTGGCTCA GGCTGATTTC CTGGTGAAAC TCGCCGGACG GAAGCTGTGG AGCGATCAGG ACGTCACGAC CGCTGCAGCG TTGCTGAAAA GTGCAGACGC CAGCCTGGCG GATATGAATG ACCCGAGTCT GATTACCGTT CGCCGGGCAA TTACCGATGA TATCGCCAGC CTTTCTGCGG TATCGCAGGT GGATTATGAC GGCATTATCC TTAAGCTTAA TCAGCTTTCA AATCAGGTAG ATAACCTGCG TCTTGCTGAT AATGATAGCG ATGGTTCGCC GATGGATTCC GACGGTGAAG AGCTTTCCAG TTCCATCAGC GAATGGCGTA TCAATCTGCA AAAAAGCTGG CAGAACTTTA TGGACAACTT CATTACGATT CGCCGTCGTG ATGACACCGC CGTACCGCTG TTAGCGCCAA ATCAGGATAT CTATCTGCGC GAAAATATTC GCTCTCGCCT GCTGGTCGCA GCACAAGCTG TACCGCGTCA CCAGGAAGAG ACTTATCGCC AGGCGCTGGA GAACGTCTCC ACCTGGGTAC GTGCTTACTA CGATACTGAT GATGCCACCA CCAAAGCGTT CCTCGACGAG GTGGACCAGT TAAGCCAGCA AAATATCTCG ATGGATCTTC CGGAAACCCT GCAAAGCCAG GCGATGCTGG AAAAATTGAT GCAGACCCGC GTGCGTAACC TGCTGGCACA ACCGGCAGCA GGGACAACGG AAGCTAAACC TGCACCTGCA CCTGCACCTG CACCGCAAGC TGATACTCCG GCAGCCGCGC CGCAAGGAGA ATAA
|
Protein sequence | MTEQEKTSAV VEETREAVDT TSQPVATEKK SKNNTALILS AVAIAIALAA GVGLYGWGKQ QAVNQTATSD ALANQLTALQ KAQESQKAEL EGIIKQQAVQ LEQANRQQET LAKQLDEVQQ KVATISGSDA KTWLLAQADF LVKLAGRKLW SDQDVTTAAA LLKSADASLA DMNDPSLITV RRAITDDIAS LSAVSQVDYD GIILKLNQLS NQVDNLRLAD NDSDGSPMDS DGEELSSSIS EWRINLQKSW QNFMDNFITI RRRDDTAVPL LAPNQDIYLR ENIRSRLLVA AQAVPRHQEE TYRQALENVS TWVRAYYDTD DATTKAFLDE VDQLSQQNIS MDLPETLQSQ AMLEKLMQTR VRNLLAQPAA GTTEAKPAPA PAPAPQADTP AAAPQGE