Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | EcSMS35_2919 |
Symbol | mazG |
ID | 6144592 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Escherichia coli SMS-3-5 |
Kingdom | Bacteria |
Replicon accession | NC_010498 |
Strand | - |
Start bp | 2993145 |
End bp | 2993936 |
Gene Length | 792 bp |
Protein Length | 263 aa |
Translation table | 11 |
GC content | 51% |
IMG OID | 641617788 |
Product | nucleoside triphosphate pyrophosphohydrolase |
Protein accession | YP_001744943 |
Protein GI | 170682107 |
COG category | [R] General function prediction only |
COG ID | [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain |
TIGRFAM ID | [TIGR00444] MazG family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.00184519 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 48 |
Fosmid unclonability p-value | 0.79883 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATCAAA TCGACCGTTT GCTCACTATT ATGCAGCGCC TGCGCGATCC GGAAAACGGC TGCCCGTGGG ATAAAGAGCA GACATTTGCC ACCATTGCGC CTTACACCCT TGAAGAAACC TACGAGGTGC TGGACGCCAT CGCCCGTGAA GATTTTGACG ATCTGCGCGG TGAACTGGGC GATCTGCTGT TCCAGGTGGT GTTTTACGCA CAAATGGCTC AGGAAGAAGG GCGCTTTGAC TTTAATGATA TTTGCGCTGC TATTAGCGAT AAATTAGAGC GTCGCCATCC GCATGTTTTT GCTGACAGTT CTGCCGAAAA CAGTAGTGAA GTGCTCGCCC GTTGGGAGCA AATCAAAACC GAAGAACGCG CGCAGAAAGC ACAGCATTCG GCGCTGGATG ATATTCCTCG TAGTTTACCG GCTTTAATGC GTGCGCAAAA AATCCAGAAA CGTTGCGCCA ACGTTGGCTT CGACTGGACG ACGCTTGGTC CGGTAGTCGA TAAAGTCTAC GAAGAGATCG ACGAGGTGAT GTACGAGGCG CAGCAGGCTG TTGTCGACCA GGCTAAACTG GAGGAGGAAA TGGGTGACCT GCTGTTTGCC ACGGTCAATC TGGCTCGTCA TTTAGGGACA AAAGCGGAAA TCGCATTGCA AAAAGCGAAC GAAAAATTCG AGCGTCGTTT TCGCGAAGTG GAGCGTATTG TTGCCGCGCG TGGACTGGAA ATGACAGGTG TTGACCTCGA AACAATGGAA GAAGTCTGGC AACAGGTAAA ACGGCAGGAA ATTGATCTCT AA
|
Protein sequence | MNQIDRLLTI MQRLRDPENG CPWDKEQTFA TIAPYTLEET YEVLDAIARE DFDDLRGELG DLLFQVVFYA QMAQEEGRFD FNDICAAISD KLERRHPHVF ADSSAENSSE VLARWEQIKT EERAQKAQHS ALDDIPRSLP ALMRAQKIQK RCANVGFDWT TLGPVVDKVY EEIDEVMYEA QQAVVDQAKL EEEMGDLLFA TVNLARHLGT KAEIALQKAN EKFERRFREV ERIVAARGLE MTGVDLETME EVWQQVKRQE IDL
|
| |