Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | P9303_29381 |
Symbol | mazG |
ID | 4776641 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Prochlorococcus marinus str. MIT 9303 |
Kingdom | Bacteria |
Replicon accession | NC_008820 |
Strand | + |
Start bp | 2598423 |
End bp | 2599421 |
Gene Length | 999 bp |
Protein Length | 332 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 640088462 |
Product | nucleoside triphosphate pyrophosphohydrolase |
Protein accession | YP_001018933 |
Protein GI | 124024626 |
COG category | [R] General function prediction only |
COG ID | [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain |
TIGRFAM ID | [TIGR00444] MazG family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | n/a |
Plasmid unclonability p-value | n/a |
Plasmid hitchhiking | n/a |
Plasmid clonability | n/a |
| |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCAGCT GCGGCTTAAT CCCCAACCAA TCCTGTCACT GCTGCTCGGG CTGGAAGCCA GCGTTTGGCT ACACCTCATT CTCGATGGCG ATCCCCTCCC CGCCGAATGG CATCGCTGGC GTCATCGATG AGAAGGTAGG CCAATACAAA CCAATGGCCA TGGACGCCGA ACAGCATCTG GCTCCCACAG AAGCCATCGC GGAGCTGGTC AACATCGTGG CCCAGCTCAG GGATCCAAAG GGAGGTTGCC CCTGGGATCT GGAGCAGACC CATACCTCCT TGATTCCATG CATGTTGGAA GAAGCCCACG AAGTGGCCGA CGCCATCCGC AACGGCGATG ACAACCACCT CAGCGAAGAA CTAGGGGACC TTCTGCTGCA GGTCGTGCTA CATGCTCAGA TCGCTAACGA AGAAGGACGC TTCAATCTTG AAGACATCGC CCGAAGCATC AGCGCAAAGC TGATTCGCCG ACACCCACAT GTGTTCGCAG AGGCAGTCGC AATCGACAGC GAAGCAGTTC GGCAAAGTTG GGAATCGATC AAAGCGAGCG AGCAACCCAG CTCAGCCTCT AAAAGTCCGC TAAGCGATCG TCTACGTAGC AAGGTCAGAG GTCAGCCAGC TCTGGCTGGA GCGATGGCCA TCTCCAAAAA GGTCGCGAAC GTTGGCTTCG AGTGGAACAC CATCGATGGA GTATGGGGAA AAGTGCAAGA AGAGTTCGAG GAACTCAAAG AGGCGGTAGA GCATGAAGAC CAAGCCCATG CACAAACAGA ACTTGGTGAT GTGCTGTTCA CTCTTGTGAA TGTTGCTCGC TGGTGCGGCC TAAACCCAGA AGAAGGCCTT GCAGGTACAA ACCAACGCTT TCTTGATCGC TTTTCTCGCG TTGAAGCAGC ACTCGAGGGC CAGCTAAGCG GCCAATCACT GACGGAACTA GAACAACTTT GGCAAGAAGC CAAAGCAGCA ATTCGAGAAG AGGCTGACGA CAAAAAGATA TCGAATTAA
|
Protein sequence | MSSCGLIPNQ SCHCCSGWKP AFGYTSFSMA IPSPPNGIAG VIDEKVGQYK PMAMDAEQHL APTEAIAELV NIVAQLRDPK GGCPWDLEQT HTSLIPCMLE EAHEVADAIR NGDDNHLSEE LGDLLLQVVL HAQIANEEGR FNLEDIARSI SAKLIRRHPH VFAEAVAIDS EAVRQSWESI KASEQPSSAS KSPLSDRLRS KVRGQPALAG AMAISKKVAN VGFEWNTIDG VWGKVQEEFE ELKEAVEHED QAHAQTELGD VLFTLVNVAR WCGLNPEEGL AGTNQRFLDR FSRVEAALEG QLSGQSLTEL EQLWQEAKAA IREEADDKKI SN
|
| |