Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Haur_0849 |
Symbol | |
ID | 5732750 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Herpetosiphon aurantiacus ATCC 23779 |
Kingdom | Bacteria |
Replicon accession | NC_009972 |
Strand | - |
Start bp | 958523 |
End bp | 959668 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 641277981 |
Product | MazG family protein |
Protein accession | YP_001543625 |
Protein GI | 159897378 |
COG category | [R] General function prediction only |
COG ID | [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain |
TIGRFAM ID | [TIGR00444] MazG family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00848879 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTTAAGCC AACTTCTTAC GCAATTGCCA CTTGATTTGA GCCAAGGCTT TCAAGTTGTG CCCGCTCATA AATTACTAGC CCCAATTCCG GCTCCTGCCG AGGGCGCTGA TCGGGCTTGG TGCGAATTAC AAAATATTGC CGAATACCCA AGCTTCGTCA GCCCACTACC GTTTCAAGCC ACTCAAGCCC TGATCATTAC TGAAATCGAT GTAAGCAAGC TAGCAACGGT TCGAGCCAGT TTATTGCGGC GCTACCCTGC CGAACATCCT GTGCATAACT TAAATGAAAC TGGCCTGAGC CAACAAACCC TCGCCACTGC TAGTAGTGCT CAAGCTTGGT ATCTGCCAGC ACTCAGCATC GAAACCGATG TGGCAAGCCC AAGCACTTTA GAATGGATTA TGGCGCGTTT GGCGGGGCCA CACGGCTGCC CATGGGATCG CAAACAAACG CATGCGAGTT TACGTGAATT TTTGCTCGAA GAAACCCATG AAACGCTCGA AGCCCTTGAT GCCGAAGATT GGCCTAATCT CAAAGAAGAA TTGGGCGATT TATTATTGCA AATTGTCTTT CATGCCGAGT TTGGTCGCCA AGCAGGCCGC TTCAACCTTG ATCAGGTCTA TACAGCGATT AACAGCAAGC TCATTCGCCG CCATCCGCAT ATTTTTGGCA CAACCGAGGT TAGCGATGCC GACGAAGTAT TACGCAACTG GGATGCGATT AAGGCAACCG AGCATCAGGA AAAAGGCAGC CAACGTGAGA GTGCGCTCGA TGGGATTGCC AAAACCCTGC CGCCGCTGGC AACCGCCCAA CTCATTGGCA AAAAAGCCGC CAAAGTTGGC TTCGACTGGC CCGATGTTAG CGGAGTTTGG GCCAAAGTCC ATGAAGAAAT TGCTGAATTG CAAGCCGCCA CTAGCCCTGA AGAACAAGCC GCTGAGTTTG GTGATGTGCT TTTTGCCCTA ACCAATCTTG CTCGTTGGCT CAAAATTGAT TCAGAAAGCG CCTTACGCGG CACGATCACC AAATTCCGCC GCCGTTTCGT GGCGGTGGAG CAGGCCGCCC AAGCCCAAGG CCGCCAACTT AGTCAACTCA GCCTAAGCGA AGCCGACACG CTCTGGGAAG CCGCCAAACG AGCTGAGAAA CAATAA
|
Protein sequence | MLSQLLTQLP LDLSQGFQVV PAHKLLAPIP APAEGADRAW CELQNIAEYP SFVSPLPFQA TQALIITEID VSKLATVRAS LLRRYPAEHP VHNLNETGLS QQTLATASSA QAWYLPALSI ETDVASPSTL EWIMARLAGP HGCPWDRKQT HASLREFLLE ETHETLEALD AEDWPNLKEE LGDLLLQIVF HAEFGRQAGR FNLDQVYTAI NSKLIRRHPH IFGTTEVSDA DEVLRNWDAI KATEHQEKGS QRESALDGIA KTLPPLATAQ LIGKKAAKVG FDWPDVSGVW AKVHEEIAEL QAATSPEEQA AEFGDVLFAL TNLARWLKID SESALRGTIT KFRRRFVAVE QAAQAQGRQL SQLSLSEADT LWEAAKRAEK Q
|
| |