Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1122 |
Symbol | mazG |
ID | 4240623 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1259016 |
End bp | 1260005 |
Gene Length | 990 bp |
Protein Length | 329 aa |
Translation table | 11 |
GC content | 33% |
IMG OID | 638104685 |
Product | nucleoside triphosphate pyrophosphohydrolase |
Protein accession | YP_719334 |
Protein GI | 113461265 |
COG category | [R] General function prediction only |
COG ID | [COG3956] Protein containing tetrapyrrole methyltransferase domain and MazG-like (predicted pyrophosphatase) domain |
TIGRFAM ID | [TIGR00444] MazG family protein |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 1 |
Plasmid unclonability p-value | 0.000000252195 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | GTGTTTATAC ATAATTGCCT CCTTAGCTTA ATGAATTTAA CTAACTGTTG GAACTATACT AGCATTGATA AAAATAAAAA AATTTGCACC AGATCACGTT TTAAAAAATT TTTCATTTTT TTGTTCATTC AAGGCTATGA TATCCTTATT TCAAATACTA TTTATTTCAT GAGATTATTT ATGAGCTACC ACATTCAAGA TTTTATTCGT ATCATTGCCC AATTACGAGA TCCTAATAAT GGTTGTCCTT GGGATTTAAA GCAAAATTAT CGTTCGATGA TCACTTGTTT AATTGAAGAA ACTTATGAGG TCATTGAAGC AATAGAGCAA AATAACATGT CTGATCTGAA AGAAGAATTA GGTGATTTGC TCTTACAAGT TGTATTTTTT AGCCAACTTG CTACGGAAGA TCAATATTTT ACTTTTGATG ACGTAGTACA TACCGTTACA GAAAAAATCT TGCGTCGCCA CCCACACGTA TTCGGAGAAC AAAAAGCCAG TAATGCAGAC GAGGCATTAG AAAATTGGAA CAAAGCAAAA GCAAGTGAAT ATATTCAAAA AGGACATCAA TCAATCTTAG ATAATATTCC CCATGCTTTT CCGGCATTGA AGCGTGCCGA AAAATTACAA AAACGCTGTG CCAAAATAGG TTTTGACTGG AATAATGTCA ATCCTGTGAT AGCTAAAGTA CAAGAGGAAT TAGAAGAAGT TAAACAGGAA TATCAAACAA ATCCTATCAA CCAAACAAAA ATTGAAGAAG AAATCGGCGA TCTTCTTTTT GCGGTAGTAA ATTTAAGTCG TCATCTAAAA TGTAATCCTG AAGAAAGCTT GCGTAAAGCC AATAAAAAAT TTGAACGCCG TTTTCGTGCC GTCGAGCAAA AATTGCGTGA AGCACACAAA ACCTTCGAAA ATACCTCTTT AACTGAAATG GATATTTTTT GGGATGAAGT TAAACGAGAT GAAATCACAC AAACTACGCA AGGAAAATAA
|
Protein sequence | MFIHNCLLSL MNLTNCWNYT SIDKNKKICT RSRFKKFFIF LFIQGYDILI SNTIYFMRLF MSYHIQDFIR IIAQLRDPNN GCPWDLKQNY RSMITCLIEE TYEVIEAIEQ NNMSDLKEEL GDLLLQVVFF SQLATEDQYF TFDDVVHTVT EKILRRHPHV FGEQKASNAD EALENWNKAK ASEYIQKGHQ SILDNIPHAF PALKRAEKLQ KRCAKIGFDW NNVNPVIAKV QEELEEVKQE YQTNPINQTK IEEEIGDLLF AVVNLSRHLK CNPEESLRKA NKKFERRFRA VEQKLREAHK TFENTSLTEM DIFWDEVKRD EITQTTQGK
|
| |