Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | HS_1667 |
Symbol | mutY |
ID | 4241194 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Haemophilus somnus 129PT |
Kingdom | Bacteria |
Replicon accession | NC_008309 |
Strand | + |
Start bp | 1895732 |
End bp | 1896844 |
Gene Length | 1113 bp |
Protein Length | 370 aa |
Translation table | 11 |
GC content | 39% |
IMG OID | 638105253 |
Product | A/G-specific DNA-adenine glycosylase |
Protein accession | YP_719872 |
Protein GI | 113461803 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1194] A/G-specific DNA glycosylase |
TIGRFAM ID | [TIGR01084] A/G-specific adenine glycosylase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCAGGCTC AATCCTCTAT TAACAATCCT TTTGCTTATA CGGTACTTAA ATGGTATCGG CAATTCGGGC GTAAAAACTT ACCTTGGCAG CAAAATAAAA CTCTATATGG TGTTTGGTTA TCCGAGGTGA TGTTGCAACA AACTCAAGTA GCGACAGTTA TTCCTTATTT TGAACGTTTT ATTAAAGTTT TTCCAAATAT TACCGCACTT GCCAATGCAC CTTTAGATGA AGTGCTACAC TTATGGACAG GACTTGGTTA TTATGCAAGA GCTCGCAATT TGCATAGGGC TGCACAAACC ATAAGAGATC AGTATCAAGG TGAATTTCCA ACGGATTTTC AGCACGTGTG GGCTTTACCC GGTATTGGAC GTAGTACAGC TGGGGCGGTT TTATCTTCAG TACTTAATCA GCCTTATCCT ATTTTGGACG GTAATGTAAA GCGAGTGCTG ACTCGTTATT TTCAAGTGCA AGGTTGGACT GGTGATAAAA AAGTAGAGGA TAAGCTTTGG CAATTGAGTG CAGAGGTTAC CCCTACAGAG CAGGTCGCAG ATTTTAATCA AGCTATGATG GACTTAGGTG CGATGGTTTG TACTCGTACA AAACCTAAGT GCTTGTTGTG TCCATTAGCC ATAAAGTGCG GTGCTAATTT AAACAATAAT TGGGTAGATT TTCCGTCTAA AAAGCCGAAG AAATCGTTAC CTGAGAGAAA AAGTTATTTT CTGATTTTGG AGAATCAAGG TAAGGTTGCT TTAGAACAAC GCCCTATTTC AGGACTTTGG GGCGGATTAT ATTGTTTTCC GCAGTTTGAT ACCCTGACTG AATTATTGGC TTATCTTTCT CAGCAAGGTA TTCAACAATA TCAACAATGG ACGGCATTTC GTCATACATT CAGCCATTTT CATTTAGATA TTTATCCAAT TTATGCACAA ATACAAACTC AGGAAGTAGA ATTTGATCGC ACAGATTGGA AAAAAATTGC AGAAAATAAC GTAGAATATG GTTCTCCTAT ATCAAGTGCG GTCAAATATT GGTATGATCC TACCAATCCA AGCCAAATTG GTTTGGCTGT GCCGGTTAAA AATTTATTGA TCGAATTTCA AAAAAGGAAA TAA
|
Protein sequence | MQAQSSINNP FAYTVLKWYR QFGRKNLPWQ QNKTLYGVWL SEVMLQQTQV ATVIPYFERF IKVFPNITAL ANAPLDEVLH LWTGLGYYAR ARNLHRAAQT IRDQYQGEFP TDFQHVWALP GIGRSTAGAV LSSVLNQPYP ILDGNVKRVL TRYFQVQGWT GDKKVEDKLW QLSAEVTPTE QVADFNQAMM DLGAMVCTRT KPKCLLCPLA IKCGANLNNN WVDFPSKKPK KSLPERKSYF LILENQGKVA LEQRPISGLW GGLYCFPQFD TLTELLAYLS QQGIQQYQQW TAFRHTFSHF HLDIYPIYAQ IQTQEVEFDR TDWKKIAENN VEYGSPISSA VKYWYDPTNP SQIGLAVPVK NLLIEFQKRK
|
| |