Gene HS_1667 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHS_1667 
SymbolmutY 
ID4241194 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHaemophilus somnus 129PT 
KingdomBacteria 
Replicon accessionNC_008309 
Strand
Start bp1895732 
End bp1896844 
Gene Length1113 bp 
Protein Length370 aa 
Translation table11 
GC content39% 
IMG OID638105253 
ProductA/G-specific DNA-adenine glycosylase 
Protein accessionYP_719872 
Protein GI113461803 
COG category[L] Replication, recombination and repair 
COG ID[COG1194] A/G-specific DNA glycosylase 
TIGRFAM ID[TIGR01084] A/G-specific adenine glycosylase 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGGCTC AATCCTCTAT TAACAATCCT TTTGCTTATA CGGTACTTAA ATGGTATCGG 
CAATTCGGGC GTAAAAACTT ACCTTGGCAG CAAAATAAAA CTCTATATGG TGTTTGGTTA
TCCGAGGTGA TGTTGCAACA AACTCAAGTA GCGACAGTTA TTCCTTATTT TGAACGTTTT
ATTAAAGTTT TTCCAAATAT TACCGCACTT GCCAATGCAC CTTTAGATGA AGTGCTACAC
TTATGGACAG GACTTGGTTA TTATGCAAGA GCTCGCAATT TGCATAGGGC TGCACAAACC
ATAAGAGATC AGTATCAAGG TGAATTTCCA ACGGATTTTC AGCACGTGTG GGCTTTACCC
GGTATTGGAC GTAGTACAGC TGGGGCGGTT TTATCTTCAG TACTTAATCA GCCTTATCCT
ATTTTGGACG GTAATGTAAA GCGAGTGCTG ACTCGTTATT TTCAAGTGCA AGGTTGGACT
GGTGATAAAA AAGTAGAGGA TAAGCTTTGG CAATTGAGTG CAGAGGTTAC CCCTACAGAG
CAGGTCGCAG ATTTTAATCA AGCTATGATG GACTTAGGTG CGATGGTTTG TACTCGTACA
AAACCTAAGT GCTTGTTGTG TCCATTAGCC ATAAAGTGCG GTGCTAATTT AAACAATAAT
TGGGTAGATT TTCCGTCTAA AAAGCCGAAG AAATCGTTAC CTGAGAGAAA AAGTTATTTT
CTGATTTTGG AGAATCAAGG TAAGGTTGCT TTAGAACAAC GCCCTATTTC AGGACTTTGG
GGCGGATTAT ATTGTTTTCC GCAGTTTGAT ACCCTGACTG AATTATTGGC TTATCTTTCT
CAGCAAGGTA TTCAACAATA TCAACAATGG ACGGCATTTC GTCATACATT CAGCCATTTT
CATTTAGATA TTTATCCAAT TTATGCACAA ATACAAACTC AGGAAGTAGA ATTTGATCGC
ACAGATTGGA AAAAAATTGC AGAAAATAAC GTAGAATATG GTTCTCCTAT ATCAAGTGCG
GTCAAATATT GGTATGATCC TACCAATCCA AGCCAAATTG GTTTGGCTGT GCCGGTTAAA
AATTTATTGA TCGAATTTCA AAAAAGGAAA TAA
 
Protein sequence
MQAQSSINNP FAYTVLKWYR QFGRKNLPWQ QNKTLYGVWL SEVMLQQTQV ATVIPYFERF 
IKVFPNITAL ANAPLDEVLH LWTGLGYYAR ARNLHRAAQT IRDQYQGEFP TDFQHVWALP
GIGRSTAGAV LSSVLNQPYP ILDGNVKRVL TRYFQVQGWT GDKKVEDKLW QLSAEVTPTE
QVADFNQAMM DLGAMVCTRT KPKCLLCPLA IKCGANLNNN WVDFPSKKPK KSLPERKSYF
LILENQGKVA LEQRPISGLW GGLYCFPQFD TLTELLAYLS QQGIQQYQQW TAFRHTFSHF
HLDIYPIYAQ IQTQEVEFDR TDWKKIAENN VEYGSPISSA VKYWYDPTNP SQIGLAVPVK
NLLIEFQKRK